[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"similar-safe-graph--DGFraud":3,"tool-safe-graph--DGFraud":64},[4,17,27,35,43,56],{"id":5,"name":6,"github_repo":7,"description_zh":8,"stars":9,"difficulty_score":10,"last_commit_at":11,"category_tags":12,"status":16},3808,"stable-diffusion-webui","AUTOMATIC1111\u002Fstable-diffusion-webui","stable-diffusion-webui 是一个基于 Gradio 构建的网页版操作界面，旨在让用户能够轻松地在本地运行和使用强大的 Stable Diffusion 图像生成模型。它解决了原始模型依赖命令行、操作门槛高且功能分散的痛点，将复杂的 AI 绘图流程整合进一个直观易用的图形化平台。\n\n无论是希望快速上手的普通创作者、需要精细控制画面细节的设计师，还是想要深入探索模型潜力的开发者与研究人员，都能从中获益。其核心亮点在于极高的功能丰富度：不仅支持文生图、图生图、局部重绘（Inpainting）和外绘（Outpainting）等基础模式，还独创了注意力机制调整、提示词矩阵、负向提示词以及“高清修复”等高级功能。此外，它内置了 GFPGAN 和 CodeFormer 等人脸修复工具，支持多种神经网络放大算法，并允许用户通过插件系统无限扩展能力。即使是显存有限的设备，stable-diffusion-webui 也提供了相应的优化选项，让高质量的 AI 艺术创作变得触手可及。",162132,3,"2026-04-05T11:01:52",[13,14,15],"开发框架","图像","Agent","ready",{"id":18,"name":19,"github_repo":20,"description_zh":21,"stars":22,"difficulty_score":23,"last_commit_at":24,"category_tags":25,"status":16},1381,"everything-claude-code","affaan-m\u002Feverything-claude-code","everything-claude-code 是一套专为 AI 编程助手（如 Claude Code、Codex、Cursor 等）打造的高性能优化系统。它不仅仅是一组配置文件，而是一个经过长期实战打磨的完整框架，旨在解决 AI 代理在实际开发中面临的效率低下、记忆丢失、安全隐患及缺乏持续学习能力等核心痛点。\n\n通过引入技能模块化、直觉增强、记忆持久化机制以及内置的安全扫描功能，everything-claude-code 能显著提升 AI 在复杂任务中的表现，帮助开发者构建更稳定、更智能的生产级 AI 代理。其独特的“研究优先”开发理念和针对 Token 消耗的优化策略，使得模型响应更快、成本更低，同时有效防御潜在的攻击向量。\n\n这套工具特别适合软件开发者、AI 研究人员以及希望深度定制 AI 工作流的技术团队使用。无论您是在构建大型代码库，还是需要 AI 协助进行安全审计与自动化测试，everything-claude-code 都能提供强大的底层支持。作为一个曾荣获 Anthropic 黑客大奖的开源项目，它融合了多语言支持与丰富的实战钩子（hooks），让 AI 真正成长为懂上",138956,2,"2026-04-05T11:33:21",[13,15,26],"语言模型",{"id":28,"name":29,"github_repo":30,"description_zh":31,"stars":32,"difficulty_score":23,"last_commit_at":33,"category_tags":34,"status":16},2271,"ComfyUI","Comfy-Org\u002FComfyUI","ComfyUI 是一款功能强大且高度模块化的视觉 AI 引擎，专为设计和执行复杂的 Stable Diffusion 图像生成流程而打造。它摒弃了传统的代码编写模式，采用直观的节点式流程图界面，让用户通过连接不同的功能模块即可构建个性化的生成管线。\n\n这一设计巧妙解决了高级 AI 绘图工作流配置复杂、灵活性不足的痛点。用户无需具备编程背景，也能自由组合模型、调整参数并实时预览效果，轻松实现从基础文生图到多步骤高清修复等各类复杂任务。ComfyUI 拥有极佳的兼容性，不仅支持 Windows、macOS 和 Linux 全平台，还广泛适配 NVIDIA、AMD、Intel 及苹果 Silicon 等多种硬件架构，并率先支持 SDXL、Flux、SD3 等前沿模型。\n\n无论是希望深入探索算法潜力的研究人员和开发者，还是追求极致创作自由度的设计师与资深 AI 绘画爱好者，ComfyUI 都能提供强大的支持。其独特的模块化架构允许社区不断扩展新功能，使其成为当前最灵活、生态最丰富的开源扩散模型工具之一，帮助用户将创意高效转化为现实。",107662,"2026-04-03T11:11:01",[13,14,15],{"id":36,"name":37,"github_repo":38,"description_zh":39,"stars":40,"difficulty_score":23,"last_commit_at":41,"category_tags":42,"status":16},3704,"NextChat","ChatGPTNextWeb\u002FNextChat","NextChat 是一款轻量且极速的 AI 助手，旨在为用户提供流畅、跨平台的大模型交互体验。它完美解决了用户在多设备间切换时难以保持对话连续性，以及面对众多 AI 模型不知如何统一管理的痛点。无论是日常办公、学习辅助还是创意激发，NextChat 都能让用户随时随地通过网页、iOS、Android、Windows、MacOS 或 Linux 端无缝接入智能服务。\n\n这款工具非常适合普通用户、学生、职场人士以及需要私有化部署的企业团队使用。对于开发者而言，它也提供了便捷的自托管方案，支持一键部署到 Vercel 或 Zeabur 等平台。\n\nNextChat 的核心亮点在于其广泛的模型兼容性，原生支持 Claude、DeepSeek、GPT-4 及 Gemini Pro 等主流大模型，让用户在一个界面即可自由切换不同 AI 能力。此外，它还率先支持 MCP（Model Context Protocol）协议，增强了上下文处理能力。针对企业用户，NextChat 提供专业版解决方案，具备品牌定制、细粒度权限控制、内部知识库整合及安全审计等功能，满足公司对数据隐私和个性化管理的高标准要求。",87618,"2026-04-05T07:20:52",[13,26],{"id":44,"name":45,"github_repo":46,"description_zh":47,"stars":48,"difficulty_score":23,"last_commit_at":49,"category_tags":50,"status":16},2268,"ML-For-Beginners","microsoft\u002FML-For-Beginners","ML-For-Beginners 是由微软推出的一套系统化机器学习入门课程，旨在帮助零基础用户轻松掌握经典机器学习知识。这套课程将学习路径规划为 12 周，包含 26 节精炼课程和 52 道配套测验，内容涵盖从基础概念到实际应用的完整流程，有效解决了初学者面对庞大知识体系时无从下手、缺乏结构化指导的痛点。\n\n无论是希望转型的开发者、需要补充算法背景的研究人员，还是对人工智能充满好奇的普通爱好者，都能从中受益。课程不仅提供了清晰的理论讲解，还强调动手实践，让用户在循序渐进中建立扎实的技能基础。其独特的亮点在于强大的多语言支持，通过自动化机制提供了包括简体中文在内的 50 多种语言版本，极大地降低了全球不同背景用户的学习门槛。此外，项目采用开源协作模式，社区活跃且内容持续更新，确保学习者能获取前沿且准确的技术资讯。如果你正寻找一条清晰、友好且专业的机器学习入门之路，ML-For-Beginners 将是理想的起点。",84991,"2026-04-05T10:45:23",[14,51,52,53,15,54,26,13,55],"数据工具","视频","插件","其他","音频",{"id":57,"name":58,"github_repo":59,"description_zh":60,"stars":61,"difficulty_score":10,"last_commit_at":62,"category_tags":63,"status":16},3128,"ragflow","infiniflow\u002Fragflow","RAGFlow 是一款领先的开源检索增强生成（RAG）引擎，旨在为大语言模型构建更精准、可靠的上下文层。它巧妙地将前沿的 RAG 技术与智能体（Agent）能力相结合，不仅支持从各类文档中高效提取知识，还能让模型基于这些知识进行逻辑推理和任务执行。\n\n在大模型应用中，幻觉问题和知识滞后是常见痛点。RAGFlow 通过深度解析复杂文档结构（如表格、图表及混合排版），显著提升了信息检索的准确度，从而有效减少模型“胡编乱造”的现象，确保回答既有据可依又具备时效性。其内置的智能体机制更进一步，使系统不仅能回答问题，还能自主规划步骤解决复杂问题。\n\n这款工具特别适合开发者、企业技术团队以及 AI 研究人员使用。无论是希望快速搭建私有知识库问答系统，还是致力于探索大模型在垂直领域落地的创新者，都能从中受益。RAGFlow 提供了可视化的工作流编排界面和灵活的 API 接口，既降低了非算法背景用户的上手门槛，也满足了专业开发者对系统深度定制的需求。作为基于 Apache 2.0 协议开源的项目，它正成为连接通用大模型与行业专有知识之间的重要桥梁。",77062,"2026-04-04T04:44:48",[15,14,13,26,54],{"id":65,"github_repo":66,"name":67,"description_en":68,"description_zh":69,"ai_summary_zh":69,"readme_en":70,"readme_zh":71,"quickstart_zh":72,"use_case_zh":73,"hero_image_url":74,"owner_login":75,"owner_name":76,"owner_avatar_url":77,"owner_bio":78,"owner_company":79,"owner_location":79,"owner_email":80,"owner_twitter":79,"owner_website":81,"owner_url":82,"languages":83,"stars":88,"forks":89,"last_commit_at":90,"license":91,"difficulty_score":92,"env_os":93,"env_gpu":94,"env_ram":94,"env_deps":95,"category_tags":103,"github_topics":104,"view_count":23,"oss_zip_url":79,"oss_zip_packed_at":79,"status":16,"created_at":125,"updated_at":126,"faqs":127,"releases":177},2393,"safe-graph\u002FDGFraud","DGFraud","A Deep Graph-based Toolbox for Fraud Detection","DGFraud 是一款专为欺诈检测打造的深度学习工具箱，核心基于图神经网络（GNN）技术。在金融风控、社交网络等场景中，欺诈行为往往隐蔽且相互关联，传统方法难以捕捉复杂的团伙作案特征。DGFraud 通过构建用户与交易之间的图谱关系，利用先进的图算法精准识别伪装成正常用户的欺诈者，有效解决了现有模型在面对“伪装欺诈”时效果不佳及数据不一致的痛点。\n\n该工具集成了多种业界领先的欺诈检测模型，支持开发者直接调用并进行横向对比实验，极大地降低了复现前沿算法的门槛。其独特亮点在于不仅提供了完整的模型实现，还针对欺诈检测中的特殊挑战（如对抗性攻击）进行了优化，并已在 TensorFlow 2.0 版本中完成升级，保持了技术的先进性。\n\nDGFraud 主要面向人工智能研究人员、数据科学家以及从事风控系统开发的工程师。如果你正在探索图学习在异常检测领域的应用，或需要构建高效的反欺诈系统，DGFraud 将是一个开源、灵活且经过学术验证的有力助手。社区欢迎各方贡献新的检测算法，共同推动欺诈检测技术的发展。","\u003Cp align=\"center\">\n    \u003Cbr>\n    \u003Ca href=\"https:\u002F\u002Fimage.flaticon.com\u002Ficons\u002Fsvg\u002F1671\u002F1671517.svg\">\n        \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fsafe-graph_DGFraud_readme_43ed16403baf.png\" width=\"400\"\u002F>\n    \u003C\u002Fa>\n    \u003Cbr>\n\u003Cp>\n\u003Cp align=\"center\">\n    \u003Ca href=\"https:\u002F\u002Ftravis-ci.org\u002Fgithub\u002Fsafe-graph\u002FDGFraud\">\n        \u003Cimg alt=\"PRs Welcome\" src=\"https:\u002F\u002Ftravis-ci.org\u002Fsafe-graph\u002FDGFraud.svg?branch=master\">\n    \u003C\u002Fa>\n    \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002FLICENSE\">\n        \u003Cimg alt=\"GitHub\" src=\"https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Flicense\u002Fsafe-graph\u002FDGFraud\">\n    \u003C\u002Fa>\n    \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fpulls\">\n        \u003Cimg alt=\"GitHub release\" src=\"https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fv\u002Frelease\u002Fsafe-graph\u002FDGFraud?include_prereleases\">\n    \u003C\u002Fa>\n    \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Farchive\u002Fmaster.zip\">\n        \u003Cimg alt=\"PRs\" src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FPRs-welcome-brightgreen.svg\">\n    \u003C\u002Fa>\n\u003C\u002Fp>\n\n\u003Ch3 align=\"center\">\n\u003Cp>A Deep Graph-based Toolbox for Fraud Detection\n\u003C\u002Fh3>\n\n**Introduction** \n\n**May 2021 Update:** The DGFraud has upgraded to TensorFlow 2.0! Please check out [DGFraud-TF2](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud-TF2)\n\n**DGFraud** is a Graph Neural Network (GNN) based toolbox for fraud detection. It integrates the implementation & comparison of state-of-the-art GNN-based fraud detection models. The introduction of implemented models can be found [here](#implemented-models). \u003C!-- (Add introduction blogs links). -->\n\nWe welcome contributions on adding new fraud detectors and extending the features of the toolbox. Some of the planned features are listed in [TODO list](#todo-list). \n\nIf you use the toolbox in your project, please cite one of the two papers below and the [algorithms](#implemented-models) you used :\n\nCIKM'20 ([PDF](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2008.08692.pdf))\n```bibtex\n@inproceedings{dou2020enhancing,\n  title={Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters},\n  author={Dou, Yingtong and Liu, Zhiwei and Sun, Li and Deng, Yutong and Peng, Hao and Yu, Philip S},\n  booktitle={Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM'20)},\n  year={2020}\n}\n```\nSIGIR'20 ([PDF](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2005.00625.pdf))\n```bibtex\n@inproceedings{liu2020alleviating,\n  title={Alleviating the Inconsistency Problem of Applying Graph Neural Network to Fraud Detection},\n  author={Liu, Zhiwei and Dou, Yingtong and Yu, Philip S. and Deng, Yutong and Peng, Hao},\n  booktitle={Proceedings of the 43nd International ACM SIGIR Conference on Research and Development in Information Retrieval},\n  year={2020}\n}\n```\n\n**Useful Resources**\n- [PyGOD: A Python Library for Graph Outlier Detection (Anomaly Detection)](https:\u002F\u002Fgithub.com\u002Fpygod-team\u002Fpygod)\n- [UGFraud: An Unsupervised Graph-based Toolbox for Fraud Detection](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FUGFraud)\n- [Graph-based Fraud Detection Paper List](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002Fgraph-fraud-detection-papers) \n- [Awesome Fraud Detection Papers](https:\u002F\u002Fgithub.com\u002Fbenedekrozemberczki\u002Fawesome-fraud-detection-papers)\n- [Attack and Defense Papers on Graph Data](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002Fgraph-adversarial-learning-literature)\n- [PyOD: A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)](https:\u002F\u002Fgithub.com\u002Fyzhao062\u002Fpyod)\n- [PyODD: An End-to-end Outlier Detection System](https:\u002F\u002Fgithub.com\u002Fdatamllab\u002Fpyodds)\n- [DGL: Deep Graph Library](https:\u002F\u002Fgithub.com\u002Fdmlc\u002Fdgl)\n- [Outlier Detection DataSets (ODDS)](http:\u002F\u002Fodds.cs.stonybrook.edu\u002F)\n\n**Table of Contents**\n- [Installation](#installation)\n- [Datasets](#datasets)\n- [User Guide](#user-guide)\n- [Implemented Models](#implemented-models)\n- [Model Comparison](#model-comparison)\n- [TODO List](#todo-list)\n- [How to Contribute](#how-to-contribute)\n\n\n## Installation\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud.git\ncd DGFraud\npython setup.py install\n```\n### Requirements\n```bash\n* python 3.6, 3.7\n* tensorflow>=1.14.0,\u003C2.0\n* numpy>=1.16.4\n* scipy>=1.2.0\n* networkx\u003C=1.11\n```\n## Datasets\n\n### DBLP\nWe uses the pre-processed DBLP dataset from [Jhy1993\u002FHAN](https:\u002F\u002Fgithub.com\u002FJhy1993\u002FHAN)\nYou can run the FdGars, Player2Vec, GeniePath and GEM based on the DBLP dataset.\nUnzip the archive before using the dataset:\n```bash\ncd dataset\nunzip DBLP4057_GAT_with_idx_tra200_val_800.zip\n```\n\n### Example dataset\nWe implement example graphs for SemiGNN, GAS and GEM in `data_loader.py`. Because those models require unique graph structures or node types, which cannot be found in opensource datasets.\n\n\n### Yelp dataset\nFor [GraphConsis](https:\u002F\u002Farxiv.org\u002Fabs\u002F2005.00625), we preprocessed [Yelp Spam Review Dataset](http:\u002F\u002Fodds.cs.stonybrook.edu\u002Fyelpchi-dataset\u002F) with reviews as nodes and three relations as edges.\n\nThe dataset with `.mat` format is located at `\u002Fdataset\u002FYelpChi.zip`. The `.mat` file includes:\n- `net_rur, net_rtr, net_rsr`: three sparse matrices representing three homo-graphs defined in [GraphConsis](https:\u002F\u002Farxiv.org\u002Fabs\u002F2005.00625) paper;\n- `features`: a sparse matrix of 32-dimension handcrafted features;\n- `label`: a numpy array with the ground truth of nodes. `1` represents spam and `0` represents benign.\n\nThe YelpChi data preprocessing details can be found in our [CIKM'20](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2008.08692.pdf) paper.\nTo get the complete metadata of the Yelp dataset, please email to [ytongdou@gmail.com](mailto:ytongdou@gmail.com) for inquiry.\n\n\n## User Guide\n\n### Running the example code\nYou can find the implemented models in `algorithms` directory. For example, you can run Player2Vec using:\n```bash\npython Player2Vec_main.py \n```\nYou can specify parameters for models when running the code.\n\n### Running on your datasets\nHave a look at the load_data_dblp() function in utils\u002Futils.py for an example.\n\nIn order to use your own data, you have to provide:\n* adjacency matrices or adjlists (for GAS);\n* a feature matrix\n* a label matrix\nthen split feature matrix and label matrix into testing data and training data.\n\nYou can specify a dataset as follows:\n```bash\npython xx_main.py --dataset your_dataset \n```\nor by editing xx_main.py\n\n### The structure of code\nThe repository is organized as follows:\n- `algorithms\u002F` contains the implemented models and the corresponding example code;\n- `base_models\u002F` contains the basic models (GCN);\n- `dataset\u002F` contains the necessary dataset files;\n- `utils\u002F` contains:\n    * loading and splitting the data (`data_loader.py`);\n    * contains various utilities (`utils.py`).\n\n\n## Implemented Models\n\n| Model  | Paper  | Venue  | Reference  |\n|-------|--------|--------|--------|\n| **SemiGNN** | [A Semi-supervised Graph Attentive Network for Financial Fraud Detection](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2003.01171)  | ICDM 2019  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fsemignn.txt) |\n| **Player2Vec** | [Key Player Identification in Underground Forums over Attributed Heterogeneous Information Network Embedding Framework](http:\u002F\u002Fmason.gmu.edu\u002F~lzhao9\u002Fmaterials\u002Fpapers\u002Flp0110-zhangA.pdf)  | CIKM 2019  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fplayer2vec.txt)|\n| **GAS** | [Spam Review Detection with Graph Convolutional Networks](https:\u002F\u002Farxiv.org\u002Fabs\u002F1908.10679)  | CIKM 2019 | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgas.txt) |\n| **FdGars** | [FdGars: Fraudster Detection via Graph Convolutional Networks in Online App Review System](https:\u002F\u002Fdl.acm.org\u002Fcitation.cfm?id=3316586)  | WWW 2019 | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Ffdgars.txt) |\n| **GeniePath** | [GeniePath: Graph Neural Networks with Adaptive Receptive Paths](https:\u002F\u002Farxiv.org\u002Fabs\u002F1802.00910)  | AAAI 2019 | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgeniepath.txt)  |\n| **GEM** | [Heterogeneous Graph Neural Networks for Malicious Account Detection](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2002.12307.pdf)  | CIKM 2018 |[BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgem.txt) |\n| **GraphSAGE** | [Inductive Representation Learning on Large Graphs](https:\u002F\u002Farxiv.org\u002Fpdf\u002F1706.02216.pdf)  | NIPS 2017  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgraphsage.txt) |\n| **GraphConsis** | [Alleviating the Inconsistency Problem of Applying Graph Neural Network to Fraud Detection](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2005.00625.pdf)  | SIGIR 2020  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgraphconsis.txt) |\n| **HACUD** | [Cash-Out User Detection Based on Attributed Heterogeneous Information Network with a Hierarchical Attention Mechanism](https:\u002F\u002Faaai.org\u002Fojs\u002Findex.php\u002FAAAI\u002Farticle\u002Fview\u002F3884)  | AAAI 2019 |  [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fhacud.txt) |\n\n\n## Model Comparison\n| Model  | Application  | Graph Type  | Base Model  |\n|-------|--------|--------|--------|\n| **SemiGNN** | Financial Fraud  | Heterogeneous   | GAT, LINE, DeepWalk |\n| **Player2Vec** | Cyber Criminal  | Heterogeneous | GAT, GCN|\n| **GAS** | Opinion Fraud  | Heterogeneous | GCN, GAT |\n| **FdGars** |  Opinion Fraud | Homogeneous | GCN |\n| **GeniePath** | Financial Fraud | Homogeneous | GAT  |\n| **GEM** | Financial Fraud  | Heterogeneous |GCN |\n| **GraphSAGE** | Opinion Fraud  | Homogeneous   | GraphSAGE |\n| **GraphConsis** | Opinion Fraud  | Heterogeneous   | GraphSAGE |\n| **HACUD** | Financial Fraud | Heterogeneous | GAT |\n\n\n## TODO List\n- Implementing mini-batch training\n- The log loss for GEM model\n- Time-based sampling for GEM\n- Add sampling methods\n- Benchmarking SOTA models\n- Scalable implementation\n- Pytorch implementation\n\n## How to Contribute\nYou are welcomed to contribute to this open-source toolbox. The detailed instructions will be released soon. Currently, you can create issues or email to [bdscsafegraph@gmail.com](mailto:bdscsafegraph@gmail.com) for inquiry.\n\n","\u003Cp align=\"center\">\n    \u003Cbr>\n    \u003Ca href=\"https:\u002F\u002Fimage.flaticon.com\u002Ficons\u002Fsvg\u002F1671\u002F1671517.svg\">\n        \u003Cimg src=\"https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fsafe-graph_DGFraud_readme_43ed16403baf.png\" width=\"400\"\u002F>\n    \u003C\u002Fa>\n    \u003Cbr>\n\u003Cp>\n\u003Cp align=\"center\">\n    \u003Ca href=\"https:\u002F\u002Ftravis-ci.org\u002Fgithub\u002Fsafe-graph\u002FDGFraud\">\n        \u003Cimg alt=\"PRs Welcome\" src=\"https:\u002F\u002Ftravis-ci.org\u002Fsafe-graph\u002FDGFraud.svg?branch=master\">\n    \u003C\u002Fa>\n    \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002FLICENSE\">\n        \u003Cimg alt=\"GitHub\" src=\"https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Flicense\u002Fsafe-graph\u002FDGFraud\">\n    \u003C\u002Fa>\n    \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fpulls\">\n        \u003Cimg alt=\"GitHub release\" src=\"https:\u002F\u002Fimg.shields.io\u002Fgithub\u002Fv\u002Frelease\u002Fsafe-graph\u002FDGFraud?include_prereleases\">\n    \u003C\u002Fa>\n    \u003Ca href=\"https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Farchive\u002Fmaster.zip\">\n        \u003Cimg alt=\"PRs\" src=\"https:\u002F\u002Fimg.shields.io\u002Fbadge\u002FPRs-welcome-brightgreen.svg\">\n    \u003C\u002Fa>\n\u003C\u002Fp>\n\n\u003Ch3 align=\"center\">\n\u003Cp>基于深度图神经网络的欺诈检测工具箱\n\u003C\u002Fh3>\n\n**简介**\n\n**2021年5月更新：** DGFraud 已升级至 TensorFlow 2.0！请查看 [DGFraud-TF2](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud-TF2)\n\n**DGFraud** 是一个基于图神经网络（GNN）的欺诈检测工具箱。它集成了当前最先进的 GNN 款式欺诈检测模型的实现与对比。已实现模型的介绍可参见 [此处](#implemented-models)。\u003C!-- (添加介绍博客链接)。 -->\n\n我们欢迎贡献者加入，以新增欺诈检测器并扩展该工具箱的功能。部分计划中的功能列于 [待办事项列表](#todo-list)。\n\n如果您在项目中使用本工具箱，请引用以下两篇论文之一以及您所使用的 [算法](#implemented-models)：\n\nCIKM'20 ([PDF](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2008.08692.pdf))\n```bibtex\n@inproceedings{dou2020enhancing,\n  title={Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged Fraudsters},\n  author={Dou, Yingtong and Liu, Zhiwei and Sun, Li and Deng, Yutong and Peng, Hao and Yu, Philip S},\n  booktitle={Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM'20)},\n  year={2020}\n}\n```\nSIGIR'20 ([PDF](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2005.00625.pdf))\n```bibtex\n@inproceedings{liu2020alleviating,\n  title={Alleviating the Inconsistency Problem of Applying Graph Neural Network to Fraud Detection},\n  author={Liu, Zhiwei and Dou, Yingtong and Yu, Philip S. and Deng, Yutong and Peng, Hao},\n  booktitle={Proceedings of the 43nd International ACM SIGIR Conference on Research and Development in Information Retrieval},\n  year={2020}\n}\n```\n\n**有用资源**\n- [PyGOD：用于图异常检测的 Python 库](https:\u002F\u002Fgithub.com\u002Fpygod-team\u002Fpygod)\n- [UGFraud：无监督图基欺诈检测工具箱](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FUGFraud)\n- [图基欺诈检测论文列表](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002Fgraph-fraud-detection-papers) \n- [优秀欺诈检测论文](https:\u002F\u002Fgithub.com\u002Fbenedekrozemberczki\u002Fawesome-fraud-detection-papers)\n- [图数据上的攻击与防御相关论文](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002Fgraph-adversarial-learning-literature)\n- [PyOD：用于大规模异常检测的 Python 工具箱](https:\u002F\u002Fgithub.com\u002Fyzhao062\u002Fpyod)\n- [PyODD：端到端异常检测系统](https:\u002F\u002Fgithub.com\u002Fdatamllab\u002Fpyodds)\n- [DGL：深度图库](https:\u002F\u002Fgithub.com\u002Fdmlc\u002Fdgl)\n- [异常检测数据集（ODDS）](http:\u002F\u002Fodds.cs.stonybrook.edu\u002F)\n\n**目录**\n- [安装](#installation)\n- [数据集](#datasets)\n- [用户指南](#user-guide)\n- [已实现模型](#implemented-models)\n- [模型比较](#model-comparison)\n- [待办事项列表](#todo-list)\n- [如何贡献](#how-to-contribute)\n\n\n## 安装\n```bash\ngit clone https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud.git\ncd DGFraud\npython setup.py install\n```\n### 需求\n```bash\n* python 3.6, 3.7\n* tensorflow>=1.14.0,\u003C2.0\n* numpy>=1.16.4\n* scipy>=1.2.0\n* networkx\u003C=1.11\n```\n## 数据集\n\n### DBLP\n我们使用来自 [Jhy1993\u002FHAN](https:\u002F\u002Fgithub.com\u002FJhy1993\u002FHAN) 的预处理 DBLP 数据集。\n您可以基于 DBLP 数据集运行 FdGars、Player2Vec、GeniePath 和 GEM。\n在使用数据集之前，请先解压缩：\n```bash\ncd dataset\nunzip DBLP4057_GAT_with_idx_tra200_val_800.zip\n```\n\n### 示例数据集\n我们在 `data_loader.py` 中实现了 SemiGNN、GAS 和 GEM 的示例图。由于这些模型需要独特的图结构或节点类型，而开源数据集并不具备这些条件。\n\n\n### Yelp 数据集\n针对 [GraphConsis](https:\u002F\u002Farxiv.org\u002Fabs\u002F2005.00625)，我们对 [Yelp 垃圾评论数据集](http:\u002F\u002Fodds.cs.stonybrook.edu\u002Fyelpchi-dataset\u002F) 进行了预处理，将评论作为节点，三种关系作为边。\n\n该 `.mat` 格式的数据集位于 `\u002Fdataset\u002FYelpChi.zip`。`.mat` 文件包含：\n- `net_rur, net_rtr, net_rsr`：三张稀疏矩阵，分别代表 [GraphConsis](https:\u002F\u002Farxiv.org\u002Fabs\u002F2005.00625) 论文中定义的三个同质图；\n- `features`：一张 32 维手工特征的稀疏矩阵；\n- `label`：一个包含节点真实标签的 NumPy 数组。`1` 表示垃圾评论，`0` 表示正常评论。\n\nYelpChi 数据预处理的详细信息可在我们的 [CIKM'20](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2008.08692.pdf) 论文中找到。\n如需获取 Yelp 数据集的完整元数据，请发送邮件至 [ytongdou@gmail.com](mailto:ytongdou@gmail.com) 咨询。\n\n\n## 用户指南\n\n### 运行示例代码\n您可以在 `algorithms` 目录中找到已实现的模型。例如，您可以使用以下命令运行 Player2Vec：\n```bash\npython Player2Vec_main.py \n```\n运行代码时，您可以指定模型参数。\n\n### 在您的数据集上运行\n请参阅 `utils\u002Futils.py` 中的 `load_data_dblp()` 函数以获取示例。\n\n要使用您自己的数据，您需要提供：\n* 邻接矩阵或邻接表（适用于 GAS）；\n* 特征矩阵；\n* 标签矩阵；\n然后将特征矩阵和标签矩阵划分为测试数据和训练数据。\n\n您可以按如下方式指定数据集：\n```bash\npython xx_main.py --dataset your_dataset \n```\n或者直接编辑 `xx_main.py`。\n\n### 代码结构\n该仓库的组织如下：\n- `algorithms\u002F` 包含已实现的模型及其对应的示例代码；\n- `base_models\u002F` 包含基础模型（GCN）；\n- `dataset\u002F` 包含必要的数据文件；\n- `utils\u002F` 包括：\n    * 数据加载与拆分（`data_loader.py`）；\n    * 各种实用工具（`utils.py`）。\n\n## 已实现模型\n\n| 模型  | 论文  | 会议\u002F期刊  | 参考文献  |\n|-------|--------|--------|--------|\n| **SemiGNN** | [用于金融欺诈检测的半监督图注意力网络](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2003.01171)  | ICDM 2019  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fsemignn.txt) |\n| **Player2Vec** | [基于属性异构信息网络嵌入框架的地下论坛关键人物识别](http:\u002F\u002Fmason.gmu.edu\u002F~lzhao9\u002Fmaterials\u002Fpapers\u002Flp0110-zhangA.pdf)  | CIKM 2019  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fplayer2vec.txt)|\n| **GAS** | [利用图卷积网络进行垃圾评论检测](https:\u002F\u002Farxiv.org\u002Fabs\u002F1908.10679)  | CIKM 2019 | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgas.txt) |\n| **FdGars** | [FdGars：基于图卷积网络的在线应用评论系统中的欺诈者检测](https:\u002F\u002Fdl.acm.org\u002Fcitation.cfm?id=3316586)  | WWW 2019 | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Ffdgars.txt) |\n| **GeniePath** | [GeniePath：具有自适应感受野路径的图神经网络](https:\u002F\u002Farxiv.org\u002Fabs\u002F1802.00910)  | AAAI 2019 | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgeniepath.txt)  |\n| **GEM** | [用于恶意账号检测的异构图神经网络](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2002.12307.pdf)  | CIKM 2018 |[BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgem.txt) |\n| **GraphSAGE** | [大规模图上的归纳表示学习](https:\u002F\u002Farxiv.org\u002Fpdf\u002F1706.02216.pdf)  | NIPS 2017  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgraphsage.txt) |\n| **GraphConsis** | [缓解图神经网络应用于欺诈检测时的一致性问题](https:\u002F\u002Farxiv.org\u002Fpdf\u002F2005.00625.pdf)  | SIGIR 2020  | [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fgraphconsis.txt) |\n| **HACUD** | [基于属性异构信息网络并结合层次化注意力机制的套现用户检测](https:\u002F\u002Faaai.org\u002Fojs\u002Findex.php\u002FAAAI\u002Farticle\u002Fview\u002F3884)  | AAAI 2019 |  [BibTex](https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fblob\u002Fmaster\u002Freference\u002Fhacud.txt) |\n\n\n## 模型对比\n| 模型  | 应用场景  | 图类型  | 基础模型  |\n|-------|--------|--------|--------|\n| **SemiGNN** | 金融欺诈  | 异构   | GAT, LINE, DeepWalk |\n| **Player2Vec** | 网络犯罪  | 异构 | GAT, GCN|\n| **GAS** | 评论欺诈  | 异构 | GCN, GAT |\n| **FdGars** | 评论欺诈 | 同构 | GCN |\n| **GeniePath** | 金融欺诈 | 同构 | GAT  |\n| **GEM** | 金融欺诈  | 异构 |GCN |\n| **GraphSAGE** | 评论欺诈  | 同构   | GraphSAGE |\n| **GraphConsis** | 评论欺诈  | 异构   | GraphSAGE |\n| **HACUD** | 金融欺诈 | 异构 | GAT |\n\n\n## 待办事项\n- 实现小批量训练\n- GEM模型的对数损失计算\n- GEM模型的时间采样方法\n- 添加更多采样方法\n- 对齐当前最优模型进行基准测试\n- 实现可扩展的版本\n- PyTorch实现\n\n## 如何贡献\n欢迎各位为本开源工具库贡献力量。详细的贡献指南将很快发布。目前，您可以通过创建Issue或发送邮件至[bdscsafegraph@gmail.com](mailto:bdscsafegraph@gmail.com)进行咨询。","# DGFraud 快速上手指南\n\nDGFraud 是一个基于图神经网络（GNN）的欺诈检测工具箱，集成了多种前沿的欺诈检测模型实现与对比。\n\n## 环境准备\n\n在开始之前，请确保您的系统满足以下要求：\n\n*   **操作系统**：Linux \u002F macOS \u002F Windows\n*   **Python 版本**：3.6 或 3.7\n*   **核心依赖**：\n    *   TensorFlow: `>=1.14.0, \u003C2.0` (注意：本项目基于 TF 1.x，若需 TF 2.0 版本请参考 DGFraud-TF2)\n    *   NumPy: `>=1.16.4`\n    *   SciPy: `>=1.2.0`\n    *   NetworkX: `\u003C=1.11`\n\n> **提示**：国内用户建议使用清华源或阿里源加速 Python 包的安装。\n\n## 安装步骤\n\n1.  **克隆项目代码**\n    ```bash\n    git clone https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud.git\n    cd DGFraud\n    ```\n\n2.  **安装依赖并构建**\n    执行以下命令自动安装所需依赖并完成项目配置：\n    ```bash\n    python setup.py install\n    ```\n    *如果上述命令未自动安装 Python 依赖，请手动执行：*\n    ```bash\n    pip install -r requirements.txt\n    # 或者手动安装指定版本\n    pip install \"tensorflow>=1.14.0,\u003C2.0\" \"numpy>=1.16.4\" \"scipy>=1.2.0\" \"networkx\u003C=1.11\"\n    ```\n\n3.  **准备数据集**\n    以 DBLP 数据集为例，进入数据目录并解压：\n    ```bash\n    cd dataset\n    unzip DBLP4057_GAT_with_idx_tra200_val_800.zip\n    cd ..\n    ```\n\n## 基本使用\n\n### 运行示例模型\n项目已在 `algorithms` 目录下实现了多种模型（如 Player2Vec, FdGars, GraphConsis 等）。直接运行对应的 `_main.py` 文件即可启动训练和评估。\n\n例如，运行 **Player2Vec** 模型：\n```bash\npython Player2Vec_main.py\n```\n\n### 使用自定义数据集\n若需使用自己的数据，需准备以下文件：\n*   邻接矩阵或邻接表（针对 GAS 模型）\n*   特征矩阵 (`features`)\n*   标签向量 (`label`)\n\n参考 `utils\u002Futils.py` 中的 `load_data_dblp()` 函数编写数据加载逻辑，将数据划分为训练集和测试集。\n\n运行自定义数据时，可通过参数指定：\n```bash\npython xx_main.py --dataset your_dataset\n```\n或者直接编辑 `xx_main.py` 文件中的数据加载路径和参数。\n\n### 代码结构说明\n*   `algorithms\u002F`: 包含已实现的各类欺诈检测模型及示例脚本。\n*   `base_models\u002F`: 包含基础模型（如 GCN）。\n*   `dataset\u002F`: 存放预处理后的数据集文件。\n*   `utils\u002F`: 包含数据加载 (`data_loader.py`) 和通用工具函数 (`utils.py`)。","某大型跨境电商平台的风控团队正面临“黑产团伙”利用虚假账号进行刷单和套现的严峻挑战，急需从海量交易数据中精准识别隐蔽的欺诈网络。\n\n### 没有 DGFraud 时\n- **孤立视角局限**：传统规则引擎仅分析单笔交易特征，无法捕捉欺诈者之间复杂的关联关系，导致团伙作案难以被发现。\n- **对抗能力薄弱**：黑产通过伪装成正常用户（如模拟真实浏览轨迹）轻松绕过检测，模型缺乏针对“伪装欺诈者”的防御机制。\n- **算法验证低效**：团队需手动复现论文中的图神经网络（GNN）算法，环境配置繁琐且不同模型间缺乏统一对比标准，研发周期长达数周。\n- **误报率居高不下**：由于无法有效解决图数据在欺诈检测中的不一致性问题，大量正常用户被误判，严重影响用户体验。\n\n### 使用 DGFraud 后\n- **全局关联洞察**：利用 DGFraud 内置的图神经网络模型，将用户、设备、IP 构建为异构图，成功挖掘出隐藏在正常交易背后的紧密欺诈子图。\n- **强化抗伪装性**：部署经 CIKM'20 论文验证的增强型检测器，专门针对伪装行为进行训练，显著提升了识别“高仿真”黑产账号的能力。\n- **一站式实验对比**：直接调用 DGFraud 集成的多种 SOTA 模型进行快速基准测试，统一了 TensorFlow 环境，将新算法的验证与上线时间缩短至几天。\n- **精度显著提升**：借助 SIGIR'20 提出的不一致性缓解技术，模型在保持高召回率的同时大幅降低误报，减少了对正常用户的打扰。\n\nDGFraud 通过将前沿图深度学习算法工程化，帮助风控团队从被动防御转向主动挖掘，从根本上提升了对抗有组织欺诈的核心战斗力。","https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Fsafe-graph_DGFraud_c6ba2ee9.png","safe-graph","SafeGraph","https:\u002F\u002Foss.gittoolsai.com\u002Favatars\u002Fsafe-graph_f8c42abc.png","Towards Secure Machine Learning on Graph Data",null,"bdscsafegraph@gmail.com","https:\u002F\u002Fbdsc.lab.uic.edu\u002FNSF1930941.html","https:\u002F\u002Fgithub.com\u002Fsafe-graph",[84],{"name":85,"color":86,"percentage":87},"Python","#3572A5",100,751,164,"2026-03-31T12:30:26","Apache-2.0",4,"","未说明",{"notes":96,"python":97,"dependencies":98},"该项目已升级至 TensorFlow 2.0 版本，建议查看新的仓库 DGFraud-TF2。部分模型（如 SemiGNN, GAS, GEM）需要使用代码中内置的示例图数据，因为开源数据集中缺乏其所需的特定图结构或节点类型。Yelp 数据集需解压 .mat 文件使用，完整元数据需联系作者获取。","3.6, 3.7",[99,100,101,102],"tensorflow>=1.14.0,\u003C2.0","numpy>=1.16.4","scipy>=1.2.0","networkx\u003C=1.11",[51,13,54],[105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124],"fraud-detection","fraud-prevention","anomaly-detection","graph-algorithms","graphneuralnetwork","machine-learning","opensource","toolkit","datamining","datascience","spamdetection","financial-engineering","security","security-tools","outlier-detection","yelp-dataset","dblp-dataset","graph","graph-convolutional-networks","graph-neural-networks","2026-03-27T02:49:30.150509","2026-04-06T05:16:36.760230",[128,133,138,143,148,153,158,163,168,173],{"id":129,"question_zh":130,"answer_zh":131,"source_url":132},11009,"在 DBLP 数据加载中，为什么要执行 `data['net_APA'] - np.eye(N)` 操作？","预处理后的 DBLP 数据已经为每个子图添加了自环（self-loops）。为了统一数据输入\u002F输出格式，代码中会删除现有的自环。该操作即是为了移除单位矩阵对应的自环连接。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F29",{"id":134,"question_zh":135,"answer_zh":136,"source_url":137},11010,"使用 YelpChi 数据集复现论文结果时，逻辑回归（Logistic Regression）的 AUC 仅为 0.5，如何解决？","AUC 为 0.5 通常意味着分类器未优化或过拟合。建议采取以下措施：\n1. 设置类别权重平衡：在 sklearn 的 LogisticRegression 中设置 `class_weights='balanced'`。\n2. 调整参数：使用 `solver='lbfgs'`, `multi_class='multinomial'`, `max_iter=1000`。\n3. 负采样：对负样本进行欠采样以平衡训练数据的类别分布，防止因类别极度不平衡导致的过拟合。\n注意：论文中使用的是 BoW 特征而非 word2vec，且文本嵌入在该任务中信息量有限，建议使用带有手工特征的新版 YelpChi 图数据集。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F34",{"id":139,"question_zh":140,"answer_zh":141,"source_url":142},11011,"如何获取论文中描述的 Amazon 数据集及其预处理方法？","原始数据包含帮助性投票信息，可从 UCSD 网站下载（搜索 'Amazon reviews' 中的 'Musical Instruments' 类别）。下载后，运行项目中的 `amazon_preprocess.py` 脚本，即可生成包含 11,949 个采样节点及其对应特征的数据集。也可以使用其他 Amazon 评论数据生成类似的图结构。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F39",{"id":144,"question_zh":145,"answer_zh":146,"source_url":147},11012,"SemiGNN 模型中的 Node-Attention 机制参数 `H_v` 的形状为何是固定的 `[hidden_size, 1]`，而不是随邻居数量变化？","注意力层 `H_v` 具有固定的形状，用于编码每个节点的输入嵌入。在反向传播过程中，仅更新 `H_v` 的参数，这与图结构无关。具体的 softmax 注意力系数是通过代码计算得出的（参考 GAT 论文及其实现理解其工作原理），而不是直接由 `H_v` 的形状决定邻居权重。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F36",{"id":149,"question_zh":150,"answer_zh":151,"source_url":152},11013,"SemiGNN 代码中似乎没有在 loss 函数内实现负采样（Negative Sampling），这是怎么回事？","负采样确实未在模型的 forward 函数中直接体现，而是在主函数（main function）中执行的。当前版本将采样逻辑放在外部，未来版本计划将其整合到 SemiGNN 的前向传播中以使结构更清晰。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F35",{"id":154,"question_zh":155,"answer_zh":156,"source_url":157},11014,"运行 SemiGNN 时遇到 `ResourceExhaustedError: OOM`（内存溢出）错误，特别是在计算内积时张量过大，该如何解决？","由于 SemiGNN 包含多重注意力和损失计算，对显存要求较高。解决方案包括：\n1. 使用显存更大的 GPU 运行代码。\n2. 采用小批量（minibatch）训练方式处理数据（注：当前代码版本尚未原生支持 minibatch 训练，需自行修改或等待后续更新）。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F33",{"id":159,"question_zh":160,"answer_zh":161,"source_url":162},11015,"在 GEM 算法中，如何将稀疏矩阵（sparse matrix）作为邻接表输入模型？","不要手动构建 SparseTensor 再进行 embedding_lookup。可以直接使用 TensorFlow 的 `tf.sparse.sparse_dense_matmul` 函数，将稀疏邻接矩阵 `self.placeholders['a'][d]` 与初始节点嵌入 `h` 直接相乘，从而高效地处理稀疏输入。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F32",{"id":164,"question_zh":165,"answer_zh":166,"source_url":167},11016,"面对大规模邻接矩阵（如 72 万 x72 万）和连续数值型特征，应如何进行数据预处理？","1. 大规模矩阵：建议在每种关系下仅保留少量具有信息量的邻居节点进行采样。采样方法应在减小图规模的同时，确保每个节点在每个关系下仍有足够的邻居。\n2. 连续特征：对于年龄等连续数值，可以尝试直接使用、分箱处理（binning）或 One-hot 编码，具体取决于不同编码方法在特定任务上的表现，通常需要通过实验确定最佳方案。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F30",{"id":169,"question_zh":170,"answer_zh":171,"source_url":172},11017,"GAS 数据加载器中的 `review_vecs`、`homo_adj` 和标签 `y` 是如何初始化的？","代码中展示的向量和矩阵仅是为了演示目的而随机初始化的。在实际使用中，用户需要根据自己的数据集来初始化特征矩阵、图结构（邻接矩阵）和标签向量。","https:\u002F\u002Fgithub.com\u002Fsafe-graph\u002FDGFraud\u002Fissues\u002F37",{"id":174,"question_zh":175,"answer_zh":176,"source_url":132},11018,"在 `train_test_split` 过程中，是否会导致节点和关系断裂？","`train_test_split` 仅选择标签用于训练或测试阶段的节点，它不会分割邻接矩阵。在训练过程中，模型利用训练集中节点的关系和邻居来学习其嵌入并预测类别，因此不会因为划分导致图结构本身的断裂，而是基于现有连接进行学习。",[178,183],{"id":179,"version":180,"summary_zh":181,"released_at":182},53484,"v0.1.0","这是 DGFraud 的正式发布版本。\n- 修复了若干 bug\n- 新增 GraphConsis、HACUD 和 GraphSAGE 模型\n- 新增 Yelp 数据集","2020-07-31T05:29:55",{"id":184,"version":185,"summary_zh":186,"released_at":187},53485,"v1.0-alpha","- 实现了六个基于图神经网络的欺诈检测模型","2020-05-05T04:58:02"]