[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"similar-refreshdotdev--web-eval-agent":3,"tool-refreshdotdev--web-eval-agent":64},[4,17,27,35,48,56],{"id":5,"name":6,"github_repo":7,"description_zh":8,"stars":9,"difficulty_score":10,"last_commit_at":11,"category_tags":12,"status":16},3808,"stable-diffusion-webui","AUTOMATIC1111\u002Fstable-diffusion-webui","stable-diffusion-webui 是一个基于 Gradio 构建的网页版操作界面，旨在让用户能够轻松地在本地运行和使用强大的 Stable Diffusion 图像生成模型。它解决了原始模型依赖命令行、操作门槛高且功能分散的痛点，将复杂的 AI 绘图流程整合进一个直观易用的图形化平台。\n\n无论是希望快速上手的普通创作者、需要精细控制画面细节的设计师，还是想要深入探索模型潜力的开发者与研究人员，都能从中获益。其核心亮点在于极高的功能丰富度：不仅支持文生图、图生图、局部重绘（Inpainting）和外绘（Outpainting）等基础模式，还独创了注意力机制调整、提示词矩阵、负向提示词以及“高清修复”等高级功能。此外，它内置了 GFPGAN 和 CodeFormer 等人脸修复工具，支持多种神经网络放大算法，并允许用户通过插件系统无限扩展能力。即使是显存有限的设备，stable-diffusion-webui 也提供了相应的优化选项，让高质量的 AI 艺术创作变得触手可及。",162132,3,"2026-04-05T11:01:52",[13,14,15],"开发框架","图像","Agent","ready",{"id":18,"name":19,"github_repo":20,"description_zh":21,"stars":22,"difficulty_score":23,"last_commit_at":24,"category_tags":25,"status":16},1381,"everything-claude-code","affaan-m\u002Feverything-claude-code","everything-claude-code 是一套专为 AI 编程助手（如 Claude Code、Codex、Cursor 等）打造的高性能优化系统。它不仅仅是一组配置文件，而是一个经过长期实战打磨的完整框架，旨在解决 AI 代理在实际开发中面临的效率低下、记忆丢失、安全隐患及缺乏持续学习能力等核心痛点。\n\n通过引入技能模块化、直觉增强、记忆持久化机制以及内置的安全扫描功能，everything-claude-code 能显著提升 AI 在复杂任务中的表现，帮助开发者构建更稳定、更智能的生产级 AI 代理。其独特的“研究优先”开发理念和针对 Token 消耗的优化策略，使得模型响应更快、成本更低，同时有效防御潜在的攻击向量。\n\n这套工具特别适合软件开发者、AI 研究人员以及希望深度定制 AI 工作流的技术团队使用。无论您是在构建大型代码库，还是需要 AI 协助进行安全审计与自动化测试，everything-claude-code 都能提供强大的底层支持。作为一个曾荣获 Anthropic 黑客大奖的开源项目，它融合了多语言支持与丰富的实战钩子（hooks），让 AI 真正成长为懂上",138956,2,"2026-04-05T11:33:21",[13,15,26],"语言模型",{"id":28,"name":29,"github_repo":30,"description_zh":31,"stars":32,"difficulty_score":23,"last_commit_at":33,"category_tags":34,"status":16},2271,"ComfyUI","Comfy-Org\u002FComfyUI","ComfyUI 是一款功能强大且高度模块化的视觉 AI 引擎，专为设计和执行复杂的 Stable Diffusion 图像生成流程而打造。它摒弃了传统的代码编写模式，采用直观的节点式流程图界面，让用户通过连接不同的功能模块即可构建个性化的生成管线。\n\n这一设计巧妙解决了高级 AI 绘图工作流配置复杂、灵活性不足的痛点。用户无需具备编程背景，也能自由组合模型、调整参数并实时预览效果，轻松实现从基础文生图到多步骤高清修复等各类复杂任务。ComfyUI 拥有极佳的兼容性，不仅支持 Windows、macOS 和 Linux 全平台，还广泛适配 NVIDIA、AMD、Intel 及苹果 Silicon 等多种硬件架构，并率先支持 SDXL、Flux、SD3 等前沿模型。\n\n无论是希望深入探索算法潜力的研究人员和开发者，还是追求极致创作自由度的设计师与资深 AI 绘画爱好者，ComfyUI 都能提供强大的支持。其独特的模块化架构允许社区不断扩展新功能，使其成为当前最灵活、生态最丰富的开源扩散模型工具之一，帮助用户将创意高效转化为现实。",107662,"2026-04-03T11:11:01",[13,14,15],{"id":36,"name":37,"github_repo":38,"description_zh":39,"stars":40,"difficulty_score":23,"last_commit_at":41,"category_tags":42,"status":16},2268,"ML-For-Beginners","microsoft\u002FML-For-Beginners","ML-For-Beginners 是由微软推出的一套系统化机器学习入门课程，旨在帮助零基础用户轻松掌握经典机器学习知识。这套课程将学习路径规划为 12 周，包含 26 节精炼课程和 52 道配套测验，内容涵盖从基础概念到实际应用的完整流程，有效解决了初学者面对庞大知识体系时无从下手、缺乏结构化指导的痛点。\n\n无论是希望转型的开发者、需要补充算法背景的研究人员，还是对人工智能充满好奇的普通爱好者，都能从中受益。课程不仅提供了清晰的理论讲解，还强调动手实践，让用户在循序渐进中建立扎实的技能基础。其独特的亮点在于强大的多语言支持，通过自动化机制提供了包括简体中文在内的 50 多种语言版本，极大地降低了全球不同背景用户的学习门槛。此外，项目采用开源协作模式，社区活跃且内容持续更新，确保学习者能获取前沿且准确的技术资讯。如果你正寻找一条清晰、友好且专业的机器学习入门之路，ML-For-Beginners 将是理想的起点。",84991,"2026-04-05T10:45:23",[14,43,44,45,15,46,26,13,47],"数据工具","视频","插件","其他","音频",{"id":49,"name":50,"github_repo":51,"description_zh":52,"stars":53,"difficulty_score":10,"last_commit_at":54,"category_tags":55,"status":16},3128,"ragflow","infiniflow\u002Fragflow","RAGFlow 是一款领先的开源检索增强生成（RAG）引擎，旨在为大语言模型构建更精准、可靠的上下文层。它巧妙地将前沿的 RAG 技术与智能体（Agent）能力相结合，不仅支持从各类文档中高效提取知识，还能让模型基于这些知识进行逻辑推理和任务执行。\n\n在大模型应用中，幻觉问题和知识滞后是常见痛点。RAGFlow 通过深度解析复杂文档结构（如表格、图表及混合排版），显著提升了信息检索的准确度，从而有效减少模型“胡编乱造”的现象，确保回答既有据可依又具备时效性。其内置的智能体机制更进一步，使系统不仅能回答问题，还能自主规划步骤解决复杂问题。\n\n这款工具特别适合开发者、企业技术团队以及 AI 研究人员使用。无论是希望快速搭建私有知识库问答系统，还是致力于探索大模型在垂直领域落地的创新者，都能从中受益。RAGFlow 提供了可视化的工作流编排界面和灵活的 API 接口，既降低了非算法背景用户的上手门槛，也满足了专业开发者对系统深度定制的需求。作为基于 Apache 2.0 协议开源的项目，它正成为连接通用大模型与行业专有知识之间的重要桥梁。",77062,"2026-04-04T04:44:48",[15,14,13,26,46],{"id":57,"name":58,"github_repo":59,"description_zh":60,"stars":61,"difficulty_score":10,"last_commit_at":62,"category_tags":63,"status":16},2181,"OpenHands","OpenHands\u002FOpenHands","OpenHands 是一个专注于 AI 驱动开发的开源平台，旨在让智能体（Agent）像人类开发者一样理解、编写和调试代码。它解决了传统编程中重复性劳动多、环境配置复杂以及人机协作效率低等痛点，通过自动化流程显著提升开发速度。\n\n无论是希望提升编码效率的软件工程师、探索智能体技术的研究人员，还是需要快速原型验证的技术团队，都能从中受益。OpenHands 提供了灵活多样的使用方式：既可以通过命令行（CLI）或本地图形界面在个人电脑上轻松上手，体验类似 Devin 的流畅交互；也能利用其强大的 Python SDK 自定义智能体逻辑，甚至在云端大规模部署上千个智能体并行工作。\n\n其核心技术亮点在于模块化的软件智能体 SDK，这不仅构成了平台的引擎，还支持高度可组合的开发模式。此外，OpenHands 在 SWE-bench 基准测试中取得了 77.6% 的优异成绩，证明了其解决真实世界软件工程问题的能力。平台还具备完善的企业级功能，支持与 Slack、Jira 等工具集成，并提供细粒度的权限管理，适合从个人开发者到大型企业的各类用户场景。",70612,"2026-04-05T11:12:22",[26,15,13,45],{"id":65,"github_repo":66,"name":67,"description_en":68,"description_zh":69,"ai_summary_zh":70,"readme_en":71,"readme_zh":72,"quickstart_zh":73,"use_case_zh":74,"hero_image_url":75,"owner_login":76,"owner_name":77,"owner_avatar_url":78,"owner_bio":79,"owner_company":80,"owner_location":80,"owner_email":80,"owner_twitter":81,"owner_website":82,"owner_url":83,"languages":84,"stars":97,"forks":98,"last_commit_at":99,"license":100,"difficulty_score":23,"env_os":101,"env_gpu":102,"env_ram":102,"env_deps":103,"category_tags":112,"github_topics":113,"view_count":23,"oss_zip_url":80,"oss_zip_packed_at":80,"status":16,"created_at":122,"updated_at":123,"faqs":124,"releases":159},2722,"refreshdotdev\u002Fweb-eval-agent","web-eval-agent","An MCP server that autonomously evaluates web applications. ","web-eval-agent 是一款专为开发者设计的自动化 Web 应用评估工具，旨在让 AI 编程助手具备“自我调试”能力。它作为一个 MCP 服务器运行，能够驱动浏览器自主执行测试任务，无需人工干预即可完成从登录到复杂业务流的全链路验证。\n\n该工具主要解决了开发过程中手动重复测试耗时费力、以及 AI 生成代码难以自动验证端到端功能的问题。通过集成 BrowserUse 技术，web-eval-agent 不仅能模拟用户操作导航网页，还能智能捕获网络请求、控制台错误并截取屏幕画面，最终生成详尽的用户体验报告。其独特亮点在于支持“状态预设”，允许用户预先在交互式浏览器中完成登录等操作，保存的凭证可被后续自动化任务复用，极大提升了测试效率。\n\nweb-eval-agent 特别适合全栈开发者、前端工程师以及使用 Cursor 等 AI 集成开发环境的团队使用。它能无缝嵌入开发工作流，当 AI 编写完代码后，可直接调用该工具进行即时验收，帮助开发者从繁琐的回归测试中解放出来，更专注于核心逻辑构建。需要注意的是，该项目目前已停止维护，团队正转向新的方向，但其设计理念仍为自动化测试提供了有价值","web-eval-agent 是一款专为开发者设计的自动化 Web 应用评估工具，旨在让 AI 编程助手具备“自我调试”能力。它作为一个 MCP 服务器运行，能够驱动浏览器自主执行测试任务，无需人工干预即可完成从登录到复杂业务流的全链路验证。\n\n该工具主要解决了开发过程中手动重复测试耗时费力、以及 AI 生成代码难以自动验证端到端功能的问题。通过集成 BrowserUse 技术，web-eval-agent 不仅能模拟用户操作导航网页，还能智能捕获网络请求、控制台错误并截取屏幕画面，最终生成详尽的用户体验报告。其独特亮点在于支持“状态预设”，允许用户预先在交互式浏览器中完成登录等操作，保存的凭证可被后续自动化任务复用，极大提升了测试效率。\n\nweb-eval-agent 特别适合全栈开发者、前端工程师以及使用 Cursor 等 AI 集成开发环境的团队使用。它能无缝嵌入开发工作流，当 AI 编写完代码后，可直接调用该工具进行即时验收，帮助开发者从繁琐的回归测试中解放出来，更专注于核心逻辑构建。需要注意的是，该项目目前已停止维护，团队正转向新的方向，但其设计理念仍为自动化测试提供了有价值的参考。","# ⚠️ PROJECT HAS BEEN SUNSET ⚠️\n\n## This project has been discontinued. We're building something new at [withrefresh.com](https:\u002F\u002Fwithrefresh.com)\n\n---\n\n# 🚀 operative.sh web-eval-agent MCP Server\n\n> *Let the coding agent debug itself, you've got better things to do.*\n\n![Demo](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Frefreshdotdev_web-eval-agent_readme_db3e7dbec2e2.gif)\n\n\n\n## 🔥 Supercharge Your Debugging\n\n[operative.sh](https:\u002F\u002Fwww.operative.sh\u002Fmcp)'s MCP Server launches a browser-use powered agent to autonomously execute and debug web apps directly in your code editor.\n\n## ⚡ Features\n\n- 🌐 **Navigate your webapp** using BrowserUse (2x faster with operative backend)\n- 📊 **Capture network traffic** - requests are intelligently filtered and returned into the context window\n- 🚨 **Collect console errors** - captures logs & errors\n- 🤖 **Autonomous debugging** - the Cursor agent calls the web QA agent mcp server to test if the code it wrote works as epected end-to-end.\n\n## 🧰 MCP Tool Reference\n\n| Tool | Purpose |\n|------|---------|\n| `web_eval_agent` | 🤖 Automated UX evaluator that drives the browser, captures screenshots, console & network logs, and returns a rich UX report. |\n| `setup_browser_state` | 🔒 Opens an interactive (non-headless) browser so you can sign in once; the saved cookies\u002Flocal-storage are reused by subsequent `web_eval_agent` runs. |\n\n**Key arguments**\n\n* `web_eval_agent`\n  * `url` **(required)** – address of the running app (e.g. `http:\u002F\u002Flocalhost:3000`)\n  * `task` **(required)** – natural-language description of what to test (\"run through the signup flow and note any UX issues\")\n  * `headless_browser` *(optional, default `false`)* – set to `true` to hide the browser window\n\n* `setup_browser_state`\n  * `url` *(optional)* – page to open first (handy to land directly on a login screen)\n\nYou can trigger these tools straight from your IDE chat, for example:\n\n```bash\nEvaluate my app at http:\u002F\u002Flocalhost:3000 – run web_eval_agent with the task \"Try the full signup flow and report UX issues\".\n```\n\n## 🏁 Quick Start\n\n### Easy Setup with One-Click Integration\n1. [Get your API key (free)](https:\u002F\u002Fwww.operative.sh\u002Fmcp) - when you create your API key, you'll see:\n   - **\"Add to Cursor\"** button with a deeplink for instant Cursor installation\n   - **Prefilled Claude Code command** with your API key automatically included\n\n### Manual Setup (macOS\u002FLinux)\n\n1. Pre-requisites (typically not needed):\n - brew: `\u002Fbin\u002Fbash -c \"$(curl -fsSL https:\u002F\u002Fraw.githubusercontent.com\u002FHomebrew\u002Finstall\u002FHEAD\u002Finstall.sh)\"`\n - npm: (`brew install npm`)\n - jq: `brew install jq` \n2. Run the installer after [getting an api key (free)](https:\u002F\u002Fwww.operative.sh\u002Fmcp)\n   - Installs [playwright](https:\u002F\u002Fgithub.com\u002Fmicrosoft\u002Fplaywright) \n   - [Installs uv](https:\u002F\u002Fastral.sh\u002F)\n   - Inserts JSON into your code editor (Cursor\u002FCline\u002FWindsurf) for you! \n```bash\ncurl -LSf https:\u002F\u002Foperative.sh\u002Finstall.sh -o install.sh && bash install.sh && rm install.sh\n```\n3. Visit your favorite IDE and restart to apply the changes\n4. Send a prompt in chat mode to call the web eval agent tool! e.g. \n```bash\nTest my app on http:\u002F\u002Flocalhost:3000. Use web-eval-agent.\n```\n\n## 🛠️ Manual Installation\n1. Get your API key at operative.sh\u002Fmcp\n2. [Install uv](https:\u002F\u002Fdocs.astral.sh\u002Fuv\u002F#highlights)\n\n```bash\ncurl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh\n```\n\n3. Source environment variables after installing UV\n\nMac\n```\nsource ~\u002F.zshrc\n```\n\nLinux \n```\nsource ~\u002F.bashrc \n```\n4. Install playwright:\n\n```bash\nnpm install -g chromium playwright && uvx --with playwright playwright install --with-deps\n```\n5. Add below JSON to your relevant code editor with api key \n6. Restart your code editor\n   \n## 🔃 Updating \n- `uv cache clean`\n- refresh MCP server \n\n```json \n    \"web-eval-agent\": {\n      \"command\": \"uvx\",\n      \"args\": [\n        \"--refresh-package\",\n        \"webEvalAgent\",\n        \"--from\",\n        \"git+https:\u002F\u002Fgithub.com\u002FOperative-Sh\u002Fweb-eval-agent.git\",\n        \"webEvalAgent\"\n      ],\n      \"env\": {\n        \"OPERATIVE_API_KEY\": \"\u003CYOUR_KEY>\"\n      }\n    }\n```\n## [Operative Discord Server](https:\u002F\u002Fdiscord.gg\u002FryjCnf9myb)\n\n## 🛠️ Manual Installation (Mac + Cursor\u002FCline\u002FWindsurf) \n1. Get your API key at operative.sh\u002Fmcp\n2. [Install uv](https:\u002F\u002Fdocs.astral.sh\u002Fuv\u002F#highlights)\n```bash\ncurl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh)\n```\n3. Install playwright:\n```bash\nnpm install -g chromium playwright && uvx --with playwright playwright install --with-deps\n```\n4. Add below JSON to your relevant code editor with api key \n5. Restart your code editor\n\n## Manual Installation (Windows + Cursor\u002FCline\u002FWindsurf)  \n\nWe're refining this, please open an issue if you have any issues! \n1. Do all this in your code editor terminal \n2. `curl -LSf https:\u002F\u002Foperative.sh\u002Finstall.sh -o install.sh && bash install.sh && rm install.sh`\n3. Get your API key at operative.sh\u002Fmcp\n4. Install uv `(curl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh)`\n5. `uvx --from git+https:\u002F\u002Fgithub.com\u002FOperative-Sh\u002Fweb-eval-agent.git playwright install`\n6. Restart code editor \n\n\n## 🚨 Issues \n- Updates aren't being received in code editors, update or reinstall for latest version: Run `uv cache clean` for latest \n- Any issues feel free to open an Issue on this repo or in the discord!\n- 5\u002F5 - static apps without changes weren't screencasting, fixed! `uv clean` + restart to get fix\n\n## Changelog \n- 4\u002F29 - Agent overlay update - pause\u002Fplay\u002Fstop agent run in the browser\n\n## 📋 Example MCP Server Output Report\n\n```text\n📊 Web Evaluation Report for http:\u002F\u002Flocalhost:5173 complete!\n📝 Task: Test the API-key deletion flow by navigating to the API Keys section, deleting a key, and judging the UX.\n\n🔍 Agent Steps\n  📍 1. Navigate → http:\u002F\u002Flocalhost:5173\n  📍 2. Click     \"Login\"        (button index 2)\n  📍 3. Click     \"API Keys\"     (button index 4)\n  📍 4. Click     \"Create Key\"   (button index 9)\n  📍 5. Type      \"Test API Key\" (input index 2)\n  📍 6. Click     \"Done\"         (button index 3)\n  📍 7. Click     \"Delete\"       (button index 10)\n  📍 8. Click     \"Delete\"       (confirm index 3)\n🏁 Flow tested successfully – UX felt smooth and intuitive.\n\n🖥️ Console Logs (10)\n  1. [debug] [vite] connecting…\n  2. [debug] [vite] connected.\n  3. [info]  Download the React DevTools …\n     …\n\n🌐 Network Requests (10)\n  1. GET \u002Fsrc\u002Fpages\u002FSleepingMasks.tsx                   304\n  2. GET \u002Fsrc\u002Fpages\u002FMCPRegistryRegistry.tsx             304\n     …\n\n⏱️ Chronological Timeline\n  01:16:23.293 🖥️ Console [debug] [vite] connecting…\n  01:16:23.303 🖥️ Console [debug] [vite] connected.\n  01:16:23.312 ➡️ GET \u002Fsrc\u002Fpages\u002FSleepingMasks.tsx\n  01:16:23.318 ⬅️ 304 \u002Fsrc\u002Fpages\u002FSleepingMasks.tsx\n     …\n  01:17:45.038 🤖 🏁 Flow finished – deletion verified\n  01:17:47.038 🤖 📋 Conclusion repeated above\n👁️  See the \"Operative Control Center\" dashboard for live logs.\n```\n\n## Star History\n\n[![Star History Chart](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Frefreshdotdev_web-eval-agent_readme_e85cbbbd34b2.png)](https:\u002F\u002Fwww.star-history.com\u002F#Operative-Sh\u002Fweb-eval-agent&Date)\n\n\n---\n\nBuilt with \u003C3 @ [operative.sh](https:\u002F\u002Fwww.operative.sh)\n","# ⚠️ 项目已停止维护 ⚠️\n\n## 本项目已不再更新。我们正在 [withrefresh.com](https:\u002F\u002Fwithrefresh.com) 构建全新产品。\n\n---\n\n# 🚀 operative.sh web-eval-agent MCP 服务器\n\n> *让编码助手自动调试自身，把时间留给更重要的事情。*\n\n![演示](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Frefreshdotdev_web-eval-agent_readme_db3e7dbec2e2.gif)\n\n\n\n## 🔥 强化你的调试能力\n\noperative.sh（[www.operative.sh\u002Fmcp](https:\u002F\u002Fwww.operative.sh\u002Fmcp)）的 MCP 服务器会启动一个基于 BrowserUse 的代理，直接在你的代码编辑器中自主执行并调试 Web 应用程序。\n\n## ⚡ 功能特性\n\n- 🌐 **导航你的 Web 应用**：使用 BrowserUse（结合 operative 后端可提速 2 倍）\n- 📊 **捕获网络流量**：智能过滤请求，并将其返回到上下文窗口\n- 🚨 **收集控制台错误**：捕捉日志与错误信息\n- 🤖 **自主调试**：Cursor 代理会调用 web QA 代理的 MCP 服务器，以端到端方式测试其编写的代码是否按预期工作。\n\n## 🧰 MCP 工具参考\n\n| 工具 | 用途 |\n|------|------|\n| `web_eval_agent` | 🤖 自动化 UX 评估器，驱动浏览器、捕获截图、控制台与网络日志，并生成丰富的 UX 报告。 |\n| `setup_browser_state` | 🔒 打开一个交互式（非无头）浏览器，方便你只需登录一次；后续的 `web_eval_agent` 运行将复用保存的 Cookie 和本地存储数据。 |\n\n**关键参数**\n\n* `web_eval_agent`\n  * `url` **(必填)** – 正在运行的应用地址（例如 `http:\u002F\u002Flocalhost:3000`）\n  * `task` **(必填)** – 用自然语言描述要测试的内容（“走一遍注册流程，并记录任何 UX 问题”）\n  * `headless_browser` *(可选，默认为 `false`)* – 设置为 `true` 可隐藏浏览器窗口\n\n* `setup_browser_state`\n  * `url` *(可选)* – 首先打开的页面（便于直接跳转到登录界面）\n\n你可以直接从 IDE 的聊天窗口触发这些工具，例如：\n\n```bash\n在我的应用 http:\u002F\u002Flocalhost:3000 上进行评估——使用 web_eval_agent 完成任务：“尝试完整的注册流程，并报告 UX 问题。”\n```\n\n## 🏁 快速入门\n\n### 一键集成，轻松设置\n1. [获取你的 API 密钥（免费）](https:\u002F\u002Fwww.operative.sh\u002Fmcp) - 创建 API 密钥时，你会看到：\n   - **“添加到 Cursor”** 按钮，附带即时安装 Cursor 的深层链接\n   - **预填充的 Claude Code 命令**，其中已自动包含你的 API 密钥\n\n### 手动设置（macOS\u002FLinux）\n\n1. 先决条件（通常无需操作）：\n   - brew：`\u002Fbin\u002Fbash -c \"$(curl -fsSL https:\u002F\u002Fraw.githubusercontent.com\u002FHomebrew\u002Finstall\u002FHEAD\u002Finstall.sh)\"`\n   - npm：`brew install npm`\n   - jq：`brew install jq` \n2. 在 [获取 API 密钥（免费）](https:\u002F\u002Fwww.operative.sh\u002Fmcp) 后运行安装脚本：\n   - 安装 [playwright](https:\u002F\u002Fgithub.com\u002Fmicrosoft\u002Fplaywright)\n   - [安装 uv](https:\u002F\u002Fastral.sh\u002F)\n   - 自动为你将 JSON 插入到代码编辑器（Cursor\u002FCline\u002FWindsurf）！\n```bash\ncurl -LSf https:\u002F\u002Foperative.sh\u002Finstall.sh -o install.sh && bash install.sh && rm install.sh\n```\n3. 打开你喜欢的 IDE 并重启以使更改生效\n4. 在聊天模式下发送提示，调用 web eval agent 工具！例如：\n```bash\n测试我的应用 http:\u002F\u002Flocalhost:3000。使用 web-eval-agent。\n```\n\n## 🛠️ 手动安装\n1. 在 operative.sh\u002Fmcp 获取你的 API 密钥\n2. [安装 uv](https:\u002F\u002Fdocs.astral.sh\u002Fuv\u002F#highlights)\n\n```bash\ncurl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh\n```\n\n3. 安装 UV 后加载环境变量：\n\nMac\n```\nsource ~\u002F.zshrc\n```\n\nLinux \n```\nsource ~\u002F.bashrc \n```\n4. 安装 playwright：\n\n```bash\nnpm install -g chromium playwright && uvx --with playwright playwright install --with-deps\n```\n5. 将以下 JSON 添加到相应的代码编辑器中，并填入你的 API 密钥\n6. 重启你的代码编辑器\n\n## 🔃 更新\n- `uv cache clean`\n- 刷新 MCP 服务器\n\n```json \n    \"web-eval-agent\": {\n      \"command\": \"uvx\",\n      \"args\": [\n        \"--refresh-package\",\n        \"webEvalAgent\",\n        \"--from\",\n        \"git+https:\u002F\u002Fgithub.com\u002FOperative-Sh\u002Fweb-eval-agent.git\",\n        \"webEvalAgent\"\n      ],\n      \"env\": {\n        \"OPERATIVE_API_KEY\": \"\u003CYOUR_KEY>\"\n      }\n    }\n```\n## [Operative Discord 服务器](https:\u002F\u002Fdiscord.gg\u002FryjCnf9myb)\n\n## 🛠️ 手动安装（Mac + Cursor\u002FCline\u002FWindsurf） \n1. 在 operative.sh\u002Fmcp 获取你的 API 密钥\n2. [安装 uv](https:\u002F\u002Fdocs.astral.sh\u002Fuv\u002F#highlights)\n```bash\ncurl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh)\n```\n3. 安装 playwright：\n```bash\nnpm install -g chromium playwright && uvx --with playwright playwright install --with-deps\n```\n4. 将以下 JSON 添加到相应的代码编辑器中，并填入你的 API 密钥\n5. 重启你的代码编辑器\n\n## 手动安装（Windows + Cursor\u002FCline\u002FWindsurf）  \n\n我们仍在优化此流程，如有任何问题，请提交 Issue！ \n1. 在代码编辑器的终端中完成以下操作\n2. `curl -LSf https:\u002F\u002Foperative.sh\u002Finstall.sh -o install.sh && bash install.sh && rm install.sh`\n3. 在 operative.sh\u002Fmcp 获取你的 API 密钥\n4. 安装 uv `(curl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh)`\n5. `uvx --from git+https:\u002F\u002Fgithub.com\u002FOperative-Sh\u002Fweb-eval-agent.git playwright install`\n6. 重启代码编辑器\n\n\n## 🚨 问题\n- 如果代码编辑器未接收到更新，请更新或重新安装以获取最新版本：运行 `uv cache clean` 即可获得最新修复\n- 如有任何问题，请随时在此仓库或 Discord 中提交 Issue！\n- 5\u002F5 — 对于没有变化的静态应用无法录屏的问题已修复！执行 `uv clean` 并重启即可解决\n\n## 更改记录\n- 4月29日：代理叠加层更新——可在浏览器中暂停、播放或停止代理运行\n\n## 📋 MCP 服务器输出报告示例\n\n```text\n📊 为 http:\u002F\u002Flocalhost:5173 完成的 Web 评估报告！\n📝 任务：测试 API 密钥删除流程，方法是导航至 API 密钥部分，删除一个密钥，并评估用户体验。\n\n🔍 代理步骤\n  📍 1. 导航 → http:\u002F\u002Flocalhost:5173\n  📍 2. 点击     “登录”        （按钮索引 2）\n  📍 3. 点击     “API 密钥”     （按钮索引 4）\n  📍 4. 点击     “创建密钥”   （按钮索引 9）\n  📍 5. 输入      “测试 API 密钥” （输入索引 2）\n  📍 6. 点击     “完成”         （按钮索引 3）\n  📍 7. 点击     “删除”       （按钮索引 10）\n  📍 8. 点击     “删除”       （确认索引 3）\n🏁 流程测试成功——用户体验流畅且直观。\n\n🖥️ 控制台日志（10 条）\n  1. [debug] [vite] 连接中…\n  2. [debug] [vite] 已连接。\n  3. [info] 下载 React DevTools …\n     …\n\n🌐 网络请求（10 条）\n  1. GET \u002Fsrc\u002Fpages\u002FSleepingMasks.tsx                   304\n  2. GET \u002Fsrc\u002Fpages\u002FMCPRegistryRegistry.tsx             304\n     …\n\n⏱️ 时间线\n  01:16:23.293 🖥️ 控制台 [debug] [vite] 连接中…\n  01:16:23.303 🖥️ 控制台 [debug] [vite] 已连接。\n  01:16:23.312 ➡️ GET \u002Fsrc\u002Fpages\u002FSleepingMasks.tsx\n  01:16:23.318 ⬅️ 304 \u002Fsrc\u002Fpages\u002FSleepingMasks.tsx\n     …\n  01:17:45.038 🤖 🏁 流程结束——删除成功已验证\n  01:17:47.038 🤖 📋 上述结论再次确认\n👁️ 请查看“Operative 控制中心”仪表板，以获取实时日志。\n```\n\n## 星标历史\n\n[![星标历史图](https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Frefreshdotdev_web-eval-agent_readme_e85cbbbd34b2.png)](https:\u002F\u002Fwww.star-history.com\u002F#Operative-Sh\u002Fweb-eval-agent&Date)\n\n\n---\n\n由 \u003C3 @ [operative.sh](https:\u002F\u002Fwww.operative.sh) 构建","# web-eval-agent 快速上手指南\n\n> **⚠️ 重要提示**：该项目已停止维护（Sunset）。开发团队正在构建新项目，请访问 [withrefresh.com](https:\u002F\u002Fwithrefresh.com) 获取最新信息。以下指南仅适用于历史版本参考。\n\n`web-eval-agent` 是一个基于 MCP (Model Context Protocol) 的服务器工具，它能启动一个自主代理，在浏览器中自动执行和调试 Web 应用，并将截图、控制台日志及网络请求反馈给您的代码编辑器（如 Cursor）。\n\n## 环境准备\n\n在开始之前，请确保您的系统满足以下要求：\n\n*   **操作系统**：macOS, Linux 或 Windows\n*   **包管理器**：\n    *   macOS\u002FLinux: 推荐安装 `Homebrew` (`\u002Fbin\u002Fbash -c \"$(curl -fsSL https:\u002F\u002Fraw.githubusercontent.com\u002FHomebrew\u002Finstall\u002FHEAD\u002Finstall.sh)\"`)\n    *   需具备 `npm` 和 `jq` (macOS: `brew install npm jq`)\n*   **核心依赖**：\n    *   **uv**: Python 包管理工具\n    *   **Playwright**: 用于浏览器自动化\n*   **API Key**: 需要在 [operative.sh\u002Fmcp](https:\u002F\u002Fwww.operative.sh\u002Fmcp) 免费注册获取。\n\n## 安装步骤\n\n### 方法一：一键安装（推荐 macOS\u002FLinux）\n\n这是最简便的方式，脚本会自动处理依赖并配置编辑器。\n\n1.  访问 [operative.sh\u002Fmcp](https:\u002F\u002Fwww.operative.sh\u002Fmcp) 获取免费的 API Key。\n2.  在终端运行以下安装命令：\n\n```bash\ncurl -LSf https:\u002F\u002Foperative.sh\u002Finstall.sh -o install.sh && bash install.sh && rm install.sh\n```\n\n3.  安装完成后，**重启您的代码编辑器**（如 Cursor, Cline, Windsurf）以加载 MCP 配置。\n\n### 方法二：手动安装\n\n如果一键脚本无法运行，可按以下步骤手动配置：\n\n1.  **安装 uv**：\n    ```bash\n    curl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh\n    ```\n    *安装后请根据系统刷新环境变量：*\n    *   macOS: `source ~\u002F.zshrc`\n    *   Linux: `source ~\u002F.bashrc`\n\n2.  **安装 Playwright 及其浏览器依赖**：\n    ```bash\n    npm install -g chromium playwright && uvx --with playwright playwright install --with-deps\n    ```\n\n3.  **配置编辑器**：\n    在您的编辑器 MCP 配置文件（通常在 `settings.json` 或专门的 MCP 配置区）中添加以下 JSON，并将 `\u003CYOUR_KEY>` 替换为您获取的 API Key：\n\n    ```json\n    \"web-eval-agent\": {\n      \"command\": \"uvx\",\n      \"args\": [\n        \"--refresh-package\",\n        \"webEvalAgent\",\n        \"--from\",\n        \"git+https:\u002F\u002Fgithub.com\u002FOperative-Sh\u002Fweb-eval-agent.git\",\n        \"webEvalAgent\"\n      ],\n      \"env\": {\n        \"OPERATIVE_API_KEY\": \"\u003CYOUR_KEY>\"\n      }\n    }\n    ```\n\n4.  **重启代码编辑器**。\n\n## 基本使用\n\n安装并重启编辑器后，您可以在编辑器的聊天窗口（Chat Mode）中直接调用该工具。\n\n### 核心功能工具\n\n*   `web_eval_agent`: 自动驱动浏览器执行任务，捕获截图、日志和网络流量，生成用户体验报告。\n*   `setup_browser_state`: 打开非无头模式浏览器，用于手动登录一次，保存 Cookie\u002FLocal Storage 供后续自动测试复用。\n\n### 使用示例\n\n在编辑器聊天框中输入以下指令（将 URL 和任务描述替换为您的实际需求）：\n\n**示例 1：测试注册流程**\n```text\nEvaluate my app at http:\u002F\u002Flocalhost:3000 – run web_eval_agent with the task \"Try the full signup flow and report UX issues\".\n```\n\n**示例 2：测试特定功能（如删除 API Key）**\n```text\nTest my app on http:\u002F\u002Flocalhost:5173. Use web-eval-agent to navigate to the API Keys section, delete a key, and judge the UX.\n```\n\n**示例 3：先设置登录状态再测试**\n如果需要登录才能测试，先调用：\n```text\nRun setup_browser_state with url \"http:\u002F\u002Flocalhost:3000\u002Flogin\" so I can sign in manually.\n```\n登录成功后，再运行正常的 `web_eval_agent` 任务，它会自动复用刚才的登录状态。\n\n### 输出结果\n\n工具执行完毕后，会在聊天窗口返回一份详细的报告，包含：\n*   **Agent 步骤**：代理执行的具体操作序列（点击、输入等）。\n*   **控制台日志**：捕获的浏览器 Console 报错和信息。\n*   **网络请求**：关键的网络请求列表及状态码。\n*   **时间线**：按时间顺序排列的操作与日志对照。","前端工程师小李正在开发一个复杂的电商注册流程，需要在每次代码提交后验证从填写表单到最终跳转的完整用户体验。\n\n### 没有 web-eval-agent 时\n- **人工回归耗时**：每次修改代码后，必须手动打开浏览器，重复点击数十次以跑通整个注册流程，严重打断编码心流。\n- **隐蔽错误难发现**：容易忽略控制台中的异步报错或网络请求失败，导致线上出现“白屏”或数据未提交等隐患。\n- **环境状态重置繁琐**：测试需要登录态时，每次都要重新扫码或输入账号密码，无法快速复用之前的会话状态。\n- **反馈滞后**：往往等到测试人员介入或用户投诉时，才发现关键路径上的交互逻辑存在断裂。\n\n### 使用 web-eval-agent 后\n- **自主闭环测试**：只需在 IDE 中输入“运行注册流程并报告体验问题”，web-eval-agent 即可自动驱动浏览器完成全链路操作并生成报告。\n- **全维度监控**：工具自动捕获并过滤网络流量与控制台日志，将隐藏的 JS 错误和失败的 API 请求直接呈现在上下文中。\n- **会话状态复用**：通过 `setup_browser_state` 一次性登录保存凭证，后续所有自动化评估均自动携带登录态，无需重复认证。\n- **实时自我修正**：Cursor 代理可调用 web-eval-agent 即时验证刚写的代码是否按预期工作，实现“写完即测，出错即改”。\n\nweb-eval-agent 将原本耗时数小时的手工回归测试压缩为分钟级的自动化对话，让开发者能专注于核心逻辑而非重复劳动。","https:\u002F\u002Foss.gittoolsai.com\u002Fimages\u002Frefreshdotdev_web-eval-agent_a4f8cd59.png","refreshdotdev","Refresh","https:\u002F\u002Foss.gittoolsai.com\u002Favatars\u002Frefreshdotdev_fc1ec8d0.jpg","Data to augment the future of work ",null,"refresh_dev","https:\u002F\u002Fwww.refresh.dev","https:\u002F\u002Fgithub.com\u002Frefreshdotdev",[85,89,93],{"name":86,"color":87,"percentage":88},"Python","#3572A5",68.4,{"name":90,"color":91,"percentage":92},"HTML","#e34c26",19.3,{"name":94,"color":95,"percentage":96},"JavaScript","#f1e05a",12.3,1236,105,"2026-03-30T16:31:02","Apache-2.0","macOS, Linux, Windows","未说明",{"notes":104,"python":105,"dependencies":106},"该项目已停止维护 (SUNSET)。安装主要依赖 'uv' (Python 包管理器) 和 'Playwright' (浏览器自动化工具，需额外安装 Chromium 及系统依赖)。需要获取 Operative API Key 才能运行。支持在 Cursor、Cline、Windsurf 等编辑器中作为 MCP Server 使用。Windows 支持尚在完善中。","未说明 (通过 uv 管理)",[107,108,109,110,111],"uv","playwright","chromium","jq","npm",[15,45],[114,115,116,117,118,119,120,108,121],"debugging","debugging-tool","mcp","mcp-server","modelcontextprotocol","vibe-coding","vibe-testing","qa","2026-03-27T02:49:30.150509","2026-04-06T06:54:43.049124",[125,130,135,140,145,150,155],{"id":126,"question_zh":127,"answer_zh":128,"source_url":129},12606,"如何在 Windows 上手动安装 web-eval-agent？","Windows 用户建议在代码编辑器的终端中执行以下步骤进行手动安装：\n1. 运行安装脚本：`curl -LSf https:\u002F\u002Foperative.sh\u002Finstall.sh -o install.sh && bash install.sh && rm install.sh`\n2. 在 operative.sh 获取 API Key。\n3. 安装 uv 工具：`(curl -LsSf https:\u002F\u002Fastral.sh\u002Fuv\u002Finstall.sh | sh)`\n4. 安装 Playwright 依赖：`uvx --from git+https:\u002F\u002Fgithub.com\u002FOperative-Sh\u002Fweb-eval-agent.git playwright install`\n5. 重启代码编辑器。\n注意：如果遇到问题，请确保在与 Cursor 相同的 Shell（如 WSL、PowerShell 或编辑器自带终端）中运行安装命令。","https:\u002F\u002Fgithub.com\u002Frefreshdotdev\u002Fweb-eval-agent\u002Fissues\u002F24",{"id":131,"question_zh":132,"answer_zh":133,"source_url":134},12607,"遇到 'Invalid API key' (403 错误) 或浏览器启动后立即关闭怎么办？","这通常是因为依赖项未完全下载导致的。请按以下步骤解决：\n1. 在命令行 (CLI) 中手动运行以下命令以预下载依赖（此时不需要 API Key）：\n   `uvx --from git+https:\u002F\u002Fgithub.com\u002FOperative-Sh\u002Fweb-eval-agent.git webEvalAgent`\n2. 等待命令执行完成。\n3. 重启 Cursor 编辑器。\n4. 如果问题依旧，尝试生成一个新的 API Key 并重试。","https:\u002F\u002Fgithub.com\u002Frefreshdotdev\u002Fweb-eval-agent\u002Fissues\u002F2",{"id":136,"question_zh":137,"answer_zh":138,"source_url":139},12608,"如何正确使用 web-eval-agent？提示语应该是什么？","在 Cline 或 Cursor 的聊天模式中，直接发送提示语调用工具即可（例如描述你要评估的任务）。如果工具超时失败并建议手动分析，可能是因为环境未配置正确。请确保已完成手动安装步骤（特别是 Playwright 的安装），并在编辑器终端刷新后重试。如果看到两个浏览器窗口（一个等待日志，一个运行应用），这是正常现象；若无法看到输出日志，请检查是否已在正确的 Shell 中安装了依赖。","https:\u002F\u002Fgithub.com\u002Frefreshdotdev\u002Fweb-eval-agent\u002Fissues\u002F12",{"id":141,"question_zh":142,"answer_zh":143,"source_url":144},12609,"运行时出现 'Unexpected token ... is not valid JSON' 错误怎么办？","这是一个已知问题，通常不会阻碍产品的正常使用。该错误信息（如 `INFO [...]` 被误解析为 JSON）往往是由于日志输出混入了标准输出流导致的。只要代理能够继续执行任务并返回结果，可以忽略此错误。如果导致功能完全不可用，请尝试重启 MCP 服务器或代码编辑器。","https:\u002F\u002Fgithub.com\u002Frefreshdotdev\u002Fweb-eval-agent\u002Fissues\u002F22",{"id":146,"question_zh":147,"answer_zh":148,"source_url":149},12610,"付费订阅后额度未更新（例如购买了 10,000 次请求但只显示 2,000）怎么办？","这是一个已知的显示延迟或同步问题。维护者通常会手动修复此类账户的额度。如果您遇到此情况，请稍作等待，系统会自动或通过后台操作将您的额度更新至正确的数值（如 10,000 credits）。如果长时间未恢复，建议联系支持团队。","https:\u002F\u002Fgithub.com\u002Frefreshdotdev\u002Fweb-eval-agent\u002Fissues\u002F13",{"id":151,"question_zh":152,"answer_zh":153,"source_url":154},12611,"该工具是否支持需要与外部插件（如 MetaMask）交互的 Web3 项目？","目前对于需要与浏览器扩展（如 MetaMask 钱包）进行复杂交互的场景支持尚不完善。如果您有此类特定需求，建议直接通过邮件 (erik@operative.sh) 联系开发团队讨论可行性或获取定制支持。","https:\u002F\u002Fgithub.com\u002Frefreshdotdev\u002Fweb-eval-agent\u002Fissues\u002F11",{"id":156,"question_zh":157,"answer_zh":158,"source_url":129},12612,"在 Windows 上安装时提示 'executable named playwright is not provided' 怎么办？","这是因为执行环境或路径问题。请尝试显式指定从 `playwright` 包运行安装命令：\n`uvx --from playwright playwright install`\n或者确保您是在安装了 `web-eval-agent` 依赖的同一虚拟环境或上下文中运行该命令。如果在 Windows 上仍然失败，请切换不同的 Shell（WSL、PowerShell 或 CMD）重试，并确保在该 Shell 中重新运行完整的安装流程。",[]]