GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

swe-bench

Website
Wikipedia
https://static.github-zh.com/github_avatars/smallcloudai?size=40
smallcloudai / refact

AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.

fine-tuning自托管developer-toolsOpen SourceragVisual Studio Codeai-agententerpriseon-premswe-bench
Rust 3.28 k
1 个月前
https://static.github-zh.com/github_avatars/JARVIS-Xs?size=40
JARVIS-Xs / SE-Agent

#大语言模型#SE-Agent is a self-evolution framework that enables information exchange between reasoning paths through a trajectory-level evolution mechanism, breaking the cognitive limitations of single trajectori...

agentclaude-codecode-generation大语言模型mctsswe-bench
Python 154
14 天前
https://static.github-zh.com/github_avatars/logic-star-ai?size=40
logic-star-ai / insights

We track and analyze the activity and performance of autonomous code agents in the wild

agentsswe-bench
TypeScript 38
2 个月前
https://static.github-zh.com/github_avatars/RanjanaRaghavan?size=40
RanjanaRaghavan / swe-bench-evaluation

This project explores how Large Language Models (LLMs) perform on real-world software engineering tasks, inspired by the SWE-Bench benchmark. Using locally hosted models like Llama 3 via Ollama, the t...

generative-aiswe-bench
TeX 0
7 个月前