swe-bench · GitHub Topics

AI Agent that handles engineering tasks end-to-end: integrates with developers’ tools, plans, executes, and iterates until it achieves a successful result.

fine-tuning self-hosted developer-tools Open Source rag Visual Studio Code ai-agent enterprise on-prem swe-bench

Rust 3.28 k

1 month ago

JARVIS-Xs / SE-Agent

#Large Language Models# SE-Agent is a self-evolution framework that enables information exchange between reasoning paths through a trajectory-level evolution mechanism, breaking the cognitive limitations of single trajectori...

agent claude-code code-generation large-language-model mcts swe-bench

Python 154

14 days ago

logic-star-ai / insights

We track and analyze the activity and performance of autonomous code agents in the wild.

agents swe-bench

TypeScript 38

2 months ago

RanjanaRaghavan / swe-bench-evaluation

This project explores how Large Language Models (LLMs) perform on real-world software engineering tasks, inspired by the SWE-Bench benchmark. Using locally hosted models like Llama 3 via Ollama, the t...

generative-ai swe-bench

TeX 0

7 months ago