-
@baidu and @PaddlePaddle
- Shenzhen, China
- https://sijunhe.github.io/
- @SijunHe
Stars
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.
Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
aider is AI pair programming in your terminal
Coding problems used in aider's polyglot benchmark
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
A framework for the evaluation of autoregressive code generation language models.
Model Context Protocol Servers
Your AI Operator for Web, Android, Automation & Testing.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumption
Beautiful & consistent icon toolkit made by the community. Open-source project and a fork of Feather Icons.
Open source Claude Artifacts – built with Llama 3.1 405B
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Prompt, run, edit, and deploy full-stack web applications. -- bolt.new -- Help Center: https://support.bolt.new/ -- Community Support: https://discord.com/invite/stackblitz
WebLINX is a benchmark for building web navigation agents with conversational capabilities
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
Large Action Model framework to develop AI Web Agents
👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services.