-
the University of Edinburgh
- https://xinhuajian.wordpress.com/
- https://scholar.google.com/citations?hl=en&user=E5M9x8wAAAAJ
Stars
Claude Code and Codex API Routing Proxy
Toolkit for measuring Claude Code and Codex performance over time against a baseline using SWEbench-lite dataset **No API key required for Max or Pro subscribers**
Democratizing Reinforcement Learning for LLMs
All Cursor AI's official download links for both the latest and older versions, making it easy for you to update, downgrade, and choose any version. 🚀
Litex is a simple formal language Learnable in 2 hours.
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API
SGLang is a fast serving framework for large language models and vision language models.
🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
Notebooks for the O'Reilly book "Learning Ray"
A repo lists papers related to LLM based agent
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
📃 A better UX for chat, writing content, and coding with LLMs.
aider is AI pair programming in your terminal
Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling
SWE-bench: Can Language Models Resolve Real-world Github Issues?
Python client to interact with the lean4 language server.
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
CP4 Free Source Code Project (C++17, Java11, Python3 and OCaml)
32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.