Skip to content
View samjia2000's full-sized avatar

Block or report samjia2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-source implementation of AlphaEvolve

Python 4,659 701 Updated Nov 27, 2025

OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's AlphaEvolve.

Python 949 146 Updated May 31, 2025

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 1,197 184 Updated Nov 27, 2025

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

Python 10,886 1,489 Updated Nov 20, 2025

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,643 426 Updated Oct 16, 2025

๐Ÿ“šA curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.๐ŸŽ‰

Python 4,762 324 Updated Nov 28, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,137 404 Updated Nov 30, 2025
Python 773 67 Updated Jun 26, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 503 31 Updated Nov 26, 2025

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 490 58 Updated Nov 22, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,386 1,332 Updated Nov 20, 2025

[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference

Jupyter Notebook 45 2 Updated Jun 17, 2025

[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models

Python 133 11 Updated Aug 15, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,097 238 Updated Nov 29, 2025

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 507 36 Updated Feb 10, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,356 342 Updated Jul 12, 2025

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,405 116 Updated Feb 19, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,852 2,677 Updated Jul 3, 2025

Machine Learning and Agentic AI Resources, Practice and Research

Python 4,532 1,655 Updated Nov 2, 2025

VisualWebArena is a benchmark for multimodal agents.

Python 409 67 Updated Nov 9, 2024

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,236 199 Updated Nov 26, 2025

Towards Large Multimodal Models as Visual Foundation Agents

Python 244 9 Updated Apr 24, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 479 31 Updated Jun 6, 2025

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Python 354 18 Updated Jun 18, 2023

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Python 584 48 Updated Aug 21, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,048 3,021 Updated Aug 15, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,073 122 Updated Jun 1, 2023

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 ๐Ÿ“ and reasoning techniques.

6,854 371 Updated Oct 17, 2025

๐Ÿ™Œ OpenHands: Code Less, Make More

Python 65,297 7,977 Updated Nov 30, 2025
Next