samjia2000

Follow

samjia2000

Follow

37 followers · 15 following

Achievements

Achievements

Stars

algorithmicsuperintelligence / openevolve

Open-source implementation of AlphaEvolve

Python 4,659 701 Updated Nov 27, 2025

shyamsaktawat / OpenAlpha_Evolve

OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's AlphaEvolve.

Python 949 146 Updated May 31, 2025

openai / mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 1,197 184 Updated Nov 27, 2025

HKUDS / DeepCode

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

Python 10,886 1,489 Updated Nov 20, 2025

HKUDS / AI-Researcher

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,643 426 Updated Oct 16, 2025

xlite-dev / Awesome-LLM-Inference

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,762 324 Updated Nov 28, 2025

laude-institute / terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python 1,137 404 Updated Nov 30, 2025

bytedance / SandboxFusion

Python 773 67 Updated Jun 26, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 503 31 Updated Nov 26, 2025

Ayanami0730 / deep_research_bench

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 490 58 Updated Nov 22, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,386 1,332 Updated Nov 20, 2025

samjia2000 / Optimal-Reasoning-Efficiency

3 Updated Jun 10, 2025

LINs-lab / DeFT

[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference

Jupyter Notebook 45 2 Updated Jun 17, 2025

Parallel-Reasoning / APR

[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models

Python 133 11 Updated Aug 15, 2025

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,097 238 Updated Nov 29, 2025

mit-han-lab / duo-attention

[ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Python 507 36 Updated Feb 10, 2025

NovaSky-AI / SkyThought

Sky-T1: Train your own O1 preview model within $450

Python 3,356 342 Updated Jul 12, 2025

tencent-ailab / persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,405 116 Updated Feb 19, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,852 2,677 Updated Jul 3, 2025

yanshengjia / ml-road

Machine Learning and Agentic AI Resources, Practice and Research

Python 4,532 1,655 Updated Nov 2, 2025

web-arena-x / visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Python 409 67 Updated Nov 9, 2024

web-arena-x / webarena

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 1,236 199 Updated Nov 26, 2025

THUDM / VisualAgentBench

Towards Large Multimodal Models as Visual Foundation Agents

Python 244 9 Updated Apr 24, 2025

THUDM / WebRL

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 479 31 Updated Jun 6, 2025

agi-templar / Stable-Alignment

Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".

Python 354 18 Updated Jun 18, 2023

SalesforceAIResearch / xLAM

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Python 584 48 Updated Aug 21, 2025

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,048 3,021 Updated Aug 15, 2024

openai / prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 2,073 122 Updated Jun 1, 2023

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,854 371 Updated Oct 17, 2025

OpenHands / OpenHands

🙌 OpenHands: Code Less, Make More

Python 65,297 7,977 Updated Nov 30, 2025