user074

Follow

Jianing Qi user074

Follow

2 followers · 5 following

CUNY Grad Center
NYC

Achievements

Achievements

Highlights

Pro

Lists (1)

Sort

✨ Inspiration

Stars

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 515 53 Updated Nov 12, 2025

spiral-rl / spiral

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Python 161 17 Updated Sep 18, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,106 1,301 Updated Nov 10, 2025

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,026 2,214 Updated Oct 17, 2025

comet-ml / opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Python 15,641 1,168 Updated Nov 12, 2025

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,262 890 Updated Jul 8, 2025

kvfrans / cfgrl

Python 53 4 Updated May 31, 2025

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,184 304 Updated Nov 12, 2025

QwenLM / Qwen3-Coder

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,308 991 Updated Jul 31, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 50,313 6,255 Updated Nov 12, 2025

yuhuUSTC / FAR

Frequency Autoregressive Image Generation with Continuous Tokens

Python 92 4 Updated Jun 9, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,038 299 Updated Nov 3, 2025

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,891 134 Updated Dec 6, 2024

nerfies / nerfies.github.io

JavaScript 3,693 1,588 Updated Jun 21, 2024

OpenPipe / deductive-reasoning

Train your own SOTA deductive reasoning model

Python 108 8 Updated Mar 6, 2025

zjysteven / VLM-Visualizer

Visualizing the attention of vision-language models

Jupyter Notebook 252 20 Updated Feb 28, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,715 986 Updated Nov 6, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,858 899 Updated Sep 30, 2025

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,825 134 Updated Jan 17, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,633 2,398 Updated Sep 8, 2025

uclaml / SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,215 102 Updated May 8, 2024

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,389 1,521 Updated Apr 24, 2025

deepseek-ai / DeepSeek-R1

91,476 11,782 Updated Jun 27, 2025

ccny-ccvcl / ccvcl-web

ccvcl website

HTML 1 Updated Oct 14, 2025

Zer0-bit / gaggiuino

A Gaggia Classic control project using microcontrollers.

2,324 344 Updated Nov 6, 2025

emingenc / even_glasses

even-realities g1 smart glasses ble control pip package

Python 72 15 Updated Nov 24, 2024

even-realities / EvenDemoApp

C 409 116 Updated Jun 30, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,354 545 Updated Nov 8, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,275 423 Updated Nov 12, 2025

SergioMEV / slurm-for-dummies

A dummy's guide to setting up (and using) HPC clusters on Ubuntu 22.04LTS using Slurm and Munge. Created by the Quant Club @ UIowa.

376 32 Updated Apr 3, 2024