hijkzzz

RLer + MLSyser / 2 + NLPer / 2

642 followers · 52 following

Achievements

x3 x4

Stars

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python 859 143 Updated Nov 29, 2025

NVIDIA-NeMo / ProRL-Agent-Server

Python 15 2 Updated Nov 29, 2025

HKUDS / AI-Trader

"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://ai4trade.ai

Python 9,707 1,525 Updated Nov 26, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,148 1,384 Updated Nov 14, 2025

NVIDIA-NeMo / Automodel

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 190 25 Updated Nov 29, 2025

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 2,540 376 Updated Nov 27, 2025

ByteDance-Seed / seed-oss

Python 842 45 Updated Sep 15, 2025

clash-verge-rev / clash-verge-rev

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 84,733 6,255 Updated Nov 29, 2025

onestardao / WFGY

WFGY 2.0. Semantic Reasoning Engine for LLMs (MIT). Fixes RAG/OCR drift, collapse & “ghost matches” via symbolic overlays + logic patches. Autoboot; OneLine & Flagship. ⭐ Star if you explore semant…

Python 1,266 106 Updated Oct 14, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 503 31 Updated Nov 26, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,287 294 Updated Nov 27, 2025

snowflakedb / ArcticTraining

ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)

Python 254 27 Updated Nov 27, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,315 1,948 Updated Nov 1, 2025

ISEEKYAN / mbridge

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 165 34 Updated Nov 27, 2025

ISEEKYAN / verl_megatron_practice

(best/better) practices of megatron on veRL and tuning guide

Shell 103 8 Updated Sep 26, 2025

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 43,881 3,017 Updated Nov 27, 2025