AHEADer

Follow

Zhang Junda AHEADer

Follow

85 followers · 61 following

Tiktok
Singapore

Achievements

Achievements

Lists (1)

Sort

🚀 My stack

Stars

Tony-Tan / CUDA_Freshman

Cuda 2,659 501 Updated Jan 16, 2024

github / spec-kit

💫 Toolkit to help you get started with Spec-Driven Development

Python 61,836 5,374 Updated Dec 4, 2025

AI-Hypercomputer / gpu-recipes

Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

Python 109 55 Updated Jan 10, 2026

DanFu09 / model-tensor-debugger

Debug the intermediate outputs of two models.

HTML 2 Updated Aug 8, 2025

NVIDIA / cuDecomp

An Adaptive Pencil Decomposition Library for NVIDIA GPUs

C++ 74 11 Updated Dec 2, 2025

thinking-machines-lab / batch_invariant_ops

Python 948 71 Updated Nov 4, 2025

vllm-project / semantic-router

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

Go 2,750 414 Updated Jan 12, 2026

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,913 1,206 Updated Sep 26, 2025

deepreinforce-ai / CUDA-L1

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 277 79 Updated Nov 3, 2025

SuperClaude-Org / SuperClaude_Framework

A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.

Python 20,017 1,733 Updated Jan 10, 2026

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,133 645 Updated Jan 10, 2026

OpenMOSS / MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,075 95 Updated Dec 8, 2025

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,227 1,953 Updated Dec 29, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 38,508 4,186 Updated Dec 3, 2025

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,604 2,003 Updated Jan 12, 2026

HazyResearch / Megakernels

kernels, of the mega variety

Python 644 35 Updated Sep 28, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

107,584 28,276 Updated Jan 8, 2026

infinigence / Semi-PD

A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.

Python 123 15 Updated Dec 25, 2025

ByteDance-Seed / Triton-distributed

Distributed Compiler based on Triton for Parallel Systems

Python 1,311 116 Updated Dec 27, 2025

ther0n / UnnaturalScrollWheels

Invert scroll direction for physical scroll wheels while maintaining "Natural" scrolling for trackpads on MacOS

Swift 3,887 84 Updated Dec 2, 2025

Michaelvll / llm-ie-benchmarks

A collection of reproducible inference engine benchmarks

Shell 38 1 Updated Apr 22, 2025

perplexityai / pplx-kernels

Perplexity GPU Kernels

C++ 552 75 Updated Nov 7, 2025

ByteDance-Seed / ByteCheckpoint

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 260 17 Updated Dec 8, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,761 774 Updated Jan 12, 2026

agno-agi / agno

The complete stack for AI Engineers: framework, runtime and control plane.

Python 36,811 4,873 Updated Jan 12, 2026

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,220 85 Updated Aug 28, 2025

PatrickJS / awesome-cursorrules

📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors

MDX 36,930 3,132 Updated Oct 24, 2025

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 75,291 8,986 Updated Jan 11, 2026

skypilot-org / skypilot

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, or on-prem).

Python 9,197 902 Updated Jan 12, 2026

HazyResearch / ThunderKittens

Tile primitives for speedy kernels

Cuda 3,056 224 Updated Jan 12, 2026