- Ph.D. Candidate @ CUHK-MMLab, B.E. @ UCAS
- Hong Kong
- https://jf-d.github.io/
Stars
MiroThinker is an open-source search agent model, built for tool-augmented reasoning and real-world information seeking, aiming to match the deep research experience of OpenAI Deep Research and Gem…
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
My learning notes for ML SYS.
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
ScreenCoder: Turn any UI screenshot into clean, editable HTML/CSS with full control. Fast, accurate, and easy to customize.
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
[arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
DeepSeek-V3/R1 inference performance simulator
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient Multi-head Latent Attention Kernels
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A bibliography and survey of the papers surrounding o1
[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
A unified inference and post-training framework for accelerated video generation.
[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training
Context-parallel attention that accelerates DiT model inference with dynamic caching (https://wavespeed.ai/)
verl: Volcano Engine Reinforcement Learning for LLMs