Starred repositories
Reverse-Engineered Reasoning for Open-Ended Generation
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models (see the INT8 QK^T sketch after this list)
A machine learning compiler for GPUs, CPUs, and ML accelerators
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
📚 LeetCUDA: modern CUDA learning notes with PyTorch for beginners 🐑; 200+ CUDA kernels, Tensor Cores, HGEMM, FA-2 MMA 🎉
Collection of benchmarks to measure basic GPU capabilities
Resources on building LLM applications with the RAG pattern
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
LLM notes, including model inference, Transformer model structure, and LLM framework code analysis
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
How to learn PyTorch and OneFlow
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
PyTorch native quantization and sparsity for training and inference
Development repository for the Triton language and compiler
A high performance and generic framework for distributed DNN training
YuE: open full-song music generation foundation model, similar to Suno.ai but open source
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference
Ring attention implementation with FlashAttention (see the ring-attention sketch after this list)
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
AIInfra (AI infrastructure) covers the AI systems stack from the underlying hardware, such as chips, up through the software layers that support training and inference of large AI models.
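For the quantized-attention entry above, a minimal sketch of the generic idea: compute the QK^T scores in INT8 with per-row scales and dequantize afterward. This is only an illustration of INT8 score computation, not SageAttention's actual scheme (which also smooths K and handles the PV product with its own low-precision path); `quantize_int8` and `int8_qk_scores` are hypothetical names, assuming PyTorch.

```python
import torch

def quantize_int8(x):
    # Per-row symmetric quantization: map each row's max |value| to 127.
    # Assumes no all-zero rows (the scale would be 0).
    scale = x.abs().amax(dim=-1, keepdim=True) / 127.0
    q = torch.clamp((x / scale).round(), -127, 127).to(torch.int8)
    return q, scale

def int8_qk_scores(q, k):
    # Matmul in the integer domain (emulated here in int32), then
    # dequantize with the outer product of the Q and K row scales.
    q8, sq = quantize_int8(q)
    k8, sk = quantize_int8(k)
    s = (q8.to(torch.int32) @ k8.to(torch.int32).T).float()
    return s * (sq * sk.T)

# Quantized scores closely track the FP32 reference.
q, k = torch.randn(8, 16), torch.randn(8, 16)
print((int8_qk_scores(q, k) - q @ k.T).abs().max())  # small quantization error
```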
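And for the ring-attention entry, a single-process simulation of the communication pattern: each "host" keeps one query chunk while key/value chunks rotate around the ring, and per-block partial results are merged with the online-softmax accumulation that FlashAttention uses. Names like `ring_attention_sim` and `num_hosts` are illustrative, not the repo's API; the sequence length is assumed divisible by `num_hosts`.

```python
import torch

def ring_attention_sim(q, k, v, num_hosts):
    # q, k, v: (seq_len, head_dim), split along seq_len into num_hosts chunks.
    d = q.shape[-1]
    q_chunks = q.chunk(num_hosts)
    k_chunks, v_chunks = list(k.chunk(num_hosts)), list(v.chunk(num_hosts))
    outs = []
    for i, qi in enumerate(q_chunks):
        # Running max, normalizer, and weighted-value accumulator
        # for online-softmax merging of per-block results.
        m = torch.full((qi.shape[0], 1), float("-inf"))
        l = torch.zeros(qi.shape[0], 1)
        acc = torch.zeros_like(qi)
        for step in range(num_hosts):
            j = (i + step) % num_hosts  # K/V block arriving on this ring step
            s = qi @ k_chunks[j].T / d ** 0.5
            m_new = torch.maximum(m, s.max(dim=-1, keepdim=True).values)
            p = torch.exp(s - m_new)
            rescale = torch.exp(m - m_new)  # re-normalize earlier blocks
            l = l * rescale + p.sum(dim=-1, keepdim=True)
            acc = acc * rescale + p @ v_chunks[j]
            m = m_new
        outs.append(acc / l)
    return torch.cat(outs)

# Sanity check against full softmax attention.
q, k, v = (torch.randn(8, 4) for _ in range(3))
ref = torch.softmax(q @ k.T / 4 ** 0.5, dim=-1) @ v
assert torch.allclose(ring_attention_sim(q, k, v, num_hosts=4), ref, atol=1e-5)
```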