Skip to content
View jaesuny's full-sized avatar

Block or report jaesuny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
JavaScript 6 1 Updated Sep 30, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,929 287 Updated May 15, 2025

The HELMET Benchmark

Jupyter Notebook 182 32 Updated Aug 15, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,089 827 Updated Nov 3, 2025

A robust web archive analytics toolkit

Cython 119 15 Updated Oct 15, 2025

Helpful tools and examples for working with flex-attention

Python 1,053 64 Updated Nov 14, 2025

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 582 30 Updated Aug 12, 2025

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,065 116 Updated Jul 29, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

6,209 995 Updated Nov 11, 2025

[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Python 927 46 Updated Oct 31, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,360 1,767 Updated Oct 13, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,531 31,147 Updated Nov 14, 2025

Easily embed, cluster and semantically label text datasets

Python 584 47 Updated Mar 28, 2024

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,594 469 Updated Oct 17, 2025

Inference code for Llama models

Python 58,925 9,815 Updated Jan 26, 2025

Builder and index for PyTorch packages

Python 255 31 Updated Oct 16, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 778 54 Updated Mar 6, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,032 2,644 Updated Nov 3, 2025

The official Meta Llama 3 GitHub site

Python 29,093 3,481 Updated Jan 26, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 598 69 Updated Oct 14, 2025

LLM training in simple, raw C/CUDA

Cuda 28,155 3,287 Updated Jun 26, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,504 369 Updated Nov 15, 2025

Schedule-Free Optimization in PyTorch

Python 2,229 68 Updated May 21, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,215 188 Updated Nov 14, 2025

Puzzles for learning Triton

Jupyter Notebook 2,114 172 Updated Nov 18, 2024

A PyTorch Native LLM Training Framework

Python 884 50 Updated Sep 12, 2025

Grok open release

Python 50,562 8,373 Updated Aug 30, 2024

Ring attention implementation with flash attention

Python 910 88 Updated Sep 10, 2025

GPU programming related news and material links

1,787 105 Updated Sep 17, 2025
Next