jaesuny

Follow

Jaesun Park jaesuny

Follow

51 followers · 120 following

@kakao
Seoul, South Korea

Achievements

Achievements

Starred repositories

bzantium / smart-menlo

JavaScript 6 1 Updated Sep 30, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,929 287 Updated May 15, 2025

deepseek-ai / DeepSeek-R1

91,481 11,776 Updated Jun 27, 2025

princeton-nlp / HELMET

The HELMET Benchmark

Jupyter Notebook 182 32 Updated Aug 15, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,089 827 Updated Nov 3, 2025

chatnoir-eu / chatnoir-resiliparse

A robust web archive analytics toolkit

Cython 119 15 Updated Oct 15, 2025

meta-pytorch / attention-gym

Helpful tools and examples for working with flex-attention

Python 1,053 64 Updated Nov 14, 2025

BobMcDear / attorch

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 582 30 Updated Aug 12, 2025

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,065 116 Updated Jul 29, 2024

deepseek-ai / DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

6,209 995 Updated Nov 11, 2025

microsoft / Samba

[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Python 927 46 Updated Oct 31, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,360 1,767 Updated Oct 13, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,531 31,147 Updated Nov 14, 2025

huggingface / text-clustering

Easily embed, cluster and semantically label text datasets

Python 584 47 Updated Mar 28, 2024

microsoft / PhiCookBook

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 3,594 469 Updated Oct 17, 2025

meta-llama / llama

Inference code for Llama models

Python 58,925 9,815 Updated Jan 26, 2025

MiroPsota / torch_packages_builder

Builder and index for PyTorch packages

Python 255 31 Updated Oct 16, 2025

mit-han-lab / omniserve

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 778 54 Updated Mar 6, 2025

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,032 2,644 Updated Nov 3, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,093 3,481 Updated Jan 26, 2025

feifeibear / long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 598 69 Updated Oct 14, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 28,155 3,287 Updated Jun 26, 2025

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 2,504 369 Updated Nov 15, 2025

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,229 68 Updated May 21, 2025

NVIDIA-NeMo / Curator

Scalable data pre processing and curation toolkit for LLMs

Python 1,215 188 Updated Nov 14, 2025

srush / Triton-Puzzles

Puzzles for learning Triton

Jupyter Notebook 2,114 172 Updated Nov 18, 2024

volcengine / veScale

A PyTorch Native LLM Training Framework

Python 884 50 Updated Sep 12, 2025

xai-org / grok-1

Grok open release

Python 50,562 8,373 Updated Aug 30, 2024

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 910 88 Updated Sep 10, 2025

gpu-mode / resource-stream

GPU programming related news and material links

1,787 105 Updated Sep 17, 2025

Starred topics

macOS