Skip to content
View frezcirno's full-sized avatar
🚩
Focusing
🚩
Focusing

Highlights

  • Pro

Organizations

@codepass-team

Block or report frezcirno

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Distributed query engine providing simple and reliable data processing for any modality and scale

Rust 4,655 326 Updated Oct 31, 2025

Interrupts-off or softirqs-off latency tracer

C 295 112 Updated Apr 5, 2023

Clipboard extension app for macOS.

Swift 8,248 692 Updated Jun 29, 2024

MIT Chord/DHash

C++ 501 94 Updated Jul 6, 2011

Exploration of the Dynamo paper in Python

HTML 43 2 Updated Jul 5, 2019

CUDA SGEMM optimization note

Cuda 15 2 Updated Oct 31, 2023

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,503 297 Updated Oct 29, 2025

Official git repository for libdivide: optimized integer division

C++ 1,250 89 Updated Jun 15, 2025

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 867 1,093 Updated Aug 29, 2025

Seamless operability between C++11 and Python

C++ 17,406 2,232 Updated Oct 27, 2025

Codes & examples for "CUDA - From Correctness to Performance"

C++ 115 23 Updated Oct 24, 2024

A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.

Python 244 9 Updated Oct 28, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 2,921 214 Updated Oct 31, 2025

An Open Source Machine Learning Framework for Everyone

C++ 192,268 74,942 Updated Oct 31, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,641 671 Updated Oct 31, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,504 3,221 Updated Oct 31, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 795 59 Updated Oct 31, 2025

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 450 103 Updated Oct 30, 2025

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

Python 2,004 220 Updated Oct 13, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,317 235 Updated Oct 31, 2025

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

Python 328 58 Updated Oct 18, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,255 1,099 Updated Oct 30, 2025

MQSim is a fast & accurate simulator for modern multi-queue (MQ) and SATA SSDs. MQSim faithfully models new high-bandwidth protocol implementations, steady-state SSD conditions, and full end-to-end…

C++ 333 164 Updated Aug 25, 2025

A fast multi-producer, multi-consumer lock-free concurrent queue for C++11

C++ 11,665 1,840 Updated Jul 6, 2025

Apache Parquet Format

Thrift 2,082 451 Updated Oct 20, 2025

PetPS: Supporting Huge Embedding Models with Tiered Memory

C++ 33 2 Updated May 21, 2024

A collection of LLM memes

284 4 Updated Sep 22, 2025

[SIGMOD '24] CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure

C++ 70 7 Updated Jul 2, 2024

A Deep Learning Recommender System

Python 2,666 864 Updated Jun 2, 2024

大概是2020年最全的免费可商用字体,这里收录的商免字体都能找到明确的授权出处,可以放心使用,持续更新中...

JavaScript 5,647 403 Updated Feb 27, 2025
Next