Skip to content
View pprp's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report pprp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 2,626 144 Updated Jan 14, 2026

This repository catalogs cutting-edge research papers, practical tools, datasets, and learning materials for AI-powered SVG generation, processing, and manipulation.

18 Updated Dec 19, 2025

Framework for building enterprise-level assistant agents.

Java 207 48 Updated Jan 16, 2026

A universal sandbox platform for AI application scenarios, providing multi-language SDKs, unified sandbox protocols, and sandbox runtimes for LLM-related capabilities.

Python 138 21 Updated Jan 16, 2026

Demystifying Reinforcement Learning in Agentic Reasoning

Python 149 23 Updated Oct 14, 2025

A construction kit for reinforcement learning environment management.

Python 308 33 Updated Jan 16, 2026

An interface library for RL post training with environments.

Python 1,046 155 Updated Jan 16, 2026

QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

C++ 161 17 Updated Nov 11, 2025

Superposition Yields Robust Neural Scaling

Jupyter Notebook 44 5 Updated Nov 28, 2025
Python 93 13 Updated Nov 16, 2025

Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference examples to build with Nemotron models

Jupyter Notebook 354 53 Updated Jan 14, 2026

Open-source release accompanying Gao et al. 2025

Python 491 51 Updated Dec 11, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,830 239 Updated Jan 16, 2026
Python 33 9 Updated Nov 26, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,456 222 Updated Jan 16, 2026

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,517 704 Updated Jan 16, 2026

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Oustanding Paper Honorable Mention

Python 267 13 Updated Nov 29, 2025

The official PyTorch implementation of the paper "Conda: Column-Normalized Adam for Training Large Language Models Faster"

Python 7 Updated Nov 11, 2025

Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”

Python 113 3 Updated Jan 15, 2026

Complete simulation of IEEE 754 fixed and floating point specification to any precision

Jupyter Notebook 9 2 Updated Jul 7, 2025

Miles is an enterprise-facing reinforcement learning framework for large-scale MoE post-training and production workloads, forked from and co-evolving with slime.

Python 726 77 Updated Jan 17, 2026
Python 12 5 Updated Oct 23, 2025

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 283 19 Updated Nov 7, 2025

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

271 5 Updated Dec 1, 2025

A framework to compare low-bit integer and float-point formats

Python 58 5 Updated Nov 1, 2025

Agentic Learning Powered by AWorld

Python 73 7 Updated Jan 14, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,955 1,381 Updated Jan 12, 2026

[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Python 34 4 Updated Nov 4, 2025

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Cuda 1,020 145 Updated Jan 16, 2026
Next