Skip to content
View ETOgaosion's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report ETOgaosion

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Perplexity open source garden for inference technology

Rust 274 21 Updated Nov 20, 2025

support Multiple Producer and Multiple Consumer with lock-free queue

C++ 18 4 Updated Jan 11, 2021

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,227 107 Updated Oct 20, 2025
Python 1,009 61 Updated Nov 20, 2025

A library of reinforcement learning components and agents

Python 3,855 511 Updated Sep 26, 2025

This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.

247 30 Updated Sep 21, 2025

The Fish Shell Framework

Shell 11,100 810 Updated May 30, 2025

BurstEngine is an efficient framework designed to train LLMs on long-sequence data.

Python 7 2 Updated Sep 25, 2025

flash attention tutorial written in python, triton, cuda, cutlass

Cuda 454 50 Updated May 14, 2025

NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer

Cuda 144 12 Updated Sep 18, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 749 86 Updated Nov 28, 2025

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,481 138 Updated Nov 29, 2025

Expert Parallelism Load Balancer

Python 1,314 195 Updated Mar 24, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,253 109 Updated Nov 18, 2025

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 892 51 Updated Nov 27, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,182 85 Updated Aug 28, 2025

使用HTML5与原生JavaScript实现太鼓达人网页版游戏

JavaScript 35 14 Updated Sep 28, 2016

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,857 2,681 Updated Nov 29, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 63,264 7,649 Updated Nov 27, 2025

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

Python 1,014 163 Updated Mar 13, 2019

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 165 34 Updated Nov 27, 2025

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

C 1,514 497 Updated Nov 27, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,571 304 Updated Nov 13, 2025
Python 304 52 Updated Sep 8, 2025

✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】

Jupyter Notebook 14,714 1,731 Updated Nov 28, 2025

GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types. It can be compiled and run on both Linux and Windows.

C++ 114 26 Updated Jul 8, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,097 238 Updated Nov 29, 2025

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

Python 847 88 Updated Sep 16, 2025

Learning Large Language Model (LLM)(大语言模型学习)

Python 845 104 Updated Apr 13, 2025
Next