Skip to content
View hongtaoxuu's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Qingdao

Highlights

  • Pro

Block or report hongtaoxuu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 81,060 5,996 Updated Nov 9, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,164 82 Updated Aug 28, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,227 104 Updated Oct 17, 2025

My learning notes/codes for ML SYS.

Python 4,092 250 Updated Nov 6, 2025

Sampling profiler for Python programs

Rust 14,536 484 Updated Nov 5, 2025

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 395 41 Updated Nov 7, 2025

A fast, clean, responsive Hugo theme.

HTML 12,615 3,260 Updated Oct 26, 2025

The Minimum Viable Model website and Jekyll theme.

CSS 98 448 Updated Jun 15, 2024

Efficient Triton Kernels for LLM Training

Python 5,812 427 Updated Nov 8, 2025

A compiler for pl0 , c++

C++ 1 Updated Jun 24, 2021
C++ 1 Updated Mar 23, 2021

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 3,238 309 Updated Jul 7, 2025

基于Python的开源量化交易平台开发框架

Python 33,628 10,319 Updated Nov 2, 2025

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,943 163 Updated Jul 9, 2025

NCCL Tests

Cuda 1,326 327 Updated Nov 3, 2025

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Python 536 70 Updated Aug 26, 2025

Best practice for training LLaMA models in Megatron-LM

Python 659 56 Updated Jan 2, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

C++ 11 4 Updated Jun 2, 2024

毒奶自用,懒人配置文件(Quantumult X):去广告分流规则、Tiktok解锁重写、VSCO解锁、神机分流、blackmatrix7分流规则。

JavaScript 3,123 235 Updated Jul 2, 2025

毒奶去网页广告计划用户脚本 For Quantumult X & Surge & Shadowrocket & Loon & Stash & 油猴 ;1.新增页面右下角导航;2.通过调用 JavaScript 移除特定网站网页广告 —— 搜索引擎(Bing/Google)广告及内容农场结果清除/低端影视/欧乐影院/iyf爱壹帆/哔滴影视/Pornhub/Javbus/Supjav/Jable…

JavaScript 4,199 235 Updated Nov 7, 2025

Ongoing research training transformer models at scale

Python 14,141 3,257 Updated Nov 7, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,892 540 Updated Nov 7, 2025

BLAS-like Library Instantiation Software Framework

C 2,550 402 Updated Oct 21, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,188 365 Updated Aug 14, 2025

Zero Bubble Pipeline Parallelism

Python 433 31 Updated May 7, 2025

Development repository for the Triton language and compiler

MLIR 17,510 2,370 Updated Nov 9, 2025

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,700 559 Updated Nov 7, 2025

Main repo to keep scripts, dockerfiles, wiki, etc

Shell 15 1 Updated Mar 14, 2023
Next