bashow0316

AMD SMI

C++ 109 56 Updated Dec 15, 2025

HIPIFY: Convert CUDA to Portable C++ Code

C++ 646 102 Updated Jan 13, 2026

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 1,942 450 Updated Nov 22, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,878 1,055 Updated Dec 29, 2025

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,168 111 Updated Jan 12, 2026

OpenPMIx Project Repository

C 258 126 Updated Jan 10, 2026
Python 65 23 Updated Jan 12, 2026

AI Tensor Engine for ROCm

Python 335 173 Updated Jan 13, 2026

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 800 112 Updated Jan 13, 2026

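Problems, explanations, source code, and other materials for the "競プロ典型 90 問" (90 Typical Competitive Programming Problems) project, held from 2021/3/30 to 2021/7/12, are uploaded here.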

C++ 837 84 Updated May 29, 2024

Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72 - DeepSeek 670B MoE, GPTOSS

Python 415 70 Updated Jan 13, 2026

Open Machine Learning Compiler Framework

Python 13,017 3,760 Updated Jan 12, 2026

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,687 192 Updated Jun 25, 2024

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,920 1,379 Updated Jan 12, 2026

Kaggle Python docker image

Python 2,663 1,006 Updated Jan 12, 2026

Social Compilation Service

TypeScript 1,251 106 Updated Dec 24, 2025

A simple Python sandbox for helpful LLM data agents

Python 302 51 Updated Jun 18, 2024

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,102 732 Updated Jan 4, 2026

Scalable toolkit for efficient model reinforcement

Python 1,222 214 Updated Jan 13, 2026

NVIDIA Inference Xfer Library (NIXL)

C++ 801 219 Updated Jan 12, 2026

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,339 290 Updated Jan 12, 2026

Large Language Model Text Generation Inference

Python 10,727 1,251 Updated Jan 8, 2026

Robust recipes to align language models with human and AI preferences

Python 5,476 467 Updated Sep 8, 2025

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,697 293 Updated Aug 14, 2024

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,725 1,509 Updated Jan 4, 2026

mscp: transfer files over multiple SSH (SFTP) connections

C 234 19 Updated Nov 15, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,167 395 Updated Jul 11, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 82,832 12,449 Updated Jan 10, 2026

Normalized Transformer (nGPT)

Python 195 22 Updated Nov 19, 2024

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 370 30 Updated Sep 25, 2024