bashow0316

Follow

🎯

Focusing

Shogo Asaba bashow0316

🎯

Focusing

Follow

50 followers · 52 following

Achievements

Achievements

Highlights

Pro

Stars

ROCm / amdsmi

AMD SMI

C++ 109 56 Updated Dec 15, 2025

ROCm / HIPIFY

HIPIFY: Convert CUDA to Portable C++ Code

C++ 646 102 Updated Jan 13, 2026

alibaba / clusterdata

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 1,942 450 Updated Nov 22, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,878 1,055 Updated Dec 29, 2025

uccl-project / uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,168 111 Updated Jan 12, 2026

openpmix / openpmix

OpenPMIx Project Repository

C 258 126 Updated Jan 10, 2026

AMD-AGI / Primus

Python 65 23 Updated Jan 12, 2026

ROCm / aiter

AI Tensor Engine for ROCm

Python 335 173 Updated Jan 13, 2026

vllm-project / guidellm

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 800 112 Updated Jan 13, 2026

E869120 / kyopro_educational_90

2021/3/30 ～ 2021/7/12 に行われる企画「競プロ典型 90 問」の問題・解説・ソースコードなどの資料をアップロードしています。

C++ 837 84 Updated May 29, 2024

InferenceMAX / InferenceMAX

Open Source Continuous Inference Benchmarking - GB200 NVL72 vs MI355X vs B200 vs H200 vs MI325X & soon™ TPUv6e/v7/Trainium2/3/GB300 NVL72 - DeepSeek 670B MoE, GPTOSS

Python 415 70 Updated Jan 13, 2026

apache / tvm

Open Machine Learning Compiler Framework

Python 13,017 3,760 Updated Jan 12, 2026

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,687 192 Updated Jun 25, 2024

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,920 1,379 Updated Jan 12, 2026

Kaggle / docker-python

Kaggle Python docker image

Python 2,663 1,006 Updated Jan 12, 2026

melpon / wandbox

Social Compilation Service

TypeScript 1,251 106 Updated Dec 24, 2025

cohere-ai / cohere-terrarium

A simple Python sandbox for helpful LLM data agents

Python 302 51 Updated Jun 18, 2024

SWE-bench / SWE-bench

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,102 732 Updated Jan 4, 2026

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,222 214 Updated Jan 13, 2026

ai-dynamo / nixl

NVIDIA Inference Xfer Library (NIXL)

C++ 801 219 Updated Jan 12, 2026

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,339 290 Updated Jan 12, 2026

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 10,727 1,251 Updated Jan 8, 2026

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,476 467 Updated Sep 8, 2025

JIA-Lab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,697 293 Updated Aug 14, 2024

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,725 1,509 Updated Jan 4, 2026

upa / mscp

mscp: transfer files over multiple SSH (SFTP) connections

C 234 19 Updated Nov 15, 2025

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,167 395 Updated Jul 11, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 82,832 12,449 Updated Jan 10, 2026

NVIDIA / ngpt

Normalized Transformer (nGPT)

Python 195 22 Updated Nov 19, 2024

OpenBMB / InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

Python 370 30 Updated Sep 25, 2024