Skip to content
View AndroidSheepy's full-sized avatar
  • USTC, intern@MBZUAI
  • Abu Dhabi, UAE

Highlights

  • Pro

Block or report AndroidSheepy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

kernels, of the mega variety

Python 595 26 Updated Sep 28, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,849 730 Updated Oct 15, 2025
C++ 31 2 Updated Jul 17, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 35,171 15,032 Updated Nov 2, 2025
Cuda 28 2 Updated Apr 2, 2025

libsmctrl论文的复现,添加了python端接口,可以在python端灵活调用接口来分配计算资源

C 9 1 Updated May 21, 2024
Jupyter Notebook 6 2 Updated Dec 7, 2024

DFloat11: Lossless LLM Compression for Efficient GPU Inference

Python 556 33 Updated Aug 24, 2025

Documentation of NVIDIA chip/hardware interfaces

C 1,313 98 Updated Aug 18, 2025

Effective transpose on Hopper GPU

Cuda 25 3 Updated Sep 6, 2025

📄 Awesome CV is LaTeX template for your outstanding job application

TeX 25,600 5,102 Updated Oct 27, 2025
Python 2 Updated Nov 1, 2024

Port of OpenAI's Whisper model in C/C++

C++ 44,173 4,884 Updated Nov 1, 2025

VoiceTrans是一站式离线AI视频字幕生成和翻译软件,从视频下载,音频提取,听写打轴,字幕翻译,视频合成,字幕总结各个环节为翻译者提供便利。

Python 839 38 Updated Sep 27, 2025

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,827 580 Updated Sep 7, 2024

Automatically generate, translate, and overlay subtitles for any video.

Python 80 7 Updated Jun 10, 2025

Automatically generate and overlay subtitles for any video.

Python 2,072 339 Updated Jul 12, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,815 293 Updated Nov 1, 2025

一个基于 JavaScript 的网盘文件下载地址获取工具。基于【网盘直链下载助手】修改 ,支持 百度网盘 / 阿里云盘 / 中国移动云盘 / 天翼云盘 / 迅雷云盘 / 夸克网盘 / UC网盘 / 123云盘 八大网盘

JavaScript 7,888 366 Updated Oct 12, 2025

Nano vLLM

Python 7,340 942 Updated Aug 31, 2025

Code&Data for the paper "Evaluating Evidence Attribution in Generated Fact Checking Explanations"

HTML 3 Updated Feb 5, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,529 340 Updated Oct 21, 2025

Midi event transformer for symbolic music generation

Python 319 48 Updated Dec 31, 2024

Recommend new arxiv papers of your interest daily according to your Zotero libarary.

Python 3,902 3,391 Updated Aug 16, 2025

Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.

Python 97 15 Updated Sep 17, 2025

A curated list of awesome advice for computer science Ph.D. applicants.

314 15 Updated Sep 12, 2021

A open-source guide that demystifies how U.S. universities evaluate and admit students into Computer Science PhD programs.

TeX 173 13 Updated Nov 1, 2025

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

589 15 Updated Sep 30, 2025
Next