Stars
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
A reproduction of the libsmctrl paper, with an added Python interface for flexibly allocating compute resources from Python.
DFloat11: Lossless LLM Compression for Efficient GPU Inference
Documentation of NVIDIA chip/hardware interfaces
📄 Awesome CV is a LaTeX template for your outstanding job application
shinnpuru / VoiceTransl
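Forked from GalTransl/GalTransl
VoiceTrans is a one-stop offline AI tool for generating and translating video subtitles. It supports translators at every stage: video download, audio extraction, transcription and timing, subtitle translation, video compositing, and subtitle summarization.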
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Automatically generate, translate, and overlay subtitles for any video.
Automatically generate and overlay subtitles for any video.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
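A JavaScript-based tool for obtaining download links for cloud-drive files. Modified from 网盘直链下载助手 (a cloud-drive direct-link download helper); supports eight cloud drives: Baidu Netdisk / Aliyun Drive / China Mobile Cloud Drive / Tianyi Cloud Drive / Xunlei Cloud Drive / Quark Netdisk / UC Netdisk / 123 Cloud Drive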
Code and data for the paper "Evaluating Evidence Attribution in Generated Fact Checking Explanations"
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
MIDI event transformer for symbolic music generation
Recommends new arXiv papers matching your interests daily, based on your Zotero library.
Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.
A curated list of awesome advice for computer science Ph.D. applicants.
An open-source guide that demystifies how U.S. universities evaluate and admit students into Computer Science PhD programs.
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).