USTC
- Beijing
Stars
Open Immersive Translate. A revolutionary open-source browser translation plugin that enables everyone to have a native-like reading experience.
Perform common file preview and editing via the web.
Collection of Loongson products' public documentation
Large Language Model (LLM) Systems Paper List
A list of benchmark suites used in the research related to compilers, program performance, scientific computations etc.
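This project is a simple introduction to CUDA, covering the CUDA execution model, the thread hierarchy, the CUDA memory model, how to write kernel functions, and two ways of using CUDA extensions from PyTorch. Working through it gives a basic grounding in developing PyTorch-based CUDA extensions.
My learning notes and code for ML systems.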
An open-source AI agent that brings the power of Gemini directly into your terminal.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
CUDA Templates and Python DSLs for High-Performance Linear Algebra
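A good project for campus recruitment (autumn and spring hiring) and internships! Implement a high-performance deep learning inference library from scratch, step by step, with support for inference on models such as llama2, Unet, Yolov5, and Resnet.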
Test suite for C/C++/Fortran compilers developed by Fujitsu
This is a Chinese translation of the CUDA programming guide
Generate high-definition short videos with one click using large AI models.
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
resurrected LLVM "C Backend", with improvements
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
A smarter cd command. Supports all major shells.
Preview GitHub README.md files locally before committing them.
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (from the book 《深度学习框架PyTorch:入门与实战》, "Deep Learning Framework PyTorch: Introduction and Practice")
GPUOcelot: A dynamic compilation framework for PTX
A Datacenter Scale Distributed Inference Serving Framework
Fast and memory-efficient exact attention
Development repository for the Triton language and compiler
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.