-
Institute of Computing Technology, CAS
Lists (8)
Sort Name ascending (A-Z)
AI
Architecture-simulator
architecture researchchisel-tutorial
Learn chisel3 from scratchcpu-tutorial
for new hand ic studentfunction-simulator
Linux-kernel
NPU
xv6-improved
for research- All languages
- Assembly
- Batchfile
- Bluespec
- C
- C#
- C++
- CMake
- CSS
- Cuda
- Dart
- Dockerfile
- FIRRTL
- Gnuplot
- Go
- HTML
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- LLVM
- Less
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Mathematica
- Nix
- OCaml
- OpenEdge ABL
- PHP
- Perl
- PowerShell
- Python
- Racket
- ReScript
- Rich Text Format
- RobotFramework
- Roff
- Ruby
- Rust
- SCSS
- SWIG
- Scala
- Scheme
- Shell
- Starlark
- Svelte
- SystemVerilog
- Tcl
- TeX
- TypeScript
- Typst
- VHDL
- Verilog
- Vue
Starred repositories
Low overhead tracing library and trace visualizer for pipelined CUDA kernels
Tips for Writing a Research Paper using LaTeX
An Eclipse 4 RCP based GUI to interact with SystemC simulators
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
A simple, lightweight PowerShell script to remove pre-installed apps, disable telemetry, as well as perform various other changes to customize, declutter and improve your Windows experience. Win11D…
A step-by-step tutorial that allows beginners to write their own autonomous vehicle program from scratch using a simple starter kit. Dora-drives makes learning autonomous vehicle systems faster and…
a simple minimal riscv32imac virtual machine, support Linux MMU+SMP booting.
A light llama-like llm inference framework based on the triton kernel.
Recommend new arxiv papers of your interest daily according to your Zotero libarary.
Flash Attention in ~100 lines of CUDA (forward pass only)
This repository provides an HLS-based implementation of Tiny-LLAMA and Llama 2B. We have included detailed host files, configuration files, and the src directory, as well as a ready-to-run bitstrea…
A SystemVerilog language server based on the Slang library.