-
LSA Lab, National Tsing Hua University
- Shenzhen, China
- https://yszheda.github.io/blog
Stars
CUDA Templates and Python DSLs for High-Performance Linear Algebra
flink learning blog. http://www.54tianzhisheng.cn/tags/Flink/
🦜🔗 The platform for reliable agents.
mirror of git://git.kernel.org/pub/scm/linux/kernel/git/paulmck/perfbook.git
Chinese translation of Bjarne Stroustrup's HOPL4 paper
jalammar / jalammar.github.io
Forked from barryclark/jekyll-nowBuild a Jekyll blog in minutes, without touching the command line.
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…
An Industrial Grade Federated Learning Framework
Bolt is a deep learning library with high performance and heterogeneous flexibility.
😎 A curated list of awesome collision detection libraries and resources
FeatherCNN is a high performance inference engine for convolutional neural networks.
oneAPI Threading Building Blocks (oneTBB)
Sources for Arm Streamline's gator daemon, part of Arm Mobile Studio suite of performance analysis tools
YouZan systemtap toolkit to online analyze on production
An Open Source Machine Learning Framework for Everyone
rkflashkit is an open source toolkit for flashing Linux kernel images to rockchip rk3066/rk3188/rk3288 etc. based devices. It's programmed with python and gtk2.
Deep Learning Architecture Genealogy Project