Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Flash Attention in ~100 lines of CUDA (forward pass only)
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for float…
Text Classification Algorithms: A Survey
A list of Machine Learning Art Colabs
Summary of all repositories for my public contents, mostly Python, in Jupyter Notebooks, PDFs, Markdowns, and more!
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06