goldenfox2025


An implementation of an inference engine for large models.

C++ 7 1 Updated Aug 19, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 1 Updated Aug 20, 2025
Cuda 1 Updated Jul 12, 2025
C++ 1 Updated Jun 4, 2025

LLM inference in C/C++

C++ 93,077 14,493 Updated Jan 16, 2026
C++ 1 Updated Apr 29, 2025
Python 45 99 Updated Jan 16, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,369 923 Updated Jan 7, 2026

An optimized inference engine core.

1 Updated Apr 6, 2025
Rust 1 Updated Feb 2, 2025

😞

Cuda 1 Updated Feb 4, 2025
C++ 1 Updated Apr 25, 2025

Fast and memory-efficient exact attention

Python 21,656 2,287 Updated Jan 15, 2026

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

Python 4,630 1,092 Updated Dec 15, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 32,285 2,265 Updated Dec 27, 2025

Build, run, manage multi-agent systems.

Python 36,944 4,893 Updated Jan 16, 2026

AI education materials for Chinese students, teachers and IT professionals.

HTML 14,047 2,959 Updated May 16, 2024

LLM training in simple, raw C/CUDA

Cuda 28,618 3,355 Updated Jun 26, 2025