Skip to content
View ruoqianguo's full-sized avatar
🎯
Focusing
🎯
Focusing
  • NVIDIA Corporation
  • Shanghai

Organizations

@CVCUDA

Block or report ruoqianguo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,557 712 Updated Nov 27, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,913 756 Updated Nov 25, 2025
C++ 1 Updated Mar 14, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,044 1,121 Updated Apr 30, 2025

一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words an…

Swift 11,014 557 Updated Nov 23, 2025

直播源相关资源汇总 📺 💯 IPTV、M3U —— 勤洗手、戴口罩,祝愿所有人百毒不侵

27,801 3,267 Updated Nov 14, 2025

The Triton TensorRT-LLM Backend

910 132 Updated Nov 25, 2025

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 1,100 98 Updated Dec 26, 2024

🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.

Python 20,606 1,397 Updated Jun 23, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,247 1,895 Updated Nov 27, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,072 1,682 Updated Nov 27, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,057 2,655 Updated Nov 3, 2025

Inference code for Llama models

Python 58,950 9,817 Updated Jan 26, 2025

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 56,174 17,340 Updated Nov 25, 2025

Free ChatGPT Site List 这儿为你准备了众多免费好用的ChatGPT镜像站点

17,060 1,449 Updated Oct 27, 2025

Decode JPEG image on GPU using PyTorch

C++ 93 10 Updated Oct 9, 2023

Deformable DETR: Deformable Transformers for End-to-End Object Detection.

Python 3,800 600 Updated May 16, 2024

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 4,153 666 Updated Aug 15, 2024

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,610 244 Updated Nov 15, 2025

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,859 1,505 Updated Nov 27, 2025

Reference implementations of MLPerf® training benchmarks

Python 1,728 585 Updated Nov 25, 2025

This repository contains the results and code for the MLPerf™ Training v2.0 benchmark.

C++ 29 24 Updated Feb 23, 2024

OpenMMLab Detection Toolbox and Benchmark

Python 32,077 9,830 Updated Aug 21, 2024

PyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT

Jupyter Notebook 2 Updated Apr 26, 2023

A complete easy to follow implementation of Google's Vision Transformer proposed in "AN IMAGE IS WORTH 16X16 WORDS". This pytorch implementation has comments for better understanding.

Python 98 14 Updated Dec 1, 2020

Video Swin Transformer - PyTorch

Python 266 38 Updated Jan 4, 2022

This is an official implementation for "Video Swin Transformers".

Python 1,605 210 Updated Mar 8, 2023

Simple samples for TensorRT programming

Python 1,647 351 Updated Nov 25, 2025

The Foundation for All Legate Libraries

C++ 232 63 Updated Nov 26, 2025
Next