Stars
A Datacenter Scale Distributed Inference Serving Framework
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,OpenAI,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words an…
直播源相关资源汇总 📺 💯 IPTV、M3U —— 勤洗手、戴口罩,祝愿所有人百毒不侵
The Triton TensorRT-LLM Backend
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Reference implementations of MLPerf® training benchmarks
This repository contains the results and code for the MLPerf™ Training v2.0 benchmark.
OpenMMLab Detection Toolbox and Benchmark
inocsin / Torch-TensorRT
Forked from pytorch/TensorRTPyTorch/TorchScript compiler for NVIDIA GPUs using TensorRT
A complete easy to follow implementation of Google's Vision Transformer proposed in "AN IMAGE IS WORTH 16X16 WORDS". This pytorch implementation has comments for better understanding.
Video Swin Transformer - PyTorch
This is an official implementation for "Video Swin Transformers".
Simple samples for TensorRT programming