-
Red Hat
-
12:03
(UTC +01:00) - in/nicolo-lucchesi-834958184
Stars
A simple GPU reservation tool for single host shared development systems
Achieve state of the art inference performance with modern accelerators on Kubernetes
Bazzite makes gaming and everyday use smoother and simpler across desktop PCs, handhelds, tablets, and home theater PCs.
A high-throughput and memory-efficient inference and serving engine for LLMs
PyTorch extensions for high performance and large scale training.
Muggled DPT: Depth estimation without the magic
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Accessible large language models via k-bit quantization for PyTorch.
PyTorch code and models for the DINOv2 self-supervised learning method.
Swift app demonstrating Core ML Stable Diffusion
Provides an interface layer to convert between n-dimensional types in different Rust crates
DeepSeek-VL: Towards Real-World Vision-Language Understanding
CUDA Templates and Python DSLs for High-Performance Linear Algebra
On-device AI across mobile, embedded and edge for PyTorch
Generative Models by Stability AI
Everything we actually know about the Apple Neural Engine (ANE)
Stable Diffusion with Core ML on Apple Silicon
Train transformer language models with reinforcement learning.
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Neural Network Compression Framework for enhanced OpenVINO™ inference
Deep learning in Rust, with shape checked tensors and neural networks
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…