- Boston
- in/kyle-sayers
Stars
Educational resource demonstrating common GPU programming pitfalls and solutions using Triton kernels.
A high-throughput and memory-efficient inference and serving engine for LLMs
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
A simple GPU reservation tool for single host shared development systems
Keymapper config to make Linux keyboard shortcuts work like a 'Tosh! And more. (A Kinto alternative.)
Achieve state of the art inference performance with modern accelerators on Kubernetes
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
aider is AI pair programming in your terminal
This repository contains implementations of various machine learning models from scratch using PyTorch.
This repository contains JavaScript code (p5js library) for generating fractal tree using recursion. 🍂
tdg5 / reqless-py
Forked from seomoz/qless-pyPython Bindings for qless
Dataset of Linus Torvalds' rants classified by negativity using sentiment analysis
neuralmagic / vllm
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
neuralmagic / nm-vllm
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
pySLAM-D is a real-time SLAM algorithm for UAV aerial stitching. Includes additional features and refactored code inspired by BU's implementation https://github.com/armandok/pySLAM-D
Visualizer for neural network, deep learning and machine learning models
(Next Generation Scholars Fall 2017): A portal for counselors and students to manage college application deadlines and tasks
Pytorch implementation of same-family gaussian mixture models with guardrails. Features separable parameter optimization and singularity mitigation
Push Cursor on Target messages to TAK clients with attachments and other information
An environment of the board game Go using OpenAI's Gym API
Sparsity-aware deep learning inference runtime for CPUs
A game where players compete to draw differing prompts on a shared canvas, as scored by a computer vision model
Code and data for "From Networks to Named Entities and Back Again: Exploring Classical Arabic Isnad Networks"
Deep learning in Rust, with shape checked tensors and neural networks