Skip to content
View carrycooldude's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@fossasia @EddieHubCommunity @BITDurg-git @Bhilai-Institute-of-Technology-Durg

Block or report carrycooldude

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Introduction to Machine Learning Systems

JavaScript 9,911 1,032 Updated Nov 26, 2025

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 639 75 Updated Nov 26, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,102 250 Updated Nov 26, 2025

A JAX-native LLM Post-Training Library

Python 1,918 177 Updated Nov 26, 2025

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,387 2,964 Updated Nov 16, 2025

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 14,396 1,209 Updated Nov 25, 2025

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

Python 137 17 Updated May 29, 2025

Official inference framework for 1-bit LLMs

Python 24,425 1,902 Updated Jun 3, 2025

Low-precision matrix multiplication

C++ 1,816 456 Updated Jan 29, 2024

LLM inference in C/C++

C++ 90,441 13,832 Updated Nov 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 50,153 8,383 Updated Nov 12, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,063 522 Updated Jun 9, 2025

Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense f…

Python 358 29 Updated Dec 12, 2024

Push platform for realtime and bidirectional communication between clients and servers

Go 321 45 Updated Nov 20, 2025

A Lightweight Recommendation System

Python 9,000 688 Updated Oct 13, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,057 2,654 Updated Nov 3, 2025

An open source payments switch written in Rust to make payments fast, reliable and affordable

Rust 39,339 4,552 Updated Nov 26, 2025

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Python 385 14 Updated May 29, 2025

A stand-alone implementation of several NumPy dtype extensions used in machine learning.

C++ 312 50 Updated Nov 25, 2025

Framework for enhancing LLMs for RAG tasks using fine-tuning.

Python 758 59 Updated May 22, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 94,576 10,682 Updated Nov 26, 2025

Hybrid ML + physics model of the Earth's atmosphere

Python 892 114 Updated Nov 25, 2025

Official inference repo for FLUX.1 models

Python 24,719 1,820 Updated Jul 31, 2025

Intermediate Graphics Library (IGL) is a cross-platform library that commands the GPU. It provides a single low-level cross-platform interface on top of various graphics APIs (e.g. OpenGL, Metal an…

C++ 3,147 198 Updated Nov 26, 2025
Jupyter Notebook 692 85 Updated Apr 30, 2025

Agentic components of the Llama Stack APIs

4,281 638 Updated Aug 5, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,650 1,863 Updated Nov 25, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,185 3,210 Updated Nov 26, 2025
Next