carrycooldude

🎯

Focusing

Kartikey Rawat carrycooldude

🎯

Focusing

Sr. AI Developer Advocate at Qualcomm | Google Developer Expert in AI and Google Cloud |

331 followers · 127 following

Achievements

Highlights

Developer Program Member

Organizations

Lists (2)

Sort

✨ Inspiration

13 repositories

Need to Test it out

4 repositories

Starred repositories

harvard-edge / cs249r_book

Introduction to Machine Learning Systems

JavaScript 9,911 1,032 Updated Nov 26, 2025

SamsungSAILMontreal / TinyRecursiveModels

Python 5,717 855 Updated Oct 8, 2025

pytorch / helion

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 639 75 Updated Nov 26, 2025

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,102 250 Updated Nov 26, 2025

google / tunix

A JAX-native LLM Post-Training Library

Python 1,918 177 Updated Nov 26, 2025

google-gemini / gemini-fullstack-langgraph-quickstart

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,387 2,964 Updated Nov 16, 2025

google-ai-edge / gallery

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 14,396 1,209 Updated Nov 25, 2025

sayakpaul / nanoDiT

Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.

Python 137 17 Updated May 29, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,425 1,902 Updated Jun 3, 2025

google / gemmlowp

Low-precision matrix multiplication

C++ 1,816 456 Updated Jan 29, 2024

ggml-org / llama.cpp

LLM inference in C/C++

C++ 90,441 13,832 Updated Nov 26, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 50,153 8,383 Updated Nov 12, 2025

NVIDIA / Cosmos

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,063 522 Updated Jun 9, 2025

facebookresearch / memory

Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsely activated memory layers complement compute-heavy dense f…

Python 358 29 Updated Dec 12, 2024

CRED-CLUB / propeller

Push platform for realtime and bidirectional communication between clients and servers

Go 321 45 Updated Nov 20, 2025

bytedance / monolith

A Lightweight Recommendation System

Python 9,000 688 Updated Oct 13, 2025

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,057 2,654 Updated Nov 3, 2025

juspay / hyperswitch

An open source payments switch written in Rust to make payments fast, reliable and affordable

Rust 39,339 4,552 Updated Nov 26, 2025

sayakpaul / diffusers-torchao

End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).

Python 385 14 Updated May 29, 2025

jax-ml / ml_dtypes

A stand-alone implementation of several NumPy dtype extensions used in machine learning.

C++ 312 50 Updated Nov 25, 2025

IntelLabs / RAG-FiT

Framework for enhancing LLMs for RAG tasks using fine-tuning.

Python 758 59 Updated May 22, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 94,576 10,682 Updated Nov 26, 2025

neuralgcm / neuralgcm

Hybrid ML + physics model of the Earth's atmosphere

Python 892 114 Updated Nov 25, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 24,719 1,820 Updated Jul 31, 2025

facebook / igl

Intermediate Graphics Library (IGL) is a cross-platform library that commands the GPU. It provides a single low-level cross-platform interface on top of various graphics APIs (e.g. OpenGL, Metal an…

C++ 3,147 198 Updated Nov 26, 2025

huggingface / huggingface-llama-recipes

Jupyter Notebook 692 85 Updated Apr 30, 2025

llamastack / llama-stack-apps

Agentic components of the Llama Stack APIs

4,281 638 Updated Aug 5, 2025

FirebaseExtended / compass-travel-planning-sample

TypeScript 48 26 Updated Nov 18, 2025

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 21,650 1,863 Updated Nov 25, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,185 3,210 Updated Nov 26, 2025