🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,026 30,752 Updated Oct 13, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 47,224 9,144 Updated Oct 13, 2025

modelscope / facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,486 890 Updated Jun 6, 2025

apple / ml-mobileclip

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,258 102 Updated Oct 9, 2025

microsoft / BitNet

Official inference framework for 1-bit LLMs

Python 24,137 1,860 Updated Jun 3, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,221 2,131 Updated Dec 25, 2024

CircleRadon / TokenPacker

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025

Python 269 9 Updated May 26, 2025

allenai / mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 940 39 Updated Mar 19, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

34,586 1,877 Updated Aug 1, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 37,956 4,111 Updated Jul 6, 2025

naklecha / llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,177 1,294 Updated May 23, 2024

LLaVA-VL / LLaVA-NeXT

Python 4,304 408 Updated Sep 14, 2025

HyperGAI / HPT

HPT - Open Multimodal LLMs from HyperGAI

Python 315 22 Updated Jun 6, 2024

mbzuai-oryx / LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 842 61 Updated Aug 5, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,031 3,471 Updated Jan 26, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,161 391 Updated Oct 9, 2025

thunlp / LLaVA-UHD

LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer

Python 387 20 Updated Apr 20, 2025

pkunlp-icler / FastV

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 496 19 Updated Jan 4, 2025

openai / transformer-debugger

Python 4,100 243 Updated Jun 4, 2024

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 1,044 77 Updated Nov 18, 2024

QuivrHQ / quivr

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,507 3,676 Updated Jul 9, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,095 7,285 Updated Oct 13, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 45,053 7,675 Updated Dec 9, 2024

JimmyHHua / CppPrimer_Learning

C++ Primer 5th study log (detailed notes and solutions to the end-of-chapter exercises)

C++ 3 Updated Sep 10, 2022


Jimmy Hua JimmyHHua
