Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 24,990 1,741 Updated Oct 13, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 49,932 5,223 Updated Oct 13, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 14,166 1,075 Updated Oct 13, 2025

Train transformer language models with reinforcement learning.

Python 15,854 2,232 Updated Oct 13, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,587 298 Updated Aug 6, 2025

[NeurIPS2024] Cross-video Identity Correlating for Person Re-identification Pre-training

Python 93 4 Updated Jun 20, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,026 30,752 Updated Oct 13, 2025

Ultralytics YOLO 🚀

Python 47,224 9,144 Updated Oct 13, 2025

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,486 890 Updated Jun 6, 2025

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,258 102 Updated Oct 9, 2025

Official inference framework for 1-bit LLMs

Python 24,137 1,860 Updated Jun 3, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 17,221 2,131 Updated Dec 25, 2024

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025

Python 269 9 Updated May 26, 2025

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 940 39 Updated Mar 19, 2025

LLM101n: Let's build a Storyteller

34,586 1,877 Updated Aug 1, 2024

A generative speech model for daily dialogue.

Python 37,956 4,111 Updated Jul 6, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,177 1,294 Updated May 23, 2024
Python 4,304 408 Updated Sep 14, 2025

HPT - Open Multimodal LLMs from HyperGAI

Python 315 22 Updated Jun 6, 2024

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 842 61 Updated Aug 5, 2025

The official Meta Llama 3 GitHub site

Python 29,031 3,471 Updated Jan 26, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,161 391 Updated Oct 9, 2025

LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer

Python 387 20 Updated Apr 20, 2025

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 496 19 Updated Jan 4, 2025

A family of lightweight multimodal models.

Python 1,044 77 Updated Nov 18, 2024

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 38,507 3,676 Updated Jul 9, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,095 7,285 Updated Oct 13, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 45,053 7,675 Updated Dec 9, 2024

C++ Primer 5th study log (detailed notes and solutions to the end-of-chapter exercises)

C++ 3 Updated Sep 10, 2022