Skip to content
View thisisiron's full-sized avatar
😵‍💫
😵‍💫

Organizations

@ai-rush-2019 @bcaitech1 @ml-zip

Block or report thisisiron

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The absolute trainer to light up AI agents.

Python 7,857 610 Updated Nov 10, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,339 543 Updated Nov 8, 2025

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,310 268 Updated Jan 18, 2025

This is the official repository for our recent work: PIDNet

Python 714 125 Updated Aug 6, 2024

The official implementation of "Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes"

Python 459 55 Updated Jun 19, 2023

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,851 160 Updated Oct 9, 2025

A framework for few-shot evaluation of language models.

Python 10,580 2,840 Updated Nov 10, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

1,916 81 Updated Nov 8, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,056 3,324 Updated Nov 10, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,462 4,001 Updated Nov 10, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,202 217 Updated Nov 8, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,118 3,944 Updated Nov 10, 2025

"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"

Python 7,769 1,048 Updated Oct 16, 2025

"VideoRAG: Chat with Your Videos"

Python 1,270 183 Updated Oct 22, 2025

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,541 411 Updated Oct 16, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 78,349 11,578 Updated Nov 9, 2025

🧮 Calculator for vision tokens in VLMs.

Python 1 Updated Oct 4, 2025

LLM inference in C/C++

C++ 89,532 13,630 Updated Nov 10, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,748 634 Updated Nov 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,674 11,163 Updated Nov 10, 2025

Awesome-Paper-list: Visualization meets LLM

55 2 Updated Sep 28, 2025

An AI-powered task-management system you can drop into Cursor, Lovable, Windsurf, Roo, and others.

JavaScript 23,572 2,301 Updated Nov 10, 2025

A powerful AI coding agent. Built for the terminal.

Go 9,471 800 Updated Sep 18, 2025

Code-MCP: Connect Claude AI to your development environment through the Model Context Protocol (MCP), enabling terminal commands and file operations through the AI interface.

Python 34 2 Updated Mar 21, 2025

Machine Learning Engineering Open Book

Python 15,685 958 Updated Oct 27, 2025

VisionLLM Series

Python 1,122 57 Updated Feb 27, 2025

[CVPR 2025 Highlight] Official code for "Olympus: A Universal Task Router for Computer Vision Tasks"

Python 428 72 Updated May 31, 2025

Model Context Protocol Servers

TypeScript 72,298 8,697 Updated Nov 10, 2025

The AI Code Editor

31,641 2,098 Updated Oct 22, 2025
Next