liaoning97

Starred repositories

🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets

Python 831 106 Updated Nov 2, 2025

Pose-only SDK for Structure from Motion

C++ 22 4 Updated Nov 7, 2025

Python 3 Updated Oct 25, 2025

🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code execution & editing

Python 33 1 Updated Oct 20, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 358 17 Updated Aug 26, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,773 1,371 Updated Nov 28, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,589 615 Updated Nov 20, 2025

Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Manage…

Python 3,178 285 Updated Nov 29, 2025

[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Python 410 17 Updated Apr 25, 2025

Adapting VLMs to Bench2Drive.

Python 166 21 Updated Oct 12, 2025

Megatron's multi-modal data loader

Python 278 32 Updated Nov 20, 2025

A Python toolkit for Machine Learning (ML) practices for Combinatorial Optimization (CO).

C 71 12 Updated Nov 29, 2025

Official implementation of ICLR 2025 paper: "Unify ML4TSP: Drawing Methodological Principles for TSP and Beyond from Streamlined Design Space of Learning and Search".

C 45 Updated May 20, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,007 127 Updated Apr 3, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,449 210 Updated Nov 13, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,765 1,009 Updated Nov 25, 2025

Fully open reproduction of DeepSeek-R1

Python 25,694 2,402 Updated Nov 24, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,970 532 Updated Sep 25, 2024

【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models

Python 2,277 140 Updated Jul 15, 2025

LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment

Python 386 30 Updated Apr 29, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,197 3,211 Updated Nov 28, 2025

Ongoing research training transformer models at scale

Python 14,352 3,325 Updated Nov 28, 2025

Python 1,490 219 Updated Jun 26, 2025

Example models using DeepSpeed

Python 6,735 1,110 Updated Oct 15, 2025

Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)

Python 408 62 Updated Nov 27, 2024

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,440 563 Updated Nov 28, 2025

Multimodal MM + Chat collection

Python 279 22 Updated Aug 19, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,328 442 Updated Nov 28, 2025

[TMLR 2025🔥] A survey for the autoregressive models in vision.

753 21 Updated Nov 8, 2025