- Shanghai Jiao Tong University
- Shanghai
- https://scholar.google.com/citations?user=6aARLhMAAAAJ&hl=zh-CN
Starred repositories
🛠️ DeepAgent: A General Reasoning Agent with Scalable Toolsets
🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code execution & editing
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Reference PyTorch implementation and models for DINOv3
Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Manage…
[ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
A Python toolkit of machine learning (ML) practices for combinatorial optimization (CO).
Official implementation of ICLR 2025 paper: "Unify ML4TSP: Drawing Methodological Principles for TSP and Beyond from Streamlined Design Space of Learning and Search".
MoBA: Mixture of Block Attention for Long-Context LLMs
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
DeepEP: an efficient expert-parallel communication library
Fully open reproduction of DeepSeek-R1
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Ongoing research training transformer models at scale
Example models using DeepSpeed
Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
[TMLR 2025🔥] A survey of autoregressive models in vision.