Skip to content
View KangcongLi's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report KangcongLi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory

Python 113 4 Updated Nov 4, 2025

This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"

Python 35 2 Updated Jul 11, 2024

🔥🔥First-ever hour scale video understanding models

Python 598 40 Updated Jul 14, 2025

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 2,109 89 Updated Dec 29, 2025

[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding

Python 57 2 Updated Dec 13, 2024

Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。

Python 35,429 2,509 Updated Dec 15, 2025

Official repository of MMDU dataset

Python 102 2 Updated Sep 29, 2024
Python 27 2 Updated Oct 27, 2025

Code of "MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation"

Python 127 3 Updated Dec 4, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,328 7,940 Updated Jan 9, 2026

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 919 164 Updated Dec 20, 2025

A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Python 395 39 Updated Oct 30, 2025

A curated list of large VLM-based VLA models for robotic manipulation.

299 11 Updated Dec 21, 2025

A Self-Training Framework for Vision-Language Reasoning

Python 88 1 Updated Jan 23, 2025

[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

Python 50 5 Updated May 12, 2024

CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models

Python 89 4 Updated Jul 4, 2024

This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.

334 23 Updated Dec 29, 2025

Official PyTorch Code for Anchor Token Guided Prompt Learning Methods: [ICCV 2025] ATPrompt and [Arxiv 2511.21188] AnchorOPT

Python 121 3 Updated Dec 22, 2025

This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.

Python 17 2 Updated Jul 3, 2025
Python 101 3 Updated Aug 14, 2025

LongBench v2 and LongBench (ACL 25'&24')

Python 1,063 115 Updated Jan 15, 2025
Python 18 2 Updated Dec 2, 2024

Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]

Python 38 4 Updated May 28, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,870 78 Updated Jan 4, 2026

Retrieval and Retrieval-augmented LLMs

Python 11,117 826 Updated Dec 15, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,419 117 Updated Nov 13, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,136 229 Updated Aug 17, 2024
Jupyter Notebook 29 7 Updated Oct 4, 2025

StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal Losses (NeurIPS 2024)

Python 3 Updated Oct 26, 2024
Next