Stars
Kimi K2 is the large language model series developed by the Moonshot AI team
Megvii FILE Library - Work with files in Python the same way as with the standard library
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
An open-source coding LLM for software engineering tasks
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
R1-onevision, a visual language model capable of deep CoT reasoning.
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
[NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
DeepSeek-VL: Towards Real-World Vision-Language Understanding
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
MMICL, a state-of-the-art VLM with in-context learning ability, from PKU
🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".
✨✨Latest Advances on Multimodal Large Language Models
ChatGPT's explosive popularity marks a key step toward AGI. This project collects open-source alternatives to ChatGPT, including text LLMs and multimodal LLMs, for everyone's convenience.
Recent LLM-based CV and related works. Welcome to comment/contribute!
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LLMs such as MiniGPT-4, StableLM, and MOSS.
An open-source tool-augmented conversational language model from Fudan University
Tool learning for big models; open-source solutions for ChatGPT plugins
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Large-scale text-video dataset. 10 million captioned short videos.
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[ICLR'22] Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks