Skip to content
View RERV's full-sized avatar

Block or report RERV

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kimi K2 is the large language model series developed by Moonshot AI team

8,828 588 Updated Nov 7, 2025

Megvii FILE Library - Working with Files in Python same as the standard library

Python 160 18 Updated Nov 7, 2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Python 161 17 Updated Sep 18, 2025

open-source coding LLM for software engineering tasks

Python 1,029 117 Updated Sep 30, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,098 53 Updated Jul 15, 2025

R1-onevision, a visual language model capable of deep CoT reasoning.

Python 569 16 Updated Apr 13, 2025

[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences

318 11 Updated Aug 10, 2024

[NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs

Python 21 5 Updated Oct 15, 2024

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Python 50 1 Updated Mar 9, 2025

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 2,258 129 Updated May 30, 2025

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,003 580 Updated Apr 24, 2024
Python 1,840 61 Updated Jun 28, 2024

[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

Python 2,981 359 Updated Apr 22, 2025

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

Python 357 14 Updated Dec 18, 2023

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Jupyter Notebook 468 38 Updated Jan 19, 2024

✨✨Latest Advances on Multimodal Large Language Models

16,649 1,073 Updated Nov 9, 2025

ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利

2,026 201 Updated Aug 14, 2023

Recent LLM-based CV and related works. Welcome to comment/contribute!

873 38 Updated Mar 8, 2025

A PyTorch implementation of TVC

Jupyter Notebook 24 1 Updated Dec 18, 2023

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,309 268 Updated Jan 18, 2025

An open-source tool-augmented conversational language model from Fudan University

Python 12,062 1,138 Updated Jul 13, 2024

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins

Python 2,787 251 Updated Dec 5, 2023

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,753 2,932 Updated Sep 2, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,567 46,114 Updated Nov 9, 2025

Large-scale text-video dataset. 10 million captioned short videos.

Python 663 39 Updated Aug 14, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,437 6,138 Updated Sep 18, 2024

[ICLR'22] Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

Python 184 19 Updated Mar 13, 2023
Python 105 12 Updated Nov 11, 2023
Next