VegB
  • UC Santa Barbara
  • Santa Barbara, CA

Organizations

@asyml

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Python 145 4 Updated Aug 23, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,596 575 Updated May 30, 2025
Python 48 6 Updated Dec 8, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,649 381 Updated Jun 2, 2025

Project webpage of LayoutGPT

JavaScript 2 Updated Jun 9, 2023

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,935 380 Updated Mar 14, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,197 1,103 Updated Dec 26, 2025

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,540 190 Updated Apr 2, 2025

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,767 2,923 Updated Sep 2, 2024

Official repo for LayoutGPT

Python 401 29 Updated Apr 10, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,285 210 Updated Mar 5, 2024

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 950 38 Updated Mar 19, 2025

An open-source framework for training large multimodal models.

Python 4,060 320 Updated Aug 31, 2024

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Jupyter Notebook 143 6 Updated Jun 10, 2025

Reverse engineered ChatGPT API

Python 27,989 4,432 Updated Aug 2, 2023

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

Python 1,333 123 Updated Dec 1, 2023

Intuitive Annotation Tool for Information Extraction / Named Entity Recognition using localturk / Amazon Mechanical Turk

JavaScript 264 26 Updated Aug 25, 2019

Implementation of the DeepMind Flamingo vision-language model, based on Hugging Face language models and ready for training

Python 168 18 Updated Apr 27, 2023

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of DeepMind, in PyTorch

Python 1,273 66 Updated Oct 18, 2022

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,347 73 Updated Jul 11, 2024

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Python 2,156 231 Updated May 20, 2024
Jupyter Notebook 231 30 Updated Dec 18, 2023

Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.

Python 4,344 374 Updated Oct 19, 2025

Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.

Python 307 27 Updated Jul 12, 2024

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,188 617 Updated Jul 19, 2024

Simple image captioning model

Jupyter Notebook 1,409 225 Updated Jun 9, 2024

LaTeX template for dissertations in Peking University

TeX 590 195 Updated Apr 25, 2024

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 184,029 26,297 Updated Jan 16, 2026