Highlights
- Pro
-
EVA Public
Forked from baaivision/EVAEVA Series: Visual Representation Fantasies from BAAI
Python MIT License UpdatedSep 11, 2023 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedSep 8, 2023 -
ViT-Lens Public
Forked from TencentARC/ViT-Lens[Preprint] ViT-Lens: Towards Omni-modal Representations
Python Other UpdatedAug 30, 2023 -
Emu Public
Forked from baaivision/EmuEmu: An Open Multimodal Generalist
Python UpdatedJul 13, 2023 -
ULIP Public
Forked from salesforce/ULIPPython BSD 3-Clause "New" or "Revised" License UpdatedJul 5, 2023 -
-
PandaGPT Public
Forked from yxuansu/PandaGPTPandaGPT: One Model To Instruction-Follow Them All
Python Apache License 2.0 UpdatedJun 5, 2023 -
ImageBind Public
Forked from facebookresearch/ImageBindImageBind One Embedding Space to Bind Them All
Python Other UpdatedJun 5, 2023 -
ONE-PEACE Public
Forked from OFA-Sys/ONE-PEACEA general representation modal across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Python Apache License 2.0 UpdatedJun 5, 2023 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedMay 26, 2023 -
open_flamingo Public
Forked from mlfoundations/open_flamingoAn open-source framework for training large multimodal models.
Python MIT License UpdatedMay 24, 2023 -
open_clip Public
Forked from mlfoundations/open_clipAn open source implementation of CLIP.
Jupyter Notebook Other UpdatedMay 24, 2023 -
GANet Public
A Keypoint-based Global Association Network for Lane Detection. Accepted by CVPR 2022
-
LLaVA Public
Forked from haotian-liu/LLaVALarge Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Python Apache License 2.0 UpdatedMay 15, 2023 -
Painter Public
Forked from baaivision/PainterPainter & SegGPT Series: Vision Foundation Models from BAAI
Python MIT License UpdatedMay 14, 2023 -
LAVIS Public
Forked from salesforce/LAVISLAVIS - A One-stop Library for Language-Vision Intelligence
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 12, 2023 -
mmagic Public
Forked from open-mmlab/mmagicOpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, image/video restoration/…
Python Apache License 2.0 UpdatedMay 8, 2023 -
unilm Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedApr 11, 2023 -
Awesome-BEV-Perception-Multi-Cameras Public
Forked from chaytonmin/Awesome-BEV-Perception-Multi-CamerasAwesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird-Eye-View, such as DETR3D, BEVDet, BEVFormer
UpdatedJun 23, 2022 -
bevfusion Public
Forked from mit-han-lab/bevfusionBEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Python Apache License 2.0 UpdatedJun 4, 2022 -
Deformable-DETR Public
Forked from fundamentalvision/Deformable-DETRDeformable DETR: Deformable Transformers for End-to-End Object Detection.
Python Apache License 2.0 UpdatedMay 22, 2022 -
awesome-NeRF Public
Forked from awesome-NeRF/awesome-NeRFA curated list of awesome neural radiance fields papers
TeX MIT License UpdatedMay 19, 2022 -