Skip to content
View ODD2's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Block or report ODD2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,567 1,193 Updated Oct 11, 2025

Empowering RAG with a memory-based data interface for all-purpose applications!

Python 2,161 154 Updated Sep 11, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 9,933 1,030 Updated Sep 24, 2025

ImageBind One Embedding Space to Bind Them All

Python 8,854 827 Updated Oct 3, 2025

A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input a YouTube link and utilize a pre-trained CRNN model to dete…

Jupyter Notebook 44 7 Updated May 21, 2025

A network simulation tool for V2X in 5G. The networking operation is built on python. We utilize Eclipse SUMO (Simulation of Urban MObility) to simulate realistic road traffic. The 5G wireless comm…

Jupyter Notebook 19 2 Updated Aug 7, 2022

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,863 479 Updated May 5, 2025

App for generating QR codes with GitHub logo and export to SVG/PNG/JPEG/WEBP format

Vue 57 Updated Apr 3, 2022

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

Python 289 21 Updated Sep 1, 2025

[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Python 373 35 Updated Oct 13, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,244 380 Updated Jun 27, 2025

Lets make video diffusion practical!

Python 16,131 1,550 Updated Oct 16, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,945 2,236 Updated Nov 6, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,248 1,122 Updated Aug 27, 2025

[ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.

102 3 Updated Aug 9, 2025
Python 104 5 Updated Jun 7, 2025

Reproduction of DeepSeek-R1

Python 243 23 Updated Apr 14, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,189 65 Updated Feb 25, 2025

[ICLR 2025 Spotlight] The official implementation of the paper “LOKI:A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”

Python 170 4 Updated Mar 31, 2025

A lightweight, powerful framework for multi-agent workflows

Python 17,194 2,830 Updated Nov 8, 2025

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Python 892 47 Updated Oct 25, 2025

[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 881 46 Updated Jul 10, 2025

Code release for DynamicTanh (DyT)

Python 1,020 85 Updated Mar 30, 2025

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,534 340 Updated Oct 21, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,184 214 Updated Nov 8, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 50,734 8,854 Updated Nov 3, 2025

[CVPR'25] Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model (DFD-FCG)

Python 33 6 Updated Jul 20, 2025

a list of demo websites for automatic music generation research

739 53 Updated Nov 4, 2025
Next