A deep learning project for automated chorus detection in songs, featuring a command-line interface (CLI) tool that allows users to input a YouTube link and utilize a pre-trained CRNN model to dete…

Jupyter Notebook 44 7 Updated May 21, 2025

ODD2 / ProjectSumo

A network simulation tool for V2X in 5G. The networking operation is built on Python. We utilize Eclipse SUMO (Simulation of Urban MObility) to simulate realistic road traffic. The 5G wireless comm…

Jupyter Notebook 19 2 Updated Aug 7, 2022

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,863 479 Updated May 5, 2025

luridev / qr-github-generator

App for generating QR codes with GitHub logo and export to SVG/PNG/JPEG/WEBP format

Vue 57 Updated Apr 3, 2022

VITA-Group / VLM-3R

VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

Python 289 21 Updated Sep 1, 2025

ypwang61 / One-Shot-RLVR

[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Python 373 35 Updated Oct 13, 2025

ace-step / ACE-Step

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,244 380 Updated Jun 27, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 16,131 1,550 Updated Oct 16, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,945 2,236 Updated Nov 6, 2025

Tencent-Hunyuan / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,248 1,122 Updated Aug 27, 2025

Tencent-Hunyuan / Tencent-Hunyuan-Large

Python 1,584 116 Updated Dec 6, 2024

wzk1015 / Awesome-Vision-to-Music-Generation

[ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.

102 3 Updated Aug 9, 2025

ZeyueT / VidMuse

Python 104 5 Updated Jun 7, 2025

ByungKwanLee / DeepSick-R1

Reproduction of DeepSeek-R1

Python 243 23 Updated Apr 14, 2025

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,189 65 Updated Feb 25, 2025

opendatalab / LOKI

[ICLR 2025 Spotlight] The official implementation of the paper “LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”

Python 170 4 Updated Mar 31, 2025

openai / openai-agents-python

A lightweight, powerful framework for multi-agent workflows

Python 17,194 2,830 Updated Nov 8, 2025

NVlabs / Eagle

Eagle: Frontier Vision-Language Models with Data-Centric Strategies

Python 892 47 Updated Oct 25, 2025

kuleshov-group / bd3lms

[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 881 46 Updated Jul 10, 2025

jiachenzhu / DyT

Code release for DynamicTanh (DyT)

Python 1,020 85 Updated Mar 30, 2025

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,534 340 Updated Oct 21, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,184 214 Updated Nov 8, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 50,734 8,854 Updated Nov 3, 2025

aiiu-lab / DFD-FCG

[CVPR'25] Towards More General Video-based Deepfake Detection through Facial Feature Guided Adaptation for Foundation Model (DFD-FCG)

Python 33 6 Updated Jul 20, 2025

affige / genmusic_demo_list

A list of demo websites for automatic music generation research

739 53 Updated Nov 4, 2025