Skip to content
View menorki's full-sized avatar

Block or report menorki

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Contexts Optical Compression

Python 20,065 1,477 Updated Oct 25, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,251 552 Updated Oct 30, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,859 3,810 Updated Nov 7, 2025

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Jupyter Notebook 235 15 Updated Oct 14, 2025

Comfyui implementation of OpenIXCLab Sec-4B

Python 319 18 Updated Oct 18, 2025

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 406 34 Updated Nov 5, 2025

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,569 260 Updated Sep 22, 2025

Lynx: Towards High-Fidelity Personalized Video Generation

Python 283 35 Updated Sep 26, 2025

Open-source framework for building AI-powered apps in JavaScript, Go, and Python, built and used in production by Google

TypeScript 4,957 551 Updated Nov 7, 2025

Text-audio foundation model from Boson AI

Python 7,601 564 Updated Sep 15, 2025

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 3,174 445 Updated Nov 9, 2025
Python 379 12 Updated Jul 13, 2025

SoTA open-source TTS

Python 14,488 1,962 Updated Sep 25, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,552 194 Updated Nov 7, 2025

[NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving

Python 253 12 Updated Aug 4, 2025

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,649 449 Updated Sep 25, 2025

Taming Stable Diffusion for Lip Sync!

Python 5,102 819 Updated Jun 20, 2025

Connect any AI model to 600+ integrations; powered by MCP 📡 🚀

TypeScript 3,097 341 Updated Nov 10, 2025

DFloat11: Lossless LLM Compression for Efficient GPU Inference

Python 557 33 Updated Aug 24, 2025

Open-source unified multimodal model

Python 5,268 456 Updated Oct 27, 2025

[CVPR 2025 Highlight] UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion

Python 108 8 Updated Sep 30, 2025

[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R

Python 977 56 Updated Oct 18, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,016 112 Updated Oct 29, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,696 307 Updated Nov 10, 2025

ComfyUI Plugin of Nunchaku

Python 2,469 108 Updated Nov 8, 2025

Model Compression Toolbox for Large Language Models and Diffusion Models

Python 693 67 Updated Aug 14, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,314 194 Updated Nov 8, 2025

ACE-Step: A Step Towards Music Generation Foundation Model

Python 3,245 381 Updated Jun 27, 2025

WhisperPlus: Faster, Smarter, and More Capable 🚀

Python 1,916 145 Updated Nov 3, 2025

Expanding FramePack into a multifunction video creation tool

Python 685 76 Updated Nov 1, 2025
Next