Starred repositories
Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion".
A ComfyUI node that brings DyPE to FLUX, enabling artifact-free 4K+ image generation
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
Lumina-DiMOO - An Open-Source Multi-Modal Large Diffusion Language Model
The official implementation of "DreamOmni2: Multimodal Instruction-based Editing and Generation"
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
A self-hosted proxy tool for beginners! ArgoSBX one-click, non-interactive script 💣: automatically pairs the Sing-box, Xray, and Argo cores; supports deployment on VPS, Docker, and other container environments; 4 CDN-fronting schemes plus 15 WARP combinations; supported protocols: AnyTLS, Any-reality, Vless-xhttp-reality-vision-enc, Vless-tcp-reality-vision, Vless-xhttp-vision-…
A public-network toolkit for software and hardware routers: IPv6/IPv4 port forwarding, reverse proxy, DDNS, WOL, IPv4 STUN NAT traversal, cron, acme, rclone, FTP, WebDAV, filebrowser
An Open-Source LLM-empowered Foundation TTS System
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Multi-Platform Package Manager for Stable Diffusion
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
To make ComfyUI easier to use, this project optimizes and integrates several commonly used nodes.
ComfyUI node that lets you pick the way in which prompt weights are interpreted
Clip text encoder with BREAK formatting like A1111 (uses conditioning concat)
Custom nodes for ComfyUI such as CLIP Text Encode++
Cloudflare Workers/Pages proxy script (VLESS and Trojan): supports automatic proxyip generation via NAT64, one-click self-hosted proxyip and CF reverse-proxy IPs, a three-region script for Cloudflare's preferred official IPs, and automatic output of the best preferred IPs for the Americas, Asia, and Europe
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Kimi K2 is the large language model series developed by Moonshot AI team
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with, or even surpassing, top TTS …