Stars
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
[CVPR 2023] Unofficial implementation for "VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models"
SVG Differentiable Rendering: Generating vector graphics using neural networks. Supports text-to-SVG, image-to-SVG, and SVG editing.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
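A full-featured automated video processing system that downloads videos from YouTube and other platforms, automatically generates subtitles, translates content, generates metadata, and uploads to Bilibili on a schedule.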
Online collaborative Whiteboard that is simple, free, easy to use and to deploy
Fully automatic censorship removal for language models
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
A custom node for ComfyUI that allows you to perform lip-syncing on videos using the Wav2Lip model. It takes an input video and an audio file and generates a lip-synced output video.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Laravel Migrations Generator: Automatically generate your migrations from an existing database schema.
Snapifit AI: Your personal AI trainer and nutritionist. Get instant, personalized health management guidance, right out of the box.
Feather-2 / paper-burner-x
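Forked from baoyudu/paper-burner
Paper Burner X - A ready-to-use, in-browser tool for AI-powered literature recognition, batch document translation, reading, and intelligent analysis | BYOK, based on Paper Burner
Self-Hosted, Production-Ready OCR Service for Paper Burner X, supporting DeepSeek-OCR and MCP Service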
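📝 An online Markdown editor built on Vue2 and Vditor, supporting flowcharts, Gantt charts, sequence diagrams, task lists, ECharts charts, and musical staff notation, as well as PPT preview, video and audio parsing, and automatic HTML-to-Markdown conversion. https://www.niceshare.site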
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
An open framework for blind navigation based on ESP32
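微舆: A multi-agent public opinion analysis assistant for everyone. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, without relying on any framework.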
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
🚀 The open-source alternative to Twilio.
An Open Source implementation of Notebook LM with more flexibility and features
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
A highly extensible private cloud storage solution for individuals and teams, featuring AI-powered semantic search.