Starred repositories
A feature-rich command-line audio/video downloader
🌈Bilibili_video_download - Bilibili video downloader
🚀 Truly open-source AI avatar (digital human) toolkit for offline video generation and digital human cloning.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
🚀🎬 ShortGPT - Experimental AI framework for YouTube Shorts / TikTok channel automation
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
JavaScript library of crypto standards.
automatic mirror of https://git.videolan.org/?p=ffmpeg/nv-codec-headers.git
The WhisperX API is a containerized solution for transcribing audio files using the powerful `whisperx` model. This API provides an easy-to-use endpoint for audio transcription and is packaged into…
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
This repository provides a Docker image for CosyVoice
Industry leading face manipulation platform
dagedan / facefusion
Forked from facefusion/facefusion
Next generation face swapper and enhancer
Industry leading face manipulation platform
Open-Sora: Democratizing Efficient Video Production for All
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
Deeply integrating ChatGPT into your browser; everything you need is here
Robust Speech Recognition via Large-Scale Weak Supervision
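Virtual Idol Sharing Project: an application of eye and facial landmark detection algorithms, based on a monocular RGB camera, to real-time 3D face capture and model driving.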
Essential UI blocks for building mobile web apps.
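A data visualization platform built with Vue + Echarts: eye-catching big-screen dashboard templates and a component library, continuously updated with practical templates for various industries and cool widgets.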