G2P

Python 362 70 Updated Aug 11, 2025

PixNerd: Pixel Neural Field Diffusion

Python 133 4 Updated Sep 15, 2025

JavaScript animation engine

JavaScript 65,300 4,373 Updated Nov 15, 2025

Project page for "MG-Gen: Single Image to Motion Graphics Generation with Layer Decomposition"

Python 8 1 Updated Apr 18, 2025

Mapping Mediapipe's 52 blendshapes to FLAME's expression coefficients and poses.

Jupyter Notebook 47 4 Updated Sep 26, 2025

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 304 16 Updated Mar 12, 2025

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,228 199 Updated Oct 31, 2024

[EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.

Python 208 13 Updated Nov 20, 2025

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 365 14 Updated May 30, 2025

[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.

Jupyter Notebook 1,125 112 Updated Aug 26, 2024

[WIP] Scripts for fine-tuning Whisper

Python 223 30 Updated May 29, 2023

An integrated Japanese analyzer based on foundation models

Python 137 7 Updated Nov 3, 2025

Library to build speech synthesis systems designed for easy and fast prototyping.

Python 399 71 Updated Jun 29, 2024

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Python 263 56 Updated Jan 13, 2025

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook 13,692 1,616 Updated Oct 24, 2025

A Python toolkit for sound source separation.

Python 162 15 Updated May 6, 2025

This is a repository of the YACIS corpus, with information on how to obtain the whole corpus as well as its annotations.

6 Updated Jan 18, 2022

171 10 Updated Sep 11, 2025

Neural network-based singing voice synthesis library for research

Python 736 85 Updated Oct 9, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 91,279 11,449 Updated Sep 8, 2025

A fork of open_jtalk

C++ 67 40 Updated Mar 31, 2025

HTS-style full-context labels for JSUT v1.1

49 2 Updated Apr 16, 2021

Context labels and pronunciation data for the JSUT corpus

75 12 Updated Sep 2, 2021

Bring projects, wikis, and teams together with AI. AppFlowy is the AI collaborative workspace where you achieve more without losing control of your data. The leading open source Notion alternative.

Dart 66,671 4,755 Updated Nov 16, 2025

Google AI 2018 BERT PyTorch implementation

Python 6,501 1,328 Updated Sep 15, 2023

Face recognition using TensorFlow

Python 14,249 4,809 Updated Jul 24, 2023

Siamese and triplet networks with online pair/triplet mining in PyTorch

Python 3,162 633 Updated Apr 29, 2023

Official PyTorch implementation of "Synthesis of Screentone Patterns of Manga Characters"

Python 4 Updated Jul 30, 2023

This is a deep learning project on the Manga109 dataset using YOLOv3

Python 4 2 Updated Jul 22, 2020

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Jupyter Notebook 266 40 Updated Mar 7, 2023