IIIT Bangalore
- Bangalore
- in/rittik-panda-752143169
- @rittik_panda
Stars
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control
The official repository of "Astra : General Interactive World Model with Autoregressive Denoising"
A practical guide to diffusion models, implemented from scratch.
LLM Council works together to answer your hardest questions
Convert BVH motion files to UE animation at runtime.
Open Source framework for voice and multimodal conversational AI
Official implementation of "Continuous Autoregressive Language Models"
A real-time implementation of Voice Activity Projection (VAP), aimed at controlling behaviors of spoken dialogue systems, such as turn-taking.
(CVPR 2023) PyTorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
Local persistent memory store for LLM applications, including Claude Desktop, GitHub Copilot, Codex, Antigravity, etc.
Implementation for the paper "Can Language Models Learn to Listen?"
Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility A…
[CVPR 2025] This is the official source for our paper "DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations"
PantoMatrix: Generating Face and Body Animation from Speech
This repository collects papers on VLLM applications. We will update new papers irregularly.
This repository collects papers on Human-Interaction-Motion-Generation applications. We will update new papers irregularly.
🏃♀️ A curated list about human motion capture, analysis and synthesis.
Lecture slides for the DeepMind x UCL 2021 reinforcement learning course available on YouTube
A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Official repository for the ICCV 2019 paper "Neural 3D Morphable Models: Spiral Convolutional Networks for 3D Shape Representation Learning and Generation"