Stars
Augmented Vertex Block Descent (AVBD) reference implementation
Isaac Lab API, powered by MuJoCo-Warp, for RL and robotics research.
AI agents can now use real Android and iOS apps, just like a human.
MCP server for document format conversion using pandoc.
A language for fast, portable data-parallel computation
High-performance C++ library for multiphysics and multibody dynamics simulations
[RSS 2025] "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"
Real-time Face and Iris Landmarks Detection using C++
Chiri's DX11 wrapper to enable fixing broken stereoscopic effects.
A multi-core-friendly rigid body physics and collision detection library. Written in C++. Suitable for games and VR applications. Used by Horizon Forbidden West.
ECS-driven 2D and 3D physics engine for the Bevy game engine.
Official implementation of the paper [DeepTag: A General Framework for Fiducial Marker Design and Detection]
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
OpenMMLab Detection Toolbox and Benchmark
State-of-the-art 2D and 3D Face Analysis Project
One Sided Box Filter for Edge-preserving Image Processing
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
jsakamoto / CSharpProlog
Forked from JohnPool/CSharpProlog. A C# implementation of Prolog (port from https://sourceforge.net/p/cs-prolog)
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
An Efficient Text-to-Image Generation Pretrain Pipeline