- SF Bay Area
- http://alexyu.net
- @alexyu00
Stars
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
SkyReels-V2: Infinite-Length Film Generative Model
aibo is an Emacs package that leverages OpenAI's chat API to bring ChatGPT into Emacs
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
Quality-Aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment
[ICCV2025] CAD-Recode: Reverse Engineering CAD Code from Point Clouds
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
[CVPR 2025] Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
Generate a video script, voice and a talking face completely with AI
[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
CUDA accelerated rasterization of gaussian splatting
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
An open-source library for GPU-accelerated robot learning and sim-to-real transfer.
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
Sky-T1: Train your own O1 preview model within $450
The official repo of MiniMax-Text-01 and MiniMax-VL-01, a large language model and a vision-language model based on linear attention
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.