-
ZJU -> UCSD -> Nvidia Research -> ByteDance Seed
- Santa Clara, United States
- https://Seerkfang.github.io
Highlights
- Pro
Stars
AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
official code of “OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding”
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
[RA-L 2023] EasyHeC: Accurate and Automatic Hand-eye Calibration via Differentiable Rendering and Space Exploration
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
[IJCV 2024] P3Former: Position-Guided Point Cloud Panoptic Segmentation Transformer
ManiSkill-Learn is a framework for training agents on SAPIEN Open-Source Manipulation Skill Challenge (ManiSkill Challenge), a physics-rich manipulation skill benchmark with large-scale demonstrati…
[CoRL22] Frame Mining - a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
SAPIEN Manipulation Skill Benchmark (NeurIPS 2021 Track on Datasets and Benchmarks)
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation (CVPR 2022)
[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.