Skip to content
View stanathong's full-sized avatar
🌈
🌈
  • London, UK

Block or report stanathong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,120 3,944 Updated Nov 10, 2025

[ICCV 2023 Oral] ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes

Python 328 31 Updated Oct 30, 2025

[arXiv 2025] Generative View Stitching

Python 72 3 Updated Nov 7, 2025

Repo for the Complete Agentic AI Engineering Course

Jupyter Notebook 2,835 2,316 Updated Nov 10, 2025

[ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.

Python 31 2 Updated Oct 18, 2025

Implementation of paper EditCLIP: Representation Learning for Image Editing (ICCV 2025)

Python 31 1 Updated Jun 29, 2025

Official repository for "MaskControl: Spatio-Temporal Control for Masked Motion Synthesis" ICCV 2025 (Oral & Award Candidate)

Python 138 8 Updated Nov 9, 2025

RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.

Python 4,166 465 Updated Nov 5, 2025

[ICCV'25] 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

Python 70 6 Updated Oct 14, 2025

Benchmarking Visual-Inertial SLAM at City Scale (ICCV 2025).

Python 116 5 Updated Nov 10, 2025

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,969 376 Updated Apr 21, 2025

Towards Unified Image Deblurring using a Mixture-of-Experts Decoder

Python 12 1 Updated Oct 13, 2025

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,541 411 Updated Oct 16, 2025

[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking

Python 418 17 Updated Nov 3, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,109 1,550 Updated Sep 5, 2024

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,206 557 Updated Nov 3, 2025
Python 719 52 Updated May 6, 2024

DeepMVS: Learning Multi-View Stereopsis

Python 354 82 Updated May 9, 2022

Production-grade 3D gaussian splatting with CPU/GPU support for Windows, Mac and Linux 🚀

C++ 1,602 146 Updated Sep 4, 2025

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Python 1,967 134 Updated Feb 23, 2024

Improving large 3D Reconstruction Models through geometry and texture Refinement

Python 5 Updated Aug 4, 2025

This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are o…

4,319 556 Updated Aug 5, 2025

LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans

285 6 Updated Jul 4, 2025

GenAI Processors is a lightweight Python library that enables efficient, parallel content processing.

Python 1,988 186 Updated Nov 10, 2025

Repo for baseline codes of Digital Twin Catalog project.

Python 67 5 Updated Nov 7, 2025

[NeurIPS 2025, Spotlight] Rectified Point Flow: Generic Point Cloud Pose Estimation

Python 145 7 Updated Nov 6, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 82,070 9,181 Updated Nov 10, 2025

CVPR'21 "Multi-view 3D Reconstruction of a Texture-less Smooth Surface of Unknown Generic Reflectance"

Python 83 7 Updated Dec 4, 2021
Next