Lists (1)
Sort Name ascending (A-Z)
Starred repositories
[ICRA 2025] Interactive4D: Interactive 4D LiDAR Segmentation
2025 最新校招面试题合集, 面向 2026 届应届生,全网最全整理!收录 1000+道真实面试题以及面经,涵盖阿里、腾讯、字节、美团、百度、华为、小米、英伟达、微软、米哈游等百家大中小厂。每题配备视频解析 or 文字讲解,持续更新中,助力拿下 Dream Offer!
Official Implementation of Paper Transfer between Modalities with MetaQueries
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
AI based foreign language reading and learning tool that allows you to learn foreign languages using any text content of interest,TextLingo是一款兴趣驱动的AI外语阅读与学习软件
Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?
We write your reusable computer vision tools. 💜
本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档,以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Improve your resumes with Resume Matcher. Get insights, keyword suggestions and tune your resumes to job descriptions.
Official implementation of the paper "Watermark Anything with Localized Messages"
[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
Poppy Humanoid is an open-source and 3D printed humanoid robot. Optimized for research and education purposes, its modularity allows for a wide range of applications and experimentations.
💼 Your own AI-powered voice interviewer for hiring.
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO and SAM 2
Refine high-quality datasets and visual AI models
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything