Skip to content
View developer0hye's full-sized avatar

Block or report developer0hye

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation of Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling

Python 105 4 Updated Nov 24, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 4,621 418 Updated Nov 25, 2025

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python 266 15 Updated Oct 27, 2025

DaD's a pretty good keypoint detector, probably the best.

Python 88 5 Updated Oct 14, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,985 505 Updated May 5, 2025

[ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Python 408 20 Updated Sep 14, 2025

Collection of leaked system prompts

13,598 1,883 Updated Nov 17, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

98,340 26,424 Updated Nov 19, 2025

Fast Multi-dimensional Sparse Attention

C++ 665 52 Updated Nov 19, 2025

MCP integration for Google Calendar to manage events.

TypeScript 790 238 Updated Nov 26, 2025

Integrating SAM2 with DINOv2/v3 for segmentation

Python 68 6 Updated Aug 8, 2025

한국어 문장 임베딩 모델들의 성능을 비교하고 시각화하는 프로젝트입니다. 본 프로젝트는 Claude Opus 4로 구현되었습니다.

Python 2 Updated Jul 30, 2025

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,055 75 Updated Aug 14, 2025

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 486 14 Updated Nov 18, 2025

Benchmarking vision language vision on face tasks

Python 16 1 Updated Mar 30, 2025

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,641 219 Updated Nov 24, 2025

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 491 41 Updated Oct 27, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,734 271 Updated Nov 28, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 85,073 9,672 Updated Nov 28, 2025
Python 28 6 Updated Apr 22, 2024

PyTorch native quantization and sparsity for training and inference

Python 2,540 376 Updated Nov 27, 2025

Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".

Python 184 17 Updated Apr 18, 2024

[CVPR 2024 & TPAMI 2025] UniRepLKNet

Python 1,047 60 Updated Aug 10, 2025

InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)

Python 339 24 Updated Dec 2, 2024

[ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers than Transformers for ConvNets?"

HTML 281 24 Updated Jul 5, 2023

[CVPR2025] Official code for Lost in Translation Found in Context

Python 21 Updated Jun 13, 2025

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Python 10,437 886 Updated Oct 12, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,654 850 Updated Nov 28, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 809 24 Updated Nov 25, 2025

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python 1,191 100 Updated Sep 2, 2023
Next