developer0hye

Yonghye Kwon developer0hye

practical

238 followers · 187 following

MarkAny
Seoul, Korea
https://www.linkedin.com/in/yonghye-kwon-91641a174/

Achievements

x3 x4 x3

Stars

seominseok0429 / Upsample-Anything-A-Simple-and-Hard-to-Beat-Baseline-for-Feature-Upsampling

Official Implementation of Upsample Anything: A Simple and Hard to Beat Baseline for Feature Upsampling

Python 105 4 Updated Nov 24, 2025

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 4,621 418 Updated Nov 25, 2025

kyegomez / NaViT

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python 266 15 Updated Oct 27, 2025

Parskatt / dad

DaD's a pretty good keypoint detector, probably the best.

Python 88 5 Updated Oct 14, 2025

apple / ml-fastvlm

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,985 505 Updated May 5, 2025

UCSC-VLAA / OpenVision

[ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Python 408 20 Updated Sep 14, 2025

jujumilk3 / leaked-system-prompts

Collection of leaked system prompts

13,598 1,883 Updated Nov 17, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

98,340 26,424 Updated Nov 19, 2025

SHI-Labs / NATTEN

Fast Multi-dimensional Sparse Attention

C++ 665 52 Updated Nov 19, 2025

nspady / google-calendar-mcp

MCP integration for Google Calendar to manage events.

TypeScript 790 238 Updated Nov 26, 2025

WZH0120 / SAM2-UNeXT

Integrating SAM2 with DINOv2/v3 for segmentation

Python 68 6 Updated Aug 8, 2025

developer0hye / korean-sentence-embedding-example

A project that compares and visualizes the performance of Korean sentence embedding models. This project was implemented with Claude Opus 4.

Python 2 Updated Jul 30, 2025

DAMO-NLP-SG / VideoLLaMA3

Frontier Multimodal Foundation Models for Image and Video Understanding

Jupyter Notebook 1,055 75 Updated Aug 14, 2025

OpenGVLab / VideoChat-Flash

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 486 14 Updated Nov 18, 2025

yhenon / llm-face-vision

Benchmarking vision language models on face tasks

Python 16 1 Updated Mar 30, 2025

roboflow / maestro

streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,641 219 Updated Nov 24, 2025

tue-mps / eomt

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 491 41 Updated Oct 27, 2025

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,734 271 Updated Nov 28, 2025

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 85,073 9,672 Updated Nov 28, 2025

plncmm / llmner

Python 28 6 Updated Apr 22, 2024

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 2,540 376 Updated Nov 27, 2025

luogen1996 / RepAdapter

Official implementation of "Towards Efficient Visual Adaption via Structural Re-parameterization".

Python 184 17 Updated Apr 18, 2024

AILab-CVC / UniRepLKNet

[CVPR 2024 & TPAMI 2025] UniRepLKNet

Python 1,047 60 Updated Aug 10, 2025

sail-sg / inceptionnext

InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)

Python 339 24 Updated Dec 2, 2024

VITA-Group / SLaK

[ICLR 2023] "More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity"; [ICML 2023] "Are Large Kernels Better Teachers than Transformers for ConvNets?"

HTML 281 24 Updated Jul 5, 2023

art-jang / LiTFiC

[CVPR2025] Official code for Lost in Translation Found in Context

Python 21 Updated Jun 13, 2025

Olow304 / memvid

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Python 10,437 886 Updated Oct 12, 2025

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,654 850 Updated Nov 28, 2025

PKU-YuanGroup / UniWorld

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 809 24 Updated Nov 25, 2025

KMnP / vpt

❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119

Python 1,191 100 Updated Sep 2, 2023

Lists (31)

6d-pose-estimation

Action Recognition

Agentic-AI

Backbone

cmake

Color Recognition

Crawling

CS-STUDY

cuda

Detection

DETR

Faster

ffmpeg

Human-Detection-Dataset

image-dewarping

llm

media-processing

mini

OCR

OpenDataset

polygon-estimation

Production

Productivity

QA

REID

STT

Tracking

vit-lora

VLM

Youtube

자기계발 (Self-Improvement)
