shimin Young1993

I may be slow to respond.

Eng.D; Algorithm Researcher; (V)LLM Developer / ALIBABA Cloud

4 followers · 4 following

Institute of Computing Innovation, Zhejiang University; Master of The Hong Kong Polytechnic University
Hangzhou
https://www.zhihu.com/people/leeshimin
https://twitter.com/younglishimin?s=21&t=zmqUbpol0Bc6wAIytuAIUQ

Lists (1)

🚀 My stack

1 repository

Starred repositories

wyczzy / AIGI-Holmes

(ICCV 2025) This repository is the official implementation of AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models

Python 136 2 Updated Jul 22, 2025

apple / ml-depth-pro

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,965 375 Updated Apr 21, 2025

iflow-ai / iflow-cli

iFlow cli is a comprehensive command-line intelligence that embeds in your terminal, analyzes your repositories, does coding tasks, interprets your needs across contexts, and boosts efficiency by p…

Shell 3,017 217 Updated Nov 3, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,490 1,280 Updated Oct 12, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,649 2,115 Updated Jul 17, 2025

Yuchen413 / text2image_safety

Python 193 16 Updated Apr 7, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 20,040 3,308 Updated Nov 8, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 24,605 1,808 Updated Jul 31, 2025

jeonsworld / ViT-pytorch

PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)

Jupyter Notebook 2,091 395 Updated Jun 7, 2022

Young1993 / tlm

The public code of the EMNLP 2023 (main conference) paper "TLM: Token-Level Masking for Transformers"

Python 5 1 Updated Feb 28, 2024

john-hewitt / backpacks-flash-attn

The original Backpack Language Model implementation, a fork of FlashAttention

Python 69 6 Updated May 29, 2023

salesforce / LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,007 1,080 Updated Nov 18, 2024

AI4Finance-Foundation / FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook 17,984 2,545 Updated Oct 3, 2025

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,736 865 Updated Jun 10, 2024

jessevig / bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 7,743 848 Updated Jun 1, 2025

embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark

Python 2,956 501 Updated Nov 7, 2025

xlang-ai / instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 2,015 156 Updated Jan 15, 2025

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,551 46,113 Updated Nov 8, 2025

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,493 3,298 Updated Aug 17, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,242 4,777 Updated Jun 2, 2025

nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 76,878 8,298 Updated May 27, 2025

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 15,923 1,269 Updated Jan 18, 2025

Young1993 / UGEN

Incorporating Instructional Prompts into A Unified Generative Framework for Joint Multiple Intent Detection and Slot Filling (COLING 2022, Oral)

Python 8 4 Updated May 8, 2023

swz30 / Restormer

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Python 2,273 282 Updated Oct 23, 2025