charlesCXK

🎯

Focusing

Xiaokang Chen charlesCXK

🎯

Focusing

Researcher at DeepSeek AI. <-- Ph.D. student at Peking University

471 followers · 62 following

DeepSeek AI, Peking University
Beijing
charlesCXK.github.io

Achievements

x3 x2

Achievements

x3 x2

Highlights

Organizations

Stars

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,332 60 Updated Dec 7, 2025

computerhistory / AlexNet-Source-Code

This package contains the original 2012 AlexNet code.

Cuda 2,818 365 Updated Mar 12, 2025

UmiMarch / OpenVideo

OpenVideo specializes in the domain of text-to-video generation, with the goal of providing high-quality and diverse video datasets to AI researchers globally.

Python 113 4 Updated May 22, 2025

tensorzero / tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Rust 10,827 754 Updated Jan 19, 2026

ZHO-ZHO-ZHO / ComfyUI-DeepSeek-JanusPro

Python 106 9 Updated Feb 21, 2025

bytedance / Sa2VA

Official Repo For Pixel-LLM Codebase

Python 1,494 106 Updated Jan 14, 2026

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,189 1,810 Updated Feb 26, 2025

apple / ml-aim

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,394 67 Updated Aug 4, 2025

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,665 2,235 Updated Feb 1, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,713 613 Updated Jan 15, 2026

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,098 61 Updated Mar 20, 2025

thunlp / LLaVA-UHD

LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs

Python 411 21 Updated Dec 20, 2025

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,988 532 Updated Sep 25, 2024

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,566 489 Updated Jan 17, 2026

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,253 72 Updated May 21, 2025

srush / annotated-mamba

Annotated version of the Mamba paper

Jupyter Notebook 493 19 Updated Feb 27, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 2,016 134 Updated Aug 27, 2025

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,378 334 Updated Jan 17, 2026

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 50,839 4,192 Updated Jan 16, 2026

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,600 235 Updated Jun 14, 2024

3DTopia / LGM

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 2,020 136 Updated Aug 20, 2024

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,149 272 Updated Jan 10, 2025

lxtGH / OMG-Seg

Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,339 55 Updated Oct 15, 2025

Yuliang-Liu / Monkey

Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)

Python 1,943 139 Updated Oct 23, 2025

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,367 4,773 Updated Jun 2, 2025

chongzhou96 / EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 1,111 50 Updated May 24, 2025

DirtyHarryLYL / LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

874 38 Updated Mar 8, 2025

naver-ai / dual-teacher

Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"

Python 50 5 Updated Nov 16, 2023

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,209 1,104 Updated Dec 26, 2025

ZachGoldberg / Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

13,923 778 Updated Jul 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiaokang Chen charlesCXK

Achievements

Achievements

Highlights

Organizations

Block or report charlesCXK

Stars

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

computerhistory / AlexNet-Source-Code

UmiMarch / OpenVideo

tensorzero / tensorzero

ZHO-ZHO-ZHO / ComfyUI-DeepSeek-JanusPro

bytedance / Sa2VA

deepseek-ai / DeepSeek-VL2

apple / ml-aim

deepseek-ai / Janus

open-compass / VLMEvalKit

bytedance / 1d-tokenizer

thunlp / LLaVA-UHD

deepseek-ai / DeepSeek-V2

EvolvingLMMs-Lab / lmms-eval

facebookresearch / schedule_free

srush / annotated-mamba

NUS-HPC-AI-Lab / VideoSys

showlab / Awesome-Video-Diffusion

unslothai / unsloth

luosiallen / latent-consistency-model

3DTopia / LGM

ali-vilab / VGen

lxtGH / OMG-Seg

Yuliang-Liu / Monkey

lm-sys / FastChat

chongzhou96 / EdgeSAM

DirtyHarryLYL / LLM-in-Vision

naver-ai / dual-teacher

BradyFU / Awesome-Multimodal-Large-Language-Models

ZachGoldberg / Startup-CTO-Handbook