jiangzhengkai

zkjiang jiangzhengkai

LLM/MLLM Engineer

209 followers · 451 following

Tencent
Shanghai
21:30 (UTC +08:00)
https://jiangzhengkai.github.io/
@jiang_zhengkai

Achievements

Highlights

Stars

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 382 18 Updated Nov 10, 2025

Lakonik / piFlow

pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

Python 163 4 Updated Nov 10, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 63,427 9,303 Updated Nov 10, 2025

mm-vl / ULM-R1

Co-Reinforcement Learning for Unified Multimodal Understanding and Generation

Python 30 5 Updated Jul 22, 2025

NVlabs / rcm

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 269 13 Updated Nov 5, 2025

NVlabs / LongLive

LongLive: Real-time Interactive Long Video Generation

Python 813 50 Updated Nov 3, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 4,128 250 Updated Nov 10, 2025

Tencent-Hunyuan / HunyuanImage-3.0

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,408 108 Updated Oct 31, 2025

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

JavaScript 22,008 2,214 Updated Oct 17, 2025

apache / arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,138 3,898 Updated Nov 10, 2025

Tencent-Hunyuan / SRPO

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,175 38 Updated Oct 26, 2025

Fredreic1849 / BranchGRPO

BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models

Python 37 Updated Oct 30, 2025

Tencent-Hunyuan / HunyuanImage-2.1

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation

Python 658 50 Updated Oct 14, 2025

jax-ml / scaling-book

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 683 99 Updated Nov 11, 2025

jamez-bondos / awesome-gpt4o-images

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 7,628 1,498 Updated May 26, 2025

Tencent-Hunyuan / HunyuanVideo-Foley

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.

Python 1,245 86 Updated Sep 28, 2025

ByteDance-Seed / m3-agent

Python 1,085 96 Updated Oct 22, 2025

AlibabaPAI / torchacc

PyTorch distributed training acceleration framework

Python 53 9 Updated Aug 13, 2025

Kai-46 / KnapFormer

Python 118 5 Updated Aug 10, 2025

Kai-46 / minFM

HTML 164 9 Updated Oct 27, 2025

Tencent-Hunyuan / MixGRPO

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Python 1,035 42 Updated Oct 13, 2025

MizzenAI / HPSv3

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 216 12 Updated Sep 8, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,000 329 Updated Nov 11, 2025

RiseAI-Sys / DAX

High performance inference engine for diffusion models

Python 94 3 Updated Sep 5, 2025

facebookresearch / MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,709 71 Updated Nov 9, 2025

krea-ai / flux-krea

Official GitHub repository for FLUX.1 Krea [dev].

Python 355 30 Updated Aug 2, 2025

alibaba-damo-academy / Lumos

Lumos Project: Frontier video unified model research by Alibaba DAMO Academy.

Python 141 3 Updated Jul 17, 2025

NVlabs / Long-RL

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 652 23 Updated Sep 24, 2025

nvidia-cosmos / cosmos-predict2

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 660 89 Updated Oct 29, 2025

mit-han-lab / lpd

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

Python 76 6 Updated Jul 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zkjiang jiangzhengkai

Achievements

Achievements

Highlights

Block or report jiangzhengkai

Stars

meituan-longcat / LongCat-Flash-Omni

Lakonik / piFlow

PaddlePaddle / PaddleOCR

mm-vl / ULM-R1

NVlabs / rcm

NVlabs / LongLive

zhaochenyang20 / Awesome-ML-SYS-Tutorial

Tencent-Hunyuan / HunyuanImage-3.0

Gar-b-age / CookLikeHOC

apache / arrow

Tencent-Hunyuan / SRPO

Fredreic1849 / BranchGRPO

Tencent-Hunyuan / HunyuanImage-2.1

jax-ml / scaling-book

jamez-bondos / awesome-gpt4o-images

Tencent-Hunyuan / HunyuanVideo-Foley

ByteDance-Seed / m3-agent

AlibabaPAI / torchacc

Kai-46 / KnapFormer

Kai-46 / minFM

Tencent-Hunyuan / MixGRPO

MizzenAI / HPSv3

QwenLM / Qwen-Image

RiseAI-Sys / DAX

facebookresearch / MetaCLIP

krea-ai / flux-krea

alibaba-damo-academy / Lumos

NVlabs / Long-RL

nvidia-cosmos / cosmos-predict2

mit-han-lab / lpd