- The Chinese University of Hong Kong
- Hong Kong
- UTC +08:00
- li-jinsong.github.io
- in/li-jinsong
- @li_jinsong_2002
Stars
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation
Explain Before You Answer: A Survey on Compositional Visual Reasoning
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
Reference PyTorch implementation and models for DINOv3
This is the code related to "🔥Effective Training Data Synthesis for Improving MLLM Chart Understanding" (ICCV 2025).
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowseComp and xBench.
Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"
Survival analysis using deep learning methods
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
A framework for few-shot evaluation of language models.
Official repository of "ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing"
slime is an LLM post-training framework for RL Scaling.
A Collection of Papers on Diffusion Language Models
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Official PyTorch implementation for "Large Language Diffusion Models"
Let's make video diffusion practical!
Dataset introduced in PlotQA: Reasoning over Scientific Plots