  • Beijing
  • 12:03 (UTC -12:00)


Neural network Quantum Monte Carlo

Python 3 Updated Nov 27, 2025

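Adobe Illustrator plugin for assembling scientific figures: supports copy-pasting with relative positioning, batch setting of shape sizes, one-click automatic image arrangement, and one-click subfigure labels | Adobe Illustrator plugin, specifically designed for scientific illustration, supports copy-pasting with relative positioning, bat…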

JavaScript 169 6 Updated Nov 27, 2025

Fingerprint recognition in Python

Python 13 2 Updated Oct 25, 2025

Dynamic 3D Foundation Model using Causal Transformer

Python 284 18 Updated Oct 7, 2025

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Python 365 12 Updated Nov 27, 2025

A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)

2 Updated Aug 18, 2025

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,430 67 Updated Mar 16, 2025

This project provides a script to perform full model fine-tuning on FLUX.1 [dev]. It is adapted from the original DreamBooth training example in the `diffusers` library.

Python 3 Updated Jun 23, 2025

Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…

JavaScript 7,714 1,635 Updated May 26, 2025
Python 2,226 161 Updated Nov 8, 2024

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,591 2,116 Updated Nov 20, 2025

[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy

Python 850 39 Updated Sep 26, 2025

Official inference repo for FLUX.1 models

Python 24,733 1,823 Updated Jul 31, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,467 1,005 Updated Jul 31, 2025

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 2,985 300 Updated Sep 4, 2025

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 390 37 Updated Apr 20, 2024

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 68 12 Updated Sep 10, 2024

[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer

Python 1,841 140 Updated Jul 3, 2025

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,513 82 Updated Nov 10, 2025

Cosmos-Transfer1-DiffusionRenderer: High-quality video de-lighting and re-lighting based on Cosmos video diffusion framework

Jupyter Notebook 747 55 Updated Oct 2, 2025
Python 18 2 Updated Jun 14, 2025

[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"

Python 306 19 Updated Jun 13, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 807 24 Updated Nov 25, 2025

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,680 456 Updated Sep 25, 2025

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,793 76 Updated Oct 22, 2025

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,566 111 Updated May 29, 2025

PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 418 20 Updated Jun 20, 2025
Python 106 4 Updated Jul 9, 2024

WebGL 3D Gaussian Splat Viewer

JavaScript 2,732 301 Updated Nov 16, 2025

Fast Diffusion Models with Transformers

Python 902 118 Updated Aug 17, 2025