Skip to content
View semchan's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report semchan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

R1-like Video-LLM for Temporal Grounding

Python 124 3 Updated Jun 20, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 12,789 1,220 Updated Oct 28, 2025

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 11,746 1,134 Updated Nov 8, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,772 296 Updated Jun 12, 2025

这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。

Python 669 90 Updated Feb 18, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,712 983 Updated Nov 6, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,857 898 Updated Sep 30, 2025

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 69,619 8,389 Updated Sep 20, 2025

Recommend new arxiv papers of your interest daily according to your Zotero libarary.

Python 3,965 3,463 Updated Aug 16, 2025

[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats

Python 506 23 Updated Oct 14, 2025

[ACM MM24] MotionMaster: Training-free Camera Motion Transfer For Video Generation

Python 97 3 Updated Oct 15, 2024

In 2024, the strongest open-source implementation of asymmetric magvit_v2 supports inference code but excludes VQVAE. It supports the joint encoding of images and videos, accommodating arbitrary vi…

Python 151 1 Updated Jul 30, 2024

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 629 32 Updated Oct 16, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,113 1,211 Updated Nov 4, 2025

Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"

Python 20 1 Updated Jul 4, 2023

code for "MVOC:atraining-free multiple video object composition method with diffusion models"

Python 23 2 Updated Jul 3, 2024

Code for FreeTraj, a tuning-free method for trajectory-controllable video generation

Python 108 3 Updated Sep 19, 2025
Python 166 17 Updated Apr 9, 2024

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,791 298 Updated Jun 28, 2024

Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".

Python 212 24 Updated Mar 9, 2024

python库,实现推送实时rtmp音视频流

C++ 134 36 Updated Apr 17, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 12,068 1,071 Updated Oct 29, 2025

code for UPST-NeRF: Universal Photorealistic Style Transfer of Neural Radiance Fields for 3D Scene

Python 71 7 Updated Mar 20, 2023

Public code release for: ColorfulCurves: Palette-Aware Lightness Control and Color Editing via Sparse Optimization (SIGGRAPH 2023) [Ted Chao, Jason Klein, Jianchao Tan, Jose Echevarria, Yotam Gingold]

Python 55 4 Updated Nov 14, 2023

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 7,971 478 Updated Oct 1, 2025

An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing work.

Python 110 16 Updated Aug 26, 2023
Python 425 37 Updated Nov 1, 2023

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,584 191 Updated Sep 18, 2025
Next