Skip to content
View tanakr1's full-sized avatar

Block or report tanakr1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

📱Beautiful, fast and modern React Native UI library

TypeScript 1,391 59 Updated Nov 26, 2025

🦄 Tailwindcss first-class variant API

TypeScript 3,081 80 Updated Nov 22, 2025

🚀 Beautiful, fast and modern React UI library. (Previously NextUI)

TypeScript 27,455 2,007 Updated Nov 27, 2025

The developers' cloud

TypeScript 53,696 4,845 Updated Nov 27, 2025

A video player for iOS、macOS、tvOS、visionOS , based on AVPlayer and FFmpeg, support the horizontal, vertical screen. support adjust volume, brightness and seek by slide, SwiftUI, support subtitles.

Swift 1,375 256 Updated Nov 24, 2025

Code for KaLM-Embedding models

Python 100 6 Updated Jun 30, 2025

Controllable Animation Video Generation with Large Models-based Multimodal Agents

Jupyter Notebook 216 11 Updated Nov 2, 2025

A Survey of Attributions for Large Language Models

220 9 Updated Aug 24, 2024

The development and future prospects of large multimodal reasoning models.

551 20 Updated Aug 2, 2025

Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the making!

Python 1,055 144 Updated Mar 19, 2025

Uni-MoE: Lychee's Large Multimodal Model Family.

Python 1,021 59 Updated Nov 24, 2025

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

84 3 Updated Nov 18, 2025

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 373 24 Updated Nov 25, 2025
Jupyter Notebook 580 33 Updated Nov 26, 2025

KVAE 1.0

Jupyter Notebook 18 Updated Nov 20, 2025

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 472 25 Updated Nov 26, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,736 176 Updated May 20, 2025

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Python 2,449 325 Updated Oct 17, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,519 113 Updated Oct 31, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,332 1,136 Updated Nov 21, 2025
Python 1,113 97 Updated Oct 22, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 1,251 107 Updated Nov 18, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,346 108 Updated Nov 27, 2025

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,505 59 Updated Jun 14, 2025

Open-source unified multimodal model

Python 5,372 469 Updated Oct 27, 2025

Depth Anything 3

Jupyter Notebook 2,901 216 Updated Nov 25, 2025

AirPods liberated from Apple's ecosystem.

Kotlin 17,686 824 Updated Nov 25, 2025

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,937 596 Updated Jul 4, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,064 522 Updated Jun 9, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,748 179 Updated Oct 4, 2025
Next