-
Tsinghua University
- 100084, Beijing, China
-
20:56
(UTC +08:00) - @ShengZhao65735
- https://scholar.google.com/citations?user=sxEizdsAAAAJ
- https://zhaosheng-thu.github.io
Highlights
- Pro
Stars
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
Official Implementation of RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering.
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Inefficient sample code for getting screen contents in Unity on Meta Quest to workaround lack of 'camera access'
A repository including codes for finetuning SV3D and novel view generation.
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Generative Models by Stability AI
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Using Low-rank adaptation to quickly fine-tune diffusion models.
[ICLR 2023 Spotlight] EVA3D: Compositional 3D Human Generation from 2D Image Collections
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A latent text-to-image diffusion model
Open CS Application | 开源CS申请
Body, Eye and Face Tracking code sample.