Skip to content
View dunzeng's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report dunzeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official PyTorch Implementation of Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment

Python 40 1 Updated Aug 12, 2025

A comprehensive collection of IQA papers

TeX 1,395 82 Updated Oct 27, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,492 1,152 Updated Sep 26, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,709 1,366 Updated Nov 27, 2025

Search Self-Play: Pushing the Frontier of Agent Capability without Supervision

Python 63 4 Updated Nov 13, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,363 1,330 Updated Nov 20, 2025

Repo for preprint paper - Understanding Generalization of Federated Learning: the Trade-off between Model Stability and Optimization

1 Updated Oct 19, 2025

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

2,928 132 Updated Nov 27, 2025

This is a public repository for Image Clustering Conditioned on Text Criteria (IC|TC)

Python 91 7 Updated Mar 19, 2024

The implementation for the work "Unconstrained Monotonic Calibration of Predictions in Deep Ranking Systems".

Python 18 6 Updated Jun 11, 2025

[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Python 172 8 Updated Jul 7, 2025

ICLR 2021, Contrastive Learning with Hard Negative Samples

Python 279 31 Updated Aug 26, 2021

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,448 210 Updated Nov 13, 2025

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,613 133 Updated Nov 26, 2025

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

Python 490 45 Updated Nov 13, 2025

The development and future prospects of large multimodal reasoning models.

552 20 Updated Aug 2, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,053 96 Updated Nov 23, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,039 6,951 Updated Nov 27, 2025

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

699 34 Updated Oct 20, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,570 302 Updated Nov 13, 2025
Jupyter Notebook 56 14 Updated Oct 6, 2025

Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs

Python 21 2 Updated Apr 24, 2025

A programming framework for agentic AI

Python 52,006 7,913 Updated Oct 8, 2025

An open protocol enabling communication and interoperability between opaque agentic applications.

Shell 20,834 2,127 Updated Nov 26, 2025

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)

Python 160 21 Updated May 29, 2025

Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%

Python 4,554 375 Updated Jul 29, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,032 8,902 Updated Nov 17, 2025

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]

Jupyter Notebook 580 38 Updated Jul 29, 2025

Playwright MCP server

TypeScript 23,694 1,920 Updated Nov 21, 2025
Next