Skip to content
View ggg0919's full-sized avatar

Block or report ggg0919

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)

Python 166 12 Updated Nov 4, 2025

✨✨[AAAI 2026] This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"

Python 73 2 Updated Apr 28, 2025

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,284 4,644 Updated Jun 21, 2022

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,441 177 Updated Mar 28, 2025

GeoIntel using Google's Gemini API to uncover the location where photos were taken through AI-powered geo-location analysis.

Python 757 86 Updated Aug 29, 2025

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

679 25 Updated Aug 22, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,648 1,073 Updated Nov 9, 2025
HTML 91 10 Updated May 10, 2024

The Fast Cross-Platform Package Manager

C++ 7,741 419 Updated Oct 29, 2025

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

16,232 2,821 Updated Feb 1, 2024

PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

Python 1,094 206 Updated Sep 11, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 63,094 9,275 Updated Nov 6, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,410 199 Updated May 7, 2025

[NeurIPS 2023]DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models

Python 46 1 Updated Mar 18, 2024

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 699 65 Updated Sep 19, 2024

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 598 45 Updated May 8, 2024

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 638 29 Updated Dec 23, 2024

OpenMMLab Detection Toolbox and Benchmark

Python 31,969 9,788 Updated Aug 21, 2024

课堂专注度及考试作弊系统、课堂动态点名。情绪识别、表情识别、姿态识别和人脸识别结合

Python 476 88 Updated Apr 5, 2024

🏨TopView工作室一轮考核项目:一个酒店管理系统,提供查看房间,对房间进行模糊查询,预订房间,个人信息管理,房间和酒店信息管理(管理员)等功能,后台使用Java,tomcat,mysql,servlet,jsp实现,没有使用任何框架

Java 304 87 Updated Mar 11, 2021

这是一个基于ssm框架和mysql数据库开发的一个酒店管理系统

Java 102 37 Updated May 12, 2018