Chongkai Yu PaPaQQQ

🎯

Focusing

Segmentation; Object Detection;

0 followers · 9 following

BIT
Beijing

Stars

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,967 958 Updated Nov 10, 2025

zai-org / GLM-V

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 1,740 103 Updated Oct 28, 2025

QwenLM / Qwen3Guard

Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.

Python 347 21 Updated Oct 21, 2025

MuiseDestiny / zotero-gpt

GPT Meet Zotero.

TypeScript 6,683 282 Updated Oct 11, 2025

guaguastandup / zotero-pdf2zh

PDF2zh for Zotero | Zotero PDF Chinese translation plugin

Python 1,851 88 Updated Nov 5, 2025

windingwind / zotero-pdf-translate

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 9,741 432 Updated Nov 10, 2025

zotero / zotero

Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 12,776 897 Updated Nov 10, 2025

curryqka / AgentThink

[EMNLP 2025] Official implementation: agent-style visual question answering in autonomous driving!

Python 127 4 Updated Sep 27, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,487 58 Updated Jun 14, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,367 31,107 Updated Nov 11, 2025

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL / Swift / Ultra…

Python 3,060 161 Updated Nov 9, 2025

deepseek-ai / DeepSeek-V3

Python 100,221 16,331 Updated Aug 28, 2025

Mini-o3 / Mini-o3

Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"

Python 359 15 Updated Sep 15, 2025

PaddlePaddle / FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

Python 3,558 651 Updated Nov 11, 2025

svenstaro / miniserve

🌟 For when you really just want to serve some files over HTTP right now!

Rust 7,159 350 Updated Nov 1, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,689 11,169 Updated Nov 11, 2025

multimodal-art-projection / MAP-NEO

Python 967 89 Updated Feb 7, 2025

facebookresearch / cc_net

Tools to download and cleanup Common Crawl data

Python 1,030 153 Updated Apr 25, 2023

togethercomputer / RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,844 365 Updated Dec 7, 2024

mosaicml / streaming

A Data Streaming Library for Efficient Neural Network Training

Python 1,412 176 Updated Oct 27, 2025

google-research / deduplicate-text-datasets

Rust 1,250 125 Updated Jul 30, 2024

allenai / dolma

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,341 152 Updated Nov 5, 2025

p-lambda / dsir

DSIR large-scale data selection framework for language model training

Python 265 19 Updated Apr 7, 2024

LDNOOBW / List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words

List of Dirty, Naughty, Obscene, and Otherwise Bad Words

3,233 689 Updated Aug 5, 2024

facebookresearch / MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,708 71 Updated Nov 9, 2025