Stars
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.
PDF2zh for Zotero | Zotero plugin for translating PDFs into Chinese
Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.
Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.
[EMNLP2025] Official implementation: agent-style visual question answering in autonomous driving
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL / Swift / Ultra…
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
🌟 For when you really just want to serve some files over HTTP right now!
A high-throughput and memory-efficient inference and serving engine for LLMs
Tools to download and cleanup Common Crawl data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
A Data Streaming Library for Efficient Neural Network Training
Data and tools for generating and inspecting OLMo pre-training data.
DSIR large-scale data selection framework for language model training
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Virtual whiteboard for sketching hand-drawn like diagrams
Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos
[ICCV2025] Referring to any person or object given a natural language description. Code base for RexSeek and the HumanRef Benchmark
[CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".