Skip to content
View wildphoton's full-sized avatar

Block or report wildphoton

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Entropy Based Sampling and Parallel CoT Decoding

Python 3,420 325 Updated Nov 13, 2024

Official Repo for Open-Reasoner-Zero

Python 2,059 119 Updated Jun 2, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,478 294 Updated Oct 29, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,390 185 Updated Nov 10, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 872 54 Updated Jul 22, 2025

ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,241 76 Updated May 16, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,105 3,943 Updated Nov 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,280 2,458 Updated Nov 10, 2025

Fully open reproduction of DeepSeek-R1

Python 25,623 2,400 Updated Sep 8, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,375 1,523 Updated Apr 24, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,766 99 Updated Mar 18, 2025

A curated reading list of research in Adaptive Computation, Inference-Time Computation & Mixture of Experts (MoE).

158 10 Updated Jan 1, 2025

awesome grounding: A curated list of research papers in visual grounding

1,120 105 Updated Sep 21, 2025

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,761 142 Updated Jul 10, 2025

Refine high-quality datasets and visual AI models

Python 10,015 681 Updated Nov 8, 2025

git extension for {collaborative, communal, continual} model development

Python 215 9 Updated Nov 14, 2024

🏆 A ranked gallery of awesome streamlit apps built by the community

1,323 156 Updated Jun 28, 2024

Lexical Substitution Framework

Python 46 14 Updated Apr 7, 2023

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,491 3,299 Updated Aug 17, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,116 971 Updated Nov 8, 2025

A method to increase the speed and lower the memory footprint of existing vision transformers.

Python 1,116 80 Updated Jun 17, 2024

A playbook for systematically maximizing the performance of deep learning models.

29,363 2,400 Updated Jun 18, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,679 607 Updated Jul 25, 2023

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,909 3,001 Updated Aug 15, 2024

Pre-trained V+L Data Preparation

Python 46 4 Updated Jun 2, 2020

CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet

Python 223 8 Updated Dec 16, 2022

EVA Series: Visual Representation Fantasies from BAAI

Python 2,597 187 Updated Aug 1, 2024
Jupyter Notebook 689 54 Updated Nov 5, 2025

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 5,569 724 Updated Aug 5, 2024

[CVPR 2022] Official code for "Unified Contrastive Learning in Image-Text-Label Space"

Python 402 31 Updated Nov 10, 2023
Next