Skip to content
View probe2's full-sized avatar

Block or report probe2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Contain a list of papers on commonsense modeling

2 1 Updated Nov 2, 2019

Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Python 503 41 Updated Dec 31, 2025
Python 873 70 Updated Dec 19, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,560 2,010 Updated Nov 1, 2025

💡 Awesome RAG: A resource of Retrieval-Augmented Generation (RAG) for LLMs, focusing on the development of technology.

401 18 Updated Oct 17, 2025
Python 5 Updated Dec 3, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 52,360 9,197 Updated Jan 5, 2026

R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Python 667 47 Updated Aug 5, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,250 284 Updated Nov 26, 2025

A Survey on Multimodal Retrieval-Augmented Generation

455 22 Updated Nov 8, 2025
3 Updated Sep 28, 2024

Reproduce R1 Zero on Logic Puzzle

Python 2,428 165 Updated Mar 20, 2025

🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techn…

Python 528 40 Updated Oct 23, 2025

Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.

1,378 166 Updated Oct 20, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,435 155 Updated Aug 12, 2025

Replicating O1 inference-time scaling laws

Python 91 4 Updated Dec 1, 2024
Python 3 Updated Mar 25, 2023
5 1 Updated Feb 17, 2025

[EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation

Python 310 35 Updated Oct 18, 2024

Mastering Applied AI, One Concept at a Time

Jupyter Notebook 1,729 192 Updated Nov 27, 2025

The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"

187 4 Updated Oct 28, 2024

An automatic evaluation framework for Multimodal Chain-of-Thought.

Python 5 Updated Nov 7, 2024

Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"

Python 24 2 Updated Nov 10, 2024

O1 Replication Journey

2,003 63 Updated Jan 14, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,687 1,182 Updated Apr 30, 2025

📚 AIGC 求职面经、必备基础知识、提示词工程、ChatGPT、Stable Diffusion、Prompt、Embedding、Fintune 等 AIGC 求职你所需要知道的一切~

764 59 Updated Jun 26, 2024

论文里可以用到的实验图示例

Python 294 66 Updated Jan 24, 2024

A reading list on LLM based Synthetic Data Generation 🔥

1,496 90 Updated Jun 5, 2025
Next