Stars
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Tongyi Deep Research, the Leading Open-source Deep Research Agent
《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
Collect Some Paper Of Compute Advertising
[EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.
An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation
Collection of awesome medical dataset resources.
本『ChatGPT资源库(原理/微调/代码/论文)』的初始版本来自July CSDN博客上阅读量高达50万的ChatGPT系列,联合发起人:七月ChatGPT原理课学员,6月初正式对外发布
Header-only C++/python library for fast approximate nearest neighbors
Library for reading and writing large multi-dimensional arrays.
Apache HoraeDB (incubating) is a high-performance, distributed, cloud native time-series database.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Curated tutorials and resources for Large Language Models, AI Painting, and more.
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
The Prometheus monitoring system and time series database.
A production-grade java implementation of RAFT consensus algorithm.
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
The Fastest Distributed Database for Transactional, Analytical, and AI Workloads.
An embedded KV storage engine for learning HBase
A library that provides an embeddable, persistent key-value store for fast storage.