Stars
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Minimal reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
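Abu quantitative trading system (stocks, options, futures, Bitcoin, machine learning): an open-source, Python-based framework for quantitative trading and quantitative investment
Circumventing the firewall, free circumvention, free access to the open internet, free nodes, free proxies, free ss/ssr/v2ray/trojan nodes, Lantern, Google store, circumvention proxies, overseas games, foreign games, VPN, VPN recommendations, updated daily, accessing overseas sites, overseas sites, V2rayN, Qv2ray, V2rayW, V2RayS, Mellow, V2rayX, V2rayU, ClashX, Kitsunebi, BifrostV, i2Ray, Quantumult, Surge 4, w…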
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
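A good project for campus recruitment (autumn and spring hiring rounds) and internships: a hands-on guide to building, from scratch, an LLM inference framework that supports LLama2/3 and Qwen2.5.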
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
A Chinese-language guide to prompting ChatGPT, with usage guides for a variety of scenarios. Learn how to get it to do what you ask.
Simple retrieval from LLMs at various context lengths to measure accuracy
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
A framework for the evaluation of autoregressive code generation language models.
This list has grown very large and now differs from the original idea. It collects premium software in various categories.
Efficient Training (including pre-training and fine-tuning) for Big Models
Large World Model -- Modeling Text and Video with Million-Length Context
DLRover: An Automatic Distributed Deep Learning System
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Chinese Mixtral mixture-of-experts large language models (Chinese Mixtral MoE LLMs)
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving a 3x+ generation speedup on reasoning tasks
An Autonomous LLM Agent for Complex Task Solving
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone