Skip to content
View bytes-lost's full-sized avatar

Block or report bytes-lost

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 778 56 Updated Jul 31, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,387 1,521 Updated Apr 24, 2025

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 16,134 1,225 Updated Sep 21, 2025

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 429 84 Updated Sep 6, 2024

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 18,318 2,120 Updated Sep 24, 2025

Kortix – build, manage and train AI Agents. Fully Open Source.

TypeScript 18,569 3,173 Updated Nov 10, 2025

所有小初高、大学PDF教材。

Roff 55,493 12,428 Updated Oct 18, 2025

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 59,119 12,246 Updated Nov 7, 2025

Repository hosting code for "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Python 1,543 303 Updated Nov 11, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,490 295 Updated Oct 29, 2025

🪄 Create rich visualizations with AI

TypeScript 14,133 1,252 Updated Nov 7, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 29,747 2,005 Updated Nov 7, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,363 809 Updated Nov 9, 2025

Official Repo for Open-Reasoner-Zero

Python 2,061 119 Updated Jun 2, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,970 123 Updated Apr 3, 2025

坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

42,904 4,355 Updated Mar 20, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,069 3,344 Updated Nov 12, 2025

微信小程序组件 / API / 云开发示例

JavaScript 7,058 2,200 Updated Aug 7, 2025

英语字典 英语词库 字典词库 四级单词 六级单词 考研单词 雅思 托福 SAT GMAT TOEFL GRE

Python 2,834 665 Updated Apr 5, 2024

微信小程序开发资源汇总 💯

49,618 8,964 Updated Feb 20, 2025

CasaOS - A simple, easy-to-use, elegant open-source Personal Cloud system.

Go 32,316 1,788 Updated Aug 6, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 55,701 5,586 Updated Nov 11, 2025

LLM training in simple, raw C/CUDA

Cuda 28,135 3,285 Updated Jun 26, 2025

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

TypeScript 38,952 2,390 Updated Nov 12, 2025

A Massively Parallel Large Scale Self-Play Framework

Python 356 39 Updated Jan 9, 2023

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,386 81 Updated Oct 31, 2025

科技爱好者周刊,每周五发布

78,799 3,700 Updated Nov 7, 2025

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

HTML 3,999 447 Updated Nov 12, 2025

Retrieval and Retrieval-augmented LLMs

Python 10,817 805 Updated Oct 22, 2025

A blazing fast inference solution for text embeddings models

Rust 4,188 322 Updated Nov 11, 2025
Next