Skip to content
View thsno02's full-sized avatar

Block or report thsno02

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 59,340 12,263 Updated Nov 7, 2025

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,860 367 Updated Dec 7, 2024

A quick guide (especially) for trending instruction finetuning datasets

3,317 223 Updated Nov 28, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 50,217 8,396 Updated Nov 12, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,412 5,827 Updated Aug 14, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,040 284 Updated Nov 26, 2025

中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.

HTML 209 36 Updated Apr 15, 2024

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 1,838 71 Updated May 13, 2024

A curation of awesome tools, documents and projects about LLM Security.

1,457 146 Updated Aug 20, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,039 3,018 Updated Aug 15, 2024

Official Repository for "The Curious Case of Neural Text Degeneration"

HTML 166 18 Updated Apr 18, 2023

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,885 144 Updated Dec 30, 2024

[CVPR 2024 & TPAMI 2025] UniRepLKNet

Python 1,046 60 Updated Aug 10, 2025

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Python 4,739 505 Updated Sep 25, 2025

12 Weeks, 24 Lessons, AI for All!

Jupyter Notebook 43,961 8,708 Updated Nov 26, 2025

🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…

TypeScript 68,167 14,094 Updated Nov 28, 2025
Python 2,551 306 Updated May 19, 2024

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,406 104 Updated Mar 3, 2024

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,563 6,542 Updated Nov 11, 2025

互联网常用敏感词、停止词词库

1,487 640 Updated Jun 4, 2024

互联网常用敏感词库

359 186 Updated Dec 4, 2018

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 64,121 11,585 Updated Nov 28, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,280 4,775 Updated Jun 2, 2025

小火箭 shadowrocket 配置文件 模块 脚本 module sgmodule 图文教程 规则 分流 破解 解锁

JavaScript 6,022 317 Updated Nov 28, 2025
Python 821 82 Updated Sep 14, 2023

Train transformer language models with reinforcement learning.

Python 16,452 2,321 Updated Nov 27, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 16,658 1,314 Updated Oct 6, 2025

Examples and guides for using the OpenAI API

Jupyter Notebook 69,408 11,639 Updated Nov 26, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,775 2,068 Updated May 19, 2025

Forum for discussing Internet censorship circumvention

Python 4,701 106 Updated Sep 30, 2025
Next