Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 82,482 12,390 Updated Jan 4, 2026

Repository deleted

9,799 1,616 Updated Oct 20, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 17,525 2,859 Updated Nov 3, 2025

Awesome-LLM: a curated list of Large Language Models

25,964 2,245 Updated Jul 31, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,927 2,688 Updated Dec 15, 2025

Making large AI models cheaper, faster and more accessible

Python 41,315 4,545 Updated Dec 22, 2025

Simple UI for LLM Model Finetuning

Jupyter Notebook 2,062 132 Updated Dec 21, 2023

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,067 6,645 Updated Sep 30, 2025

Reading list for research topics in multimodal machine learning

6,783 897 Updated Aug 20, 2024

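Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, traditional/simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name collection, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …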

Python 78,295 15,118 Updated May 10, 2024

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

TeX 2,441 443 Updated Aug 9, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,562 1,318 Updated Jan 1, 2026

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,982 2,256 Updated Oct 14, 2025

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java 10,032 2,713 Updated Nov 27, 2025

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, TensorFlow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,794 6,167 Updated Jul 13, 2023

A curated list of speech and natural language processing resources

2,223 294 Updated Apr 2, 2019

NanGe - A Rule-based Chinese-English Machine Translation System

C++ 20 8 Updated Jul 23, 2017

The "Python Machine Learning (1st edition)" book code repository and info resource

Jupyter Notebook 12,568 4,406 Updated Nov 20, 2024