Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Awesome-LLM: a curated list of Large Language Model
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Making large AI models cheaper, faster and more accessible
Simple UI for LLM Model Finetuning
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Reading list for research topics in multimodal machine learning
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
Unsupervised text tokenizer for Neural Network-based text generation.
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
A curated list of speech and natural language processing resources
NanGe - A Rule-based Chinese-English Machine Translation System
The "Python Machine Learning (1st edition)" book code repository and info resource