Stars
Collect Some Paper Of Compute Advertising
A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Fast and Accurate ML in 3 Lines of Code
A quicker, cleaner way to get started blogging with Hydejack.
High accuracy RAG for answering questions from scientific documents with citations
A curated list of Decision Transformer resources (continually updated)
A website startup template using the Chirpy theme gem.
pytorch入门项目,包括线性回归、垃圾分类、水果目标检测、ssd
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Deep Learning Specialization by Andrew Ng on Coursera.
此 GitHub 作为《不明白播客》官网的备份站,用于分享文字版播客。 版权所有 ©️ 不明白播客 bumingbai.net
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2
A collection of resources for Recommender Systems (RecSys)
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
A list of semi to fully remote-friendly companies (jobs) in tech.
NeuroCSUT / DeepMind-Atari-Deep-Q-Learner-2Player
Forked from DorianKodelja/DeepMind-Atari-Deep-Q-Learner-2PlayerMultiagent Cooperation and Competition with Deep Reinforcement Learning
Repo containing code for multi-agent deep reinforcement learning (MADRL).