Skip to content
View xuemei-ye's full-sized avatar

Block or report xuemei-ye

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Collect Some Paper Of Compute Advertising

6 2 Updated Oct 29, 2021

A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.

3,667 946 Updated Dec 20, 2024
Shell 1 Updated Oct 5, 2025

Deep Reinforcement Learning

4,233 649 Updated Dec 10, 2022

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,822 134 Updated Jan 17, 2025

Fast and Accurate ML in 3 Lines of Code

Python 9,479 1,065 Updated Sep 26, 2025

A quicker, cleaner way to get started blogging with Hydejack.

HTML 170 461 Updated Sep 14, 2024

https://hrl.boyuai.com/

Jupyter Notebook 4,016 742 Updated Nov 22, 2022

High accuracy RAG for answering questions from scientific documents with citations

Python 7,759 778 Updated Oct 11, 2025

A curated list of Decision Transformer resources (continually updated)

823 35 Updated Sep 12, 2025

A minimal, type-focused Jekyll theme.

SCSS 702 797 Updated Aug 10, 2024

A website startup template using the Chirpy theme gem.

Shell 939 485 Updated Jul 26, 2025

pytorch入门项目,包括线性回归、垃圾分类、水果目标检测、ssd

Python 125 24 Updated Jun 17, 2020

水果检测并分类

Python 24 7 Updated Feb 24, 2022

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 12,284 1,173 Updated Oct 12, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,389 2,040 Updated May 19, 2025

"桃李“: 国际中文教育大模型

Python 183 21 Updated Nov 13, 2023

Deep Learning Specialization by Andrew Ng on Coursera.

Jupyter Notebook 7,689 5,517 Updated May 22, 2019

此 GitHub 作为《不明白播客》官网的备份站,用于分享文字版播客。 版权所有 ©️ 不明白播客 bumingbai.net

1,565 48 Updated Apr 24, 2025

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,626 6,161 Updated Jul 13, 2023

AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2

Python 41,547 11,593 Updated Nov 12, 2024

A collection of resources for Recommender Systems (RecSys)

536 115 Updated Dec 13, 2021

18.06 course at MIT

Jupyter Notebook 2,897 744 Updated May 12, 2025

翻墙-科学上网

Kotlin 40,656 7,393 Updated Oct 6, 2025

DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)

Python 66 7 Updated Nov 8, 2019

learning fomula

Jupyter Notebook 293 60 Updated Jul 24, 2021

A list of semi to fully remote-friendly companies (jobs) in tech.

JavaScript 39,281 3,848 Updated Oct 12, 2025

Multiagent Cooperation and Competition with Deep Reinforcement Learning

Lua 123 35 Updated Nov 26, 2015

Repo containing code for multi-agent deep reinforcement learning (MADRL).

Python 720 124 Updated Apr 12, 2023
Next