Stars
verl: Volcano Engine Reinforcement Learning for LLMs
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world application of a neural net trained with backpropagation.
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, suc…
Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Uplift modeling and causal inference with machine learning algorithms
Learning Scheduling Algorithms for Data Processing Clusters
www2022paper / WWW-2022-PAPER-SUPPLEMENTARY-MATERIALS
Forked from causalcausalcausal/WWW-2022-PAPER-SUPPLEMENTARY-MATERIALSIllustrated Examples from Sutton and Barto
xingtian is a componentized library for the development and verification of reinforcement learning algorithms
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
This repository contains implementations and illustrative code to accompany DeepMind publications
Machine Learning for Combinatorial Optimization - NeurIPS'21 competition
Exact Combinatorial Optimization with Graph Convolutional Neural Networks (NeurIPS 2019)
wingsweihua / gym_cityflow
Forked from myunchul/gym_cityflowAdds CityFlow to Gym
Effcient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning