verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,497 3,061 Updated Jan 19, 2026

LLM101n: Let's build a Storyteller

36,171 1,969 Updated Aug 1, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 14,382 2,165 Updated Aug 8, 2024

Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world application of a neural net trained with backpropagation.

Jupyter Notebook 705 78 Updated Feb 3, 2024

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Python 10,392 1,264 Updated Aug 4, 2025

Consistency Distilled Diff VAE

Python 2,206 77 Updated Nov 7, 2023

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Python 1,230 198 Updated Jul 18, 2024

An introductory LLM tutorial for developers: the Chinese edition of Andrew Ng's large language model course series

Jupyter Notebook 23,033 2,793 Updated Jun 12, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 87,096 60,542 Updated Dec 2, 2025

Inference code for Llama models

Python 59,074 9,810 Updated Jan 26, 2025

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, suc…

Jupyter Notebook 3,525 2,114 Updated Jun 14, 2021

Deep Recommenders

Python 331 108 Updated Jul 6, 2023

Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning

Python 47 20 Updated Dec 13, 2019

Papers on Computational Advertising

Python 4,378 1,192 Updated Feb 9, 2021

Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

Python 27 9 Updated Aug 12, 2020

Uplift modeling and causal inference with machine learning algorithms

Python 5,698 843 Updated Nov 7, 2025

Learning Scheduling Algorithms for Data Processing Clusters

Python 320 94 Updated Jun 15, 2021

An implementation of GCN-NPEC for VRP

Python 37 4 Updated Jul 14, 2021

Illustrated Examples from Sutton and Barto

Jupyter Notebook 38 10 Updated May 11, 2023

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

Python 318 89 Updated Sep 12, 2023

Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"

Jupyter Notebook 17 2 Updated Nov 14, 2019

Grokking Deep Reinforcement Learning

Jupyter Notebook 991 269 Updated Feb 4, 2022

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,635 2,830 Updated Jan 8, 2026

Machine Learning for Combinatorial Optimization - NeurIPS'21 competition

Python 139 32 Updated Aug 29, 2022

Exact Combinatorial Optimization with Graph Convolutional Neural Networks (NeurIPS 2019)

Python 402 113 Updated Dec 21, 2021

Adds CityFlow to Gym

Python 32 17 Updated Nov 15, 2021

Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning

Python 51 32 Updated May 17, 2020