jinglong92

Follow

Espresso jinglong92

Follow

8 followers · 14 following

Meituan

Stars

agent-lab / diffusion-integer-programming

Python 7 Updated Sep 10, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,497 3,061 Updated Jan 19, 2026

karpathy / LLM101n

LLM101n: Let's build a Storyteller

36,171 1,969 Updated Aug 1, 2024

karpathy / micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 14,382 2,165 Updated Aug 8, 2024

karpathy / lecun1989-repro

Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world application of a neural net trained with backpropagation.

Jupyter Notebook 705 78 Updated Feb 3, 2024

lucidrains / denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,392 1,264 Updated Aug 4, 2025

openai / consistencydecoder

Consistency Distilled Diff VAE

Python 2,206 77 Updated Nov 7, 2023

jannerm / diffuser

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Python 1,230 198 Updated Jul 18, 2024

datawhalechina / llm-cookbook

面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版

Jupyter Notebook 23,033 2,793 Updated Jun 12, 2025

ChatGPTNextWeb / NextChat

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 87,096 60,542 Updated Dec 2, 2025

meta-llama / llama

Inference code for Llama models

Python 59,074 9,810 Updated Jan 26, 2025

NELSONZHAO / zhihu

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, suc…

Jupyter Notebook 3,525 2,114 Updated Jun 14, 2021

LongmaoTeamTf / deep_recommenders

Deep Recommenders

Python 331 108 Updated Jul 6, 2023

venkatacrc / Budget_Constrained_Bidding

Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning

Python 47 20 Updated Dec 13, 2019

wzhe06 / Ad-papers

Papers on Computational Advertising

Python 4,378 1,192 Updated Feb 9, 2021

tjuHaoXiaotian / ICML-2020-MSBCB

Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

Python 27 9 Updated Aug 12, 2020

uber / causalml

Uplift modeling and causal inference with machine learning algorithms

Python 5,698 843 Updated Nov 7, 2025

hongzimao / decima-sim

Learning Scheduling Algorithms for Data Processing Clusters

Python 320 94 Updated Jun 15, 2021

fulcrum-zou / VRP-GCN-NPEC

An implementation of GCN-NPEC for VRP

Python 37 4 Updated Jul 14, 2021

www2022paper / WWW-2022-PAPER-SUPPLEMENTARY-MATERIALS

Forked from causalcausalcausal/WWW-2022-PAPER-SUPPLEMENTARY-MATERIALS

Jupyter Notebook 34 20 Updated Jan 26, 2022

laxatives / rl

Illustrated Examples from Sutton and Barto

Jupyter Notebook 38 10 Updated May 11, 2023

huawei-noah / xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

Python 318 89 Updated Sep 12, 2023

YRussac / WeightedLinearBandits

Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"

Jupyter Notebook 17 2 Updated Nov 14, 2019

mimoralea / gdrl

Grokking Deep Reinforcement Learning

Jupyter Notebook 991 269 Updated Feb 4, 2022

google-deepmind / deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,635 2,830 Updated Jan 8, 2026

ds4dm / ml4co-competition

Machine Learning for Combinatorial Optimization - NeurIPS'21 competition

Python 139 32 Updated Aug 29, 2022

ds4dm / learn2branch

Exact Combinatorial Optimization with Graph Convolutional Neural Networks (NeurIPS 2019)

Python 402 113 Updated Dec 21, 2021

huanghanchi / Quant-AI-OR-Math-Statistics

141 26 Updated Oct 4, 2021

wingsweihua / gym_cityflow

Forked from myunchul/gym_cityflow

Adds CityFlow to Gym

Python 32 17 Updated Nov 15, 2021

UMich-ML-Group / RL-Ridesharing

Effcient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning

Python 51 32 Updated May 17, 2020