Stars
Learning Multi-Armed Bandits by Examples. Currently covering MAB, UCB, Boltzmann Exploration, Thompson Sampling, Contextual MAB, LinUCB, Deep MAB.
The Game of Life, also known as Conway's Game of Life, is a cellular automaton devised by the British mathematician John Horton Conway in 1970.
Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda
BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).
Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning
React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely cust…
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Latest Advances on System-2 Reasoning
xorbitsai / xllamacpp
Forked from shakfu/cyllamaxllamacpp - a Python wrapper of llama.cpp
Drag-and-drop, grouping, sorting bookmarklet plugin
《动手学大模型Dive into LLMs》系列编程实践教程
A simple screen parsing tool towards pure vision based GUI agent
A curated list of awesome model based RL resources (continually updated)
Safe, minimal import sorting for Python projects.
A PyTorch library for all things Reinforcement Learning (RL) for Combinatorial Optimization (CO)
An elegant PyTorch deep reinforcement learning library.
The interactive graphing library for Python ✨
pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pan…
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
A node graph UI framework written in python using Qt.
how to optimize some algorithm in cuda.