-
University of Wisconsin-Madison
Highlights
- Pro
-
-
-
-
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedOct 7, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedOct 7, 2025 -
verifiers Public
Forked from PrimeIntellect-ai/verifiersEnvironments for LLM Reinforcement Learning
Python MIT License UpdatedSep 23, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-FactoryUnified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python Apache License 2.0 UpdatedSep 22, 2025 -
RL-Factory Public
Forked from Simple-Efficient/RL-FactoryTrain your Agent model via our easy and efficient framework
Python Apache License 2.0 UpdatedSep 20, 2025 -
-
-
-
langgraph Public
Forked from langchain-ai/langgraphBuild resilient language agents as graphs.
Python MIT License UpdatedJul 12, 2025 -
-
EmoBench Public
Forked from Sahandfer/EmoBench[ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models
Python MIT License UpdatedMay 16, 2025 -
persona-hub Public
Forked from tencent-ailab/persona-hubOfficial repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Python UpdatedFeb 19, 2025 -
gradio-chatgpt-app Public
Forked from aimerou/gradio-chatgpt-appA demonstration of a chatbot interface that uses the OpenAI ChatGPT API
Python UpdatedSep 19, 2024 -
mint-bench Public
Forked from xingyaoww/mint-benchOfficial Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and …
Python Apache License 2.0 UpdatedJun 4, 2024 -
pychrono-feedstock Public
Forked from conda-forge/pychrono-feedstockA conda-smithy repository for pychrono.
Shell BSD 3-Clause "New" or "Revised" License UpdatedMay 12, 2024 -
-
-
llama-2-7B-4bit-python-coder Public
Forked from edumunozsala/llama-2-7B-4bit-python-coderFine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..
Jupyter Notebook GNU General Public License v3.0 UpdatedFeb 13, 2024 -
low-fidelity-dynamic-models Public
Forked from uwsbel/low-fidelity-dynamic-modelsA library of fast and accurate low fidelity dynamic models for applications in robotics
C++ MIT License UpdatedFeb 6, 2024 -
-
tabnet Public
Forked from dreamquark-ai/tabnetPyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf
Python MIT License UpdatedNov 13, 2023 -
LLMs_interview_notes Public
Forked from YangQianli92/LLMs_interview_notesLLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
-
ODE Public
Forked from Francesco-Zeno-Costanzo/ODEdifferent methods for solving several ode
Python GNU General Public License v3.0 UpdatedAug 25, 2023 -
heatherjiazg.github.io Public
Forked from yingxin-jia/heatherjiazg.github.ioHTML Other UpdatedAug 21, 2023 -
RLHF-Label-Tool Public
Forked from SupritYoung/RLHF-Label-Tool用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.
Python UpdatedAug 1, 2023 -
academicpages.github.io Public
Forked from xinyan-wang-stat/academicpages.github.ioGithub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript MIT License UpdatedJun 23, 2023 -
test Public
Forked from hendrycks/testMeasuring Massive Multitask Language Understanding | ICLR 2021
Python MIT License UpdatedMay 28, 2023