jqwang2373

🎯

Focusing

jqwang2373

🎯

Focusing

9 followers · 47 following

University of Wisconsin-Madison

Highlights

RLEQ Public
Forked from Tencent/digitalhuman

Python Other Updated Oct 21, 2025
muti_turn_RL Public

Python Updated Oct 20, 2025
EQPerfBench Public

Updated Oct 14, 2025
verl Public
Forked from volcengine/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python Apache License 2.0 Updated Oct 7, 2025
sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Oct 7, 2025
verifiers Public
Forked from PrimeIntellect-ai/verifiers

Environments for LLM Reinforcement Learning

Python MIT License Updated Sep 23, 2025
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python Apache License 2.0 Updated Sep 22, 2025
RL-Factory Public
Forked from Simple-Efficient/RL-Factory

Train your Agent model via our easy and efficient framework

Python Apache License 2.0 Updated Sep 20, 2025
longform-writing-bench Public
Forked from EQ-bench/longform-writing-bench

HTML Updated Aug 12, 2025
Chrono_data Public

Updated Jul 24, 2025
chrono_agent Public

Jupyter Notebook Updated Jul 12, 2025
langgraph Public
Forked from langchain-ai/langgraph

Build resilient language agents as graphs.

Python MIT License Updated Jul 12, 2025
PrefEval Public
Forked from amazon-science/PrefEval

Python Other Updated May 30, 2025
EmoBench Public
Forked from Sahandfer/EmoBench

[ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models

Python MIT License Updated May 16, 2025
persona-hub Public
Forked from tencent-ailab/persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python Updated Feb 19, 2025
gradio-chatgpt-app Public
Forked from aimerou/gradio-chatgpt-app

A demonstration of a chatbot interface that uses the OpenAI ChatGPT API

Python Updated Sep 19, 2024
mint-bench Public
Forked from xingyaoww/mint-bench

Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and …

Python Apache License 2.0 Updated Jun 4, 2024
pychrono-feedstock Public
Forked from conda-forge/pychrono-feedstock

A conda-smithy repository for pychrono.

Shell BSD 3-Clause "New" or "Revised" License Updated May 12, 2024
Indentify_constraints_from_MBD Public

BSD 3-Clause "New" or "Revised" License Updated Apr 8, 2024
AutoTAMP Public
Forked from yongchao98/AutoTAMP

Jupyter Notebook Updated Apr 2, 2024
llama-2-7B-4bit-python-coder Public
Forked from edumunozsala/llama-2-7B-4bit-python-coder

Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolot,..

Jupyter Notebook GNU General Public License v3.0 Updated Feb 13, 2024
low-fidelity-dynamic-models Public
Forked from uwsbel/low-fidelity-dynamic-models

A library of fast and accurate low fidelity dynamic models for applications in robotics

C++ MIT License Updated Feb 6, 2024
PNODE-for-MBD Public

Python BSD 3-Clause "New" or "Revised" License Updated Jan 5, 2024
tabnet Public
Forked from dreamquark-ai/tabnet

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python MIT License Updated Nov 13, 2023
LLMs_interview_notes Public
Forked from YangQianli92/LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案

1 MIT License Updated Oct 16, 2023
ODE Public
Forked from Francesco-Zeno-Costanzo/ODE

different methods for solving several ode

Python GNU General Public License v3.0 Updated Aug 25, 2023
heatherjiazg.github.io Public
Forked from yingxin-jia/heatherjiazg.github.io

HTML Other Updated Aug 21, 2023
RLHF-Label-Tool Public
Forked from SupritYoung/RLHF-Label-Tool

用于大模型 RLHF 进行人工数据标注排序的工具。A tool for manual response data annotation sorting in RLHF stage.

Python Updated Aug 1, 2023
academicpages.github.io Public
Forked from xinyan-wang-stat/academicpages.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript MIT License Updated Jun 23, 2023
test Public
Forked from hendrycks/test

Measuring Massive Multitask Language Understanding | ICLR 2021

Python MIT License Updated May 28, 2023

jqwang2373

Highlights

RLEQ Public

Uh oh!

muti_turn_RL Public

Uh oh!

EQPerfBench Public

Uh oh!

verl Public

Uh oh!

sglang Public

Uh oh!

verifiers Public

Uh oh!

LLaMA-Factory Public

Uh oh!

RL-Factory Public

Uh oh!

longform-writing-bench Public

Uh oh!

Chrono_data Public

Uh oh!

chrono_agent Public

Uh oh!

langgraph Public

Uh oh!

PrefEval Public

Uh oh!

EmoBench Public

Uh oh!

persona-hub Public

Uh oh!

gradio-chatgpt-app Public

Uh oh!

mint-bench Public

Uh oh!

pychrono-feedstock Public

Uh oh!

Indentify_constraints_from_MBD Public

Uh oh!

AutoTAMP Public

Uh oh!

llama-2-7B-4bit-python-coder Public

Uh oh!

low-fidelity-dynamic-models Public

Uh oh!

PNODE-for-MBD Public

Uh oh!

tabnet Public

Uh oh!

LLMs_interview_notes Public

Uh oh!

ODE Public

Uh oh!

heatherjiazg.github.io Public

Uh oh!

RLHF-Label-Tool Public

Uh oh!

academicpages.github.io Public

Uh oh!

test Public

Uh oh!