- Göttingen
-
02:38
(UTC +01:00) - https://jpwahle.com
- @jpwahle
- in/jan-philip-wahle
Stars
A modernized, complete, self-contained TeX/LaTeX engine, powered by XeTeX and TeXLive.
verl: Volcano Engine Reinforcement Learning for LLMs
[ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use
Web interface for browsing, search and filtering recent arxiv submissions
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
An open-source AI agent that brings the power of Gemini directly into your terminal.
news-please - an integrated web crawler and information extractor for news that just works
Repository for the LREC 2022 submission on Emotion Word Dynamics in Geolocated Tweet data.
Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"
The official evaluation implementation of the paper "BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages"
an application to analyze and visualize research data
The ultimate training toolkit for finetuning diffusion models
A library for advanced large language model reasoning
Language-annotated Abstraction and Reasoning Corpus
SemEval2024-task 11: Bridging the Gap in Text-Based Emotion Detection
Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
🐝 When Agent Meets RL and Prompt Optimization the First Time
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
A framework for few-shot evaluation of language models.
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.