Xueyao Sun xysunn

Stars

DataScienceUIBK / Rankify

🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techn…

Python · 517 stars · 39 forks · Updated Oct 23, 2025

castorini / anserini

Anserini is a Lucene toolkit for reproducible information retrieval research

Java · 1,083 stars · 530 forks · Updated Oct 18, 2025

sabithsn / APPDIA-Discourse-Style-Transfer

Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022).

Python · 13 stars · 1 fork · Updated Sep 8, 2022

s-nlp / paradetox

Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"

Python · 32 stars · 4 forks · Updated Apr 2, 2025

mireshghallah / mixmatch

Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models

Python · 45 stars · 5 forks · Updated Mar 13, 2022

ysw1021 / ScoPE

Official repository of "Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor" (ACL 2024 main)

Python · 9 stars · Updated Aug 27, 2024

ewulczyn / wiki-detox

See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse

Jupyter Notebook · 150 stars · 49 forks · Updated Aug 3, 2020

t-davidson / hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Jupyter Notebook · 831 stars · 334 forks · Updated Jun 12, 2023

aymeam / Datasets-for-Hate-Speech-Detection

Datasets for Hate Speech Detection

132 stars · 13 forks · Updated May 12, 2023

rudinger / winogender-schemas

Data for evaluating gender bias in coreference resolution systems.

Python · 80 stars · 14 forks · Updated May 14, 2019

nyu-mll / BBQ

Repository for the Bias Benchmark for QA dataset.

Python · 129 stars · 33 forks · Updated Jan 8, 2024

dair-ai / Prompt-Engineering-Guide

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX · 65,293 stars · 6,787 forks · Updated Oct 16, 2025

f / awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

JavaScript · 135,688 stars · 18,057 forks · Updated Oct 14, 2025

gabriben / awesome-generative-information-retrieval

706 stars · 51 forks · Updated Oct 7, 2025

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook · 17,975 stars · 2,632 forks · Updated Oct 24, 2025

McGill-NLP / bias-bench

ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.

Python · 149 stars · 41 forks · Updated Aug 18, 2025

princeton-nlp / EntityQuestions

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535

Python · 146 stars · 10 forks · Updated Feb 21, 2022

txsun1997 / Black-Box-Tuning

ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Models

Python · 272 stars · 32 forks · Updated Nov 8, 2022

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML · 14,300 stars · 12,406 forks · Updated Sep 17, 2025

AlexTMallen / adaptive-retrieval

Python · 189 stars · 12 forks · Updated Jul 2, 2025

facebookresearch / KILT

Library for Knowledge Intensive Language Tasks

Python · 956 stars · 91 forks · Updated Mar 31, 2022

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Python · 117,969 stars · 19,414 forks · Updated Oct 24, 2025

txsun1997 / Metric-Fairness

EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation

Jupyter Notebook · 41 stars · 4 forks · Updated Oct 19, 2022

google-research-datasets / gap-coreference

GAP is a gender-balanced dataset containing 8,908 coreference-labeled pairs of (ambiguous pronoun, antecedent name), sampled from Wikipedia for the evaluation of coreference resolution in practica…

Python · 227 stars · 82 forks · Updated Nov 10, 2020