Skip to content
View xysunn's full-sized avatar

Block or report xysunn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techn…

Python 517 39 Updated Oct 23, 2025

Anserini is a Lucene toolkit for reproducible information retrieval research

Java 1,083 530 Updated Oct 18, 2025

Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022).

Python 13 1 Updated Sep 8, 2022

Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"

Python 32 4 Updated Apr 2, 2025

Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models

Python 45 5 Updated Mar 13, 2022

Official repository of "Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor" (ACL 2024 main)

Python 9 Updated Aug 27, 2024

See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse

Jupyter Notebook 150 49 Updated Aug 3, 2020

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Jupyter Notebook 831 334 Updated Jun 12, 2023

Datasets for Hate Speech Detection

132 13 Updated May 12, 2023

Data for evaluating gender bias in coreference resolution systems.

Python 80 14 Updated May 14, 2019

Repository for the Bias Benchmark for QA dataset.

Python 129 33 Updated Jan 8, 2024

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 65,293 6,787 Updated Oct 16, 2025

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

JavaScript 135,688 18,057 Updated Oct 14, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,975 2,632 Updated Oct 24, 2025

ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.

Python 149 41 Updated Aug 18, 2025

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535

Python 146 10 Updated Feb 21, 2022

ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Models

Python 272 32 Updated Nov 8, 2022

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,300 12,406 Updated Sep 17, 2025

Library for Knowledge Intensive Language Tasks

Python 956 91 Updated Mar 31, 2022

🦜🔗 Build context-aware reasoning applications

Python 117,969 19,414 Updated Oct 24, 2025

EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation

Jupyter Notebook 41 4 Updated Oct 19, 2022

GAP is a gender-balanced dataset containing 8,908 coreference-labeled pairs of (ambiguous pronoun, antecedent name), sampled from Wikipedia for the evaluation of coreference resolution in practica…

Python 227 82 Updated Nov 10, 2020

Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper

81 14 Updated Mar 2, 2021

Repository for research in the field of Responsible NLP at Meta.

Python 202 34 Updated May 15, 2025

Paper collections of retrieval-based (augmented) language model.

232 12 Updated May 24, 2024

Supercharge Your LLM Application Evaluations 🚀

Python 11,184 1,133 Updated Oct 24, 2025

A new markup-based typesetting system that is powerful and easy to learn.

Rust 47,198 1,282 Updated Oct 24, 2025

A simple and elegant Jekyll theme for an academic personal homepage

CSS 886 784 Updated Apr 9, 2025
Next