Starred repositories
Named Entity Recognition, Entity Linking, and more
A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings
Official code for our paper "An Autoregressive Text-to-Graph Framework for Joint Entity and Relation Extraction" which will be published at AAAI 2024.
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
[EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
A comprehensive, unified and modular event extraction toolkit.
Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
A comprehensive benchmark for entity disambiguation
[ACL 2024] Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
ReFinED is an efficient and accurate entity linking (EL) system.
Repository for Temporal Entity Linking (TempEL), accepted to the NeurIPS 2022 Datasets and Benchmarks track
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. Paper: "T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition" (EACL 2021).
Label shift estimation for transfer difficulty with Familiarity.
A soft and fast pattern matcher for billion-scale corpora.
Thesaurus Based Weakly (Distant) Supervised Named Entity Recognizer
Guideline following Large Language Model for Information Extraction
Document-Level Multi-Event Extraction with Event Proxy Nodes and Hausdorff Distance Minimization
Source code for ACL-IJCNLP 2021 Long paper: Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker.