Stars
A Python framework for accelerated simulation, data generation and spatial computing.
Unify Efficient Fine-tuning of RAG Retrieval, including Embedding, ColBERT, ReRanker.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
[ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
RAGChecker: A Fine-grained Framework For Diagnosing RAG
High-performance retrieval engine for unstructured data
Code and data for "The Power of Noise: Redefining Retrieval for RAG Systems"
Leveraging passage embeddings for efficient listwise reranking with large language models.
This repository presents the original implementation of LumberChunker: Long-Form Narrative Document Segmentation by André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li and Arlindo L.…
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Official repository of the MIRAGE benchmark
MTEB: Massive Text Embedding Benchmark
Retrieval and Retrieval-augmented LLMs
AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that's as simple as writing Pandas code
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
DSPy: The framework for programming—not prompting—language models
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Supercharge Your LLM Application Evaluations 🚀