Skip to content
View yichuan-w's full-sized avatar

Highlights

  • Pro

Block or report yichuan-w

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tools for various benchmarking scenarios of the Weaviate Query Agent

Jupyter Notebook 6 1 Updated Sep 24, 2025

cuML - RAPIDS Machine Learning Library

C++ 4,959 594 Updated Oct 15, 2025

cuVS - a library for vector search and clustering on the GPU

Cuda 541 133 Updated Oct 14, 2025

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 218 17 Updated Jun 13, 2025

Highly Performant, Modular, Memory Safe and Production-ready Inference, Ingestion and Indexing built in Rust πŸ¦€

Rust 740 67 Updated Oct 5, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 90,731 8,041 Updated Oct 13, 2025

Fast and memory-efficient exact kmeans

Python 105 6 Updated Sep 30, 2025

Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"

Python 933 102 Updated Mar 4, 2024

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 2,015 156 Updated Jan 15, 2025

High-Performance Engine for Multi-Vector Search

Rust 170 10 Updated Oct 7, 2025

Tinker, but open-source one

2 Updated Oct 8, 2025

What does gpt-oss tell us about OpenAI's training data?

Python 25 2 Updated Sep 19, 2025

How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models

Python 32 Updated Oct 6, 2025

Late Interaction Models Training & Retrieval

Python 619 47 Updated Oct 14, 2025

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

Python 244 32 Updated Aug 4, 2025
Python 52 1 Updated Feb 27, 2025

Communication-Efficient Diffusion Denoising Parallelization via Reuse-then-Predict Mechanism (NIPS'25)

Python 12 Updated Oct 6, 2025

Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. πŸ‘¨πŸ»β€πŸ³

336 27 Updated Jun 2, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 15,953 1,199 Updated Oct 11, 2025

SOTA search powered LLM

Python 3,686 341 Updated Apr 4, 2025
Python 71 11 Updated Aug 7, 2025

XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval

Jupyter Notebook 58 3 Updated Jun 20, 2024

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,705 255 Updated May 17, 2025

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 16,372 1,134 Updated Oct 4, 2025

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 578 44 Updated Sep 15, 2025

πŸ“„πŸ§  PageIndex: Document Index for Reasoning-based RAG

Python 2,764 206 Updated Oct 14, 2025

XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.

Python 167 13 Updated May 3, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,655 455 Updated Oct 14, 2025

Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a single tool call.

Python 179 21 Updated Mar 29, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,248 205 Updated Oct 6, 2025
Next