Skip to content
View chasingegg's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Shanghai, China
  • 18:33 (UTC +08:00)

Highlights

  • Pro

Block or report chasingegg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,431 233 Updated Nov 2, 2025

An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.

Rust 2,287 96 Updated Nov 27, 2025

Segmented Code Adjustment Quantization (SAQ)

C++ 13 2 Updated Sep 22, 2025

Graph Library for Approximate Similarity Search

C++ 134 24 Updated Sep 9, 2025

A lightweight library for the RaBitQ algorithm and its applications in vector search.

C++ 104 27 Updated Oct 13, 2025

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

Rust 4,860 353 Updated Nov 27, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,847 431 Updated Mar 5, 2025

[ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval

Jupyter Notebook 237 9 Updated Nov 6, 2025

Waymo Open Dataset

Python 3,132 679 Updated Jun 10, 2025

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,487 688 Updated Nov 27, 2025

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

C++ 3,309 239 Updated Nov 13, 2025

A complement to pgvector for high performance, cost efficient vector search on large workloads.

Rust 2,440 113 Updated Nov 4, 2025

Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations." Long Paper @ ACM SIG…

Rust 99 9 Updated Oct 23, 2025

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, …

C 1,587 94 Updated Nov 13, 2025

CMU-DB's Cascades optimizer framework

Rust 404 29 Updated Jan 6, 2025

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

C++ 4,205 397 Updated Nov 28, 2025

Apache OpenDAL: One Layer, All Storage.

Rust 4,618 664 Updated Nov 27, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,198 3,916 Updated Nov 28, 2025

CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.

Go 31,521 4,020 Updated Nov 28, 2025

Generate x86 Assembly with Go

Go 2,902 92 Updated Nov 1, 2025

Benchmark for vector databases.

Python 944 284 Updated Nov 28, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,010 225 Updated Oct 16, 2025

Source code for the X Recommendation Algorithm

Scala 67,850 12,626 Updated Sep 8, 2025

An open-source C++ library developed and used at Facebook.

C++ 30,091 5,809 Updated Nov 27, 2025

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 5,766 490 Updated Nov 28, 2025

MOSS is a conversational language model like ChatGPT.

745 25 Updated Apr 20, 2023

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java 14,652 3,618 Updated Nov 28, 2025

Web-scale retrieval for knowledge-intensive NLP

Python 556 27 Updated Dec 6, 2022

PISA: Performant Indexes and Search for Academia

C++ 1,037 73 Updated Oct 25, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,051 6,955 Updated Nov 28, 2025
Next