-
https://github.com/zinggAI/zingg
- India
- @sonalgoyal
Stars
Moving data tables from one account to another
Example project using Zingg on Databricks
Run TUIs and terminals in your browser
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
JVector: the most advanced embedded vector search engine
Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.
An example of SparkConnect extension.
Zingg fuzzy matching for products using metadata and images
Snowflake Snowpark Java & Scala API
An End-to-End Evaluation Framework for Entity Resolution Systems
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
What's in your data? Extract schema, statistics and entities from datasets
Schema modelling framework for decentralised domain-driven ownership of data.
Translating text attributes (like name, address, phone number) into quantifiable numerical representations Training ML models to determine if these numerical labels form a match Scoring the confide…
lakeFS - Data version control for your data lake | Git for data
A collection of research papers and software related to explainability in graph machine learning.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
A completely-from-scratch hobby operating system: bootloader, kernel, drivers, C library, and userspace including a composited graphical UI, dynamic linker, syntax-highlighting text editor, network…
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
A system for quickly generating training data with weak supervision
Examples showing real-life use cases for fal + dbt
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such …
DuckDB is an analytical in-process SQL database management system
SPEAR: Programmatically label and build training data quickly.
🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊