
Organizations

@zinggAI

In-memory Java DataFrame library

Java 282 28 Updated Nov 15, 2025
Python 1 4 Updated Nov 13, 2025

Moving data tables from one account to another

Python 5 Updated Jan 21, 2025

Example project using Zingg on Databricks

Jupyter Notebook 3 Updated Jan 2, 2025

Run TUIs and terminals in your browser

Python 1,271 29 Updated Aug 30, 2024

Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

C++ 3,545 464 Updated Oct 22, 2025

JVector: the most advanced embedded vector search engine

Java 1,652 140 Updated Nov 18, 2025

Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.

Python 344 27 Updated Jun 16, 2024

Semantic Matrix Operations

Roff 1 Updated Mar 5, 2025

An example of SparkConnect extension.

Java 15 2 Updated Mar 5, 2024

Zingg fuzzy matching for products using metadata and images

Python 9 Updated May 20, 2024

Snowflake Snowpark Java & Scala API

Scala 23 23 Updated Nov 18, 2025

An End-to-End Evaluation Framework for Entity Resolution Systems

Python 32 10 Updated Dec 3, 2023

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 1,115 145 Updated Nov 18, 2025

What's in your data? Extract schema, statistics and entities from datasets

Python 1,528 177 Updated Sep 26, 2025

Schema modelling framework for decentralised domain-driven ownership of data.

Java 259 17 Updated Dec 5, 2023

Translating text attributes (like name, address, phone number) into quantifiable numerical representations; training ML models to determine if these numerical labels form a match; scoring the confide…

Python 30 9 Updated Mar 4, 2024

lakeFS - Data version control for your data lake | Git for data

Go 4,977 407 Updated Nov 18, 2025

A collection of research papers and software related to explainability in graph machine learning.

1,981 135 Updated Apr 4, 2022

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 4,749 462 Updated Nov 17, 2025

A completely-from-scratch hobby operating system: bootloader, kernel, drivers, C library, and userspace including a composited graphical UI, dynamic linker, syntax-highlighting text editor, network…

C 6,542 524 Updated Nov 18, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 7,999 1,529 Updated Nov 18, 2025

A system for quickly generating training data with weak supervision

Python 5,923 857 Updated May 2, 2024

Examples showing real-life use cases for fal + dbt

Jupyter Notebook 22 4 Updated Apr 27, 2022

do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.

Python 857 76 Updated Apr 5, 2024

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such …

JavaScript 804 55 Updated Aug 10, 2022

DuckDB is an analytical in-process SQL database management system

C++ 34,207 2,724 Updated Nov 18, 2025

SPEAR: Programmatically label and build training data quickly.

Jupyter Notebook 109 22 Updated Jun 27, 2024

🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊

C++ 845 80 Updated Nov 18, 2025