Fluss with Iceberg integration

Dockerfile 5 Updated Oct 14, 2025

Manage your database schema as code

Go 7,444 311 Updated Oct 2, 2025

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 100,725 53,328 Updated Oct 20, 2025

Learn Python using your Java Knowledge

Python 60 79 Updated Dec 9, 2019

Converts a JSON schema to a Spark schema (struct) representation

Scala 12 5 Updated Mar 18, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,033 1,184 Updated Oct 22, 2025

A Model Context Protocol (MCP) server for discovering data products and requesting access in Data Mesh Manager, and executing queries on the data platform to access business data.

Python 40 3 Updated Oct 22, 2025

Interactive CLI for analyzing Kafka health and configuration according to best practices and industry standards.

JavaScript 80 7 Updated Oct 20, 2025

Testing framework for Databricks notebooks

Python 309 44 Updated Apr 20, 2024

Python Testing for Databricks

Python 100 10 Updated Oct 17, 2025

An example showing how to apply software engineering best practices to Databricks notebooks.

Python 143 73 Updated Jul 24, 2024

The Metadata Platform for your Data and AI Stack

Java 11,154 3,241 Updated Oct 22, 2025

POC of a Spring Boot - DataHub integration reporting its data lineage.

Java 3 1 Updated Jun 13, 2025

Serialization format for row-based incremental data processing

Rust 131 12 Updated Jul 15, 2025

Fastest SQL pipeline engine in a single C++ binary, for stream processing, analytics, observability and AI.

C++ 2,010 94 Updated Oct 22, 2025

🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).

Rust 16,940 678 Updated Oct 22, 2025

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Python 2,201 246 Updated Oct 22, 2025

Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pain points while using Apache Kafka for pub-sub message queue…

Java 92 15 Updated Oct 7, 2025

Used to generate mock Avro data

Java 7 1 Updated May 17, 2024

Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme

Python 11,383 1,210 Updated Oct 11, 2025

Docker container with a data volume from s3.

Shell 280 68 Updated Mar 28, 2024

Python 75 39 Updated Jul 9, 2025

A Kubernetes controller to watch changes in ConfigMap and Secrets and do rolling upgrades on Pods with their associated Deployment, StatefulSet, DaemonSet and DeploymentConfig – [✩Star] if you're u…

Go 9,244 601 Updated Sep 15, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,529 573 Updated Aug 27, 2025

Enforce Data Contracts

Python 701 171 Updated Oct 20, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 39,955 2,564 Updated Oct 21, 2025

A library that provides an in-memory Kafka instance to run your tests against.

Scala 408 46 Updated Oct 21, 2025

Secure and fast microVMs for serverless computing.

Rust 30,819 2,119 Updated Oct 22, 2025

Open-source search and retrieval database for AI applications.

Rust 24,016 1,878 Updated Oct 22, 2025

A curated list of awesome ASGI servers, frameworks, apps, libraries, and other resources

Python 1,774 107 Updated Oct 12, 2025