Highlights
-
iceberg Public
Forked from apache/icebergApache Iceberg
Java Apache License 2.0 UpdatedNov 8, 2025 -
spark Public
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing
-
-
spark-on-k8s-operator Public
Forked from kubeflow/spark-operatorKubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Go Apache License 2.0 UpdatedMay 19, 2025 -
trino Public
Forked from trinodb/trinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Java Apache License 2.0 UpdatedMar 23, 2025 -
debezium Public
Forked from debezium/debeziumChange data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Java Apache License 2.0 UpdatedNov 19, 2024 -
debezium-jdbc-sink Public
Forked from debezium/debezium-connector-jdbcAn exploration for building a JDBC sink connector aware of the Debezium change event format
Java Apache License 2.0 UpdatedJul 28, 2024 -
cloudevents-sdk-java Public
Forked from cloudevents/sdk-javaJava SDK for CloudEvents
Java Apache License 2.0 UpdatedJul 19, 2024 -
airflow Public
Forked from apache/airflowApache Airflow
-
superset Public
Forked from apache/supersetApache Superset (incubating) is a modern, enterprise-ready business intelligence web application
-
spark-excel Public
Forked from nightscape/spark-excelA Spark plugin for reading and writing Excel files
Scala Apache License 2.0 UpdatedApr 2, 2023 -
ansible-superset Public archive
Ansible playbook for Apache Superset
-
onnx-rest Public archive
A Simple and Fast Rest API for productionization the ONNX models
-
pypinot Public archive
A DB-API to interact with Apache Pinot
-
ibis Public
Forked from ibis-project/ibisExpressive analytics in Python at any scale.
Python Apache License 2.0 UpdatedNov 2, 2022 -
datahub Public
Forked from datahub-project/datahubThe Metadata Platform for the Modern Data Stack
Java Apache License 2.0 UpdatedOct 28, 2022 -
confluent-kafka-python Public
Forked from confluentinc/confluent-kafka-pythonConfluent's Kafka Python Client
Python Other UpdatedAug 3, 2022 -
-
kafka Public
Forked from apache/kafkaMirror of Apache Kafka
Java Apache License 2.0 UpdatedJul 20, 2022 -
kafka-connect-jdbc Public
Forked from confluentinc/kafka-connect-jdbcKafka Connect connector for JDBC-compatible databases
Java Other UpdatedJun 8, 2021 -
testcontainers-python Public
Forked from testcontainers/testcontainers-pythonPython Apache License 2.0 UpdatedJan 6, 2021 -
awesome-ci Public
Simple, lightweight Docker images to do all stuff easily on CI/CD
GNU General Public License v3.0 UpdatedOct 31, 2020 -
debezium-incubator Public
Forked from debezium/debezium-incubatorNew Debezium modules and connectors in incubation phase
Java Apache License 2.0 UpdatedOct 22, 2020 -
mleap Public
Forked from combust/mleapMLeap: Deploy Spark Pipelines to Production
Scala Apache License 2.0 UpdatedJul 3, 2020 -
onnxmltools Public
Forked from onnx/onnxmltoolsONNXMLTools enables conversion of models to ONNX
Python MIT License UpdatedJun 24, 2020 -
ansible Public
Forked from ansible/ansibleAnsible is a radically simple IT automation platform that makes your applications and systems easier to deploy. Avoid writing scripts or custom code to deploy and update your applications — automat…
Python GNU General Public License v3.0 UpdatedOct 14, 2019 -
-
python-instagram Public
Forked from facebookarchive/python-instagramPython Client for Instagram API
Python Other UpdatedAug 1, 2018 -
Instagram-API-python Public
Forked from tmhlsky/Instagram-API-pythonUnofficial instagram API, give you access to ALL instagram features (like, follow, upload photo and video and etc)! Write on python.
Python Other UpdatedJul 19, 2018 -
bbqsql Public
Forked from CiscoCXSecurity/bbqsqlSQL Injection Exploitation Tool
Python Other UpdatedJul 15, 2018