-
spark-bigquery-connector Public
Forked from GoogleCloudDataproc/spark-bigquery-connectorBigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Java Apache License 2.0 UpdatedAug 21, 2025 -
OpenLineage Public
Forked from OpenLineage/OpenLineageAn Open Standard for lineage metadata collection
Java Apache License 2.0 UpdatedJul 14, 2025 -
mongo-spark Public
Forked from mongodb/mongo-sparkThe MongoDB Spark Connector
Java Apache License 2.0 UpdatedMay 7, 2025 -
spark-bigtable-connector Public
Forked from GoogleCloudDataproc/spark-bigtable-connectorScala Apache License 2.0 UpdatedMar 27, 2025 -
hbase-connectors Public
Forked from apache/hbase-connectorsApache HBase Connectors
Scala Apache License 2.0 UpdatedSep 4, 2024 -
spark-spanner-connector Public
Forked from GoogleCloudDataproc/spark-spanner-connectorCloud Spanner Connector for Apache Spark
Java Apache License 2.0 UpdatedJul 23, 2024 -
docs Public
Forked from OpenLineage/docsDocumentation and website for OpenLineage
HTML UpdatedJul 8, 2024 -
flink-kubernetes-operator Public
Forked from apache/flink-kubernetes-operatorApache Flink Kubernetes Operator
-
1brc Public
Forked from gunnarmorling/1brc1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
Java Apache License 2.0 UpdatedJan 6, 2024 -
beam Public
Forked from apache/beamApache Beam is a unified programming model for Batch and Streaming data processing.
Java Apache License 2.0 UpdatedSep 26, 2023 -
scio Public
Forked from spotify/scioA Scala API for Apache Beam and Google Cloud Dataflow.
Scala Apache License 2.0 UpdatedAug 31, 2023 -
jupyter-images Public
Forked from getindata/jupyter-imagesReceipes of publicly-available Jupyter images
Dockerfile MIT License UpdatedDec 28, 2021 -
-
-
-
-
-
aperte-workflow-core Public
Forked from maciejpawlak/aperte-workflow-coreAperte Workflow Core
Java Other UpdatedAug 18, 2015