-
parquet-mr Public
Forked from apache/parquet-java
Apache Parquet
Java Apache License 2.0 Updated Nov 20, 2025 -
beam Public
Forked from apache/beam
Apache Beam is a unified programming model for Batch and Streaming data processing.
Java Apache License 2.0 Updated Nov 18, 2025 -
hadoop-connectors Public
Forked from GoogleCloudDataproc/hadoop-connectors
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Java Apache License 2.0 Updated Sep 9, 2025 -
scala-steward Public
Forked from scala-steward-org/scala-steward
🤖 A bot that helps you keep your projects up-to-date
Scala Apache License 2.0 Updated Sep 4, 2025 -
sbt-avro Public
Forked from sbt/sbt-avro
sbt plugin for compiling Avro schemas, similar to sbt-protobuf
Scala Other Updated May 7, 2025 -
sbt-missinglink Public
Forked from scalacenter/sbt-missinglink
An sbt plugin for missinglink
Scala Apache License 2.0 Updated Dec 5, 2022 -
missinglink Public
Forked from spotify/missinglink
Build time tool for detecting link problems in java projects
Java Apache License 2.0 Updated Dec 1, 2022 -
socco-ng Public
Forked from regadas/socco-ng
socco-ng is a fork from criteo/socco: A Scala compiler plugin to generate documentation from Scala source files.
Scala Apache License 2.0 Updated Jan 14, 2022 -
flytepropeller Public
Forked from flyteorg/flytepropeller
FlytePropeller is a Kubernetes native operator, that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and is extensible using the flyteplugins/pluginmachinery interface
Go Apache License 2.0 Updated Sep 13, 2021 -
flyteplugins Public
Forked from flyteorg/flyteplugins
Flyte Backend Plugins contributed by the Flyte community.
Go Apache License 2.0 Updated Sep 10, 2021 -
styx Public
Forked from spotify/styx
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
Java Apache License 2.0 Updated May 6, 2021 -
nevillelyh.github.io Public
Forked from nevillelyh/nevillelyh.github.io
Repository for www.lyh.me
HTML Updated Mar 12, 2020 -
luigi Public
Forked from spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Python Apache License 2.0 Updated Jul 8, 2019 -
scio Public
Forked from spotify/scio
A Scala API for Apache Beam and Google Cloud Dataflow.
Scala Apache License 2.0 Updated Nov 26, 2018