-
moa Public
Forked from Waikato/moa
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detecti…
Java GNU General Public License v3.0 Updated Sep 13, 2022
-
cx-flow Public
Forked from checkmarx-ltd/cx-flow
Checkmarx Scan and Result Orchestration
-
spark Public
Forked from apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 Updated May 4, 2021
-
Align two embeddings (EN - FR) using MUSE (Unsupervised)
-
seq-datasource-v2 Public
Sequence Data Source for Apache Spark
-
scikit-multiflow Public
Forked from scikit-multiflow/scikit-multiflow
A multi-output/multi-label and stream data framework. Inspired by MOA and MEKA, following scikit-learn's philosophy.
Python BSD 3-Clause "New" or "Revised" License Updated Aug 23, 2020
-
parquet-mr Public
Forked from apache/parquet-java
Apache Parquet
Java Apache License 2.0 Updated Apr 21, 2020
-
incubator-iceberg Public
Forked from apache/iceberg
Apache Iceberg (Incubating)
Java Apache License 2.0 Updated Apr 15, 2020
-
cloudera-spark Public
Forked from baeeq/incubator-spark
Mirror of Apache Spark
Scala Apache License 2.0 Updated Oct 28, 2019
-
koalas Public
Forked from databricks/koalas
Koalas: Pandas API on Apache Spark
-
LH-BloomFilter Public
Less Hash Bloom Filter
-
pandas Public
Forked from pandas-dev/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Python BSD 3-Clause "New" or "Revised" License Updated May 26, 2019
-
weka-trunk Public
Forked from Waikato/weka-trunk
Read-only mirror of the official Weka subversion repository (trunk, aka developer version).
Java Updated May 9, 2019
-
autokeras Public
Forked from keras-team/autokeras
accessible AutoML for deep learning.
-
kafka-broker-k8s Public
Deploy a Kafka broker in 2 minutes - Kubernetes