-
tiflow Public
Forked from pingcap/tiflow
This repo maintains DM (a data migration platform) and TiCDC (change data capture for TiDB)
Go Apache License 2.0 Updated Oct 2, 2025 -
-
debezium Public
Forked from debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Java Apache License 2.0 Updated Sep 6, 2025 -
kafka-connect-mq-source Public
Forked from ibm-messaging/kafka-connect-mq-source
This repository contains a Kafka Connect source connector for copying data from IBM MQ into Apache Kafka.
Java Apache License 2.0 Updated Aug 13, 2025 -
kafka-connect-jdbc Public
Forked from confluentinc/kafka-connect-jdbc
Kafka Connect connector for JDBC-compatible databases
Java Other Updated Aug 12, 2025 -
lance-spark Public
Forked from lancedb/lance-spark
Spark integrations for working with Lance datasets
-
lance Public
Forked from lancedb/lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Rust Apache License 2.0 Updated Jul 31, 2025 -
flink-kubernetes-operator Public
Forked from apache/flink-kubernetes-operator
Apache Flink Kubernetes Operator
Java Apache License 2.0 Updated Jul 28, 2025 -
fluss Public
Forked from apache/fluss
Fluss is a streaming storage built for real-time analytics.
-
blaze Public
Forked from apache/auron
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Rust Apache License 2.0 Updated Jun 23, 2025 -
kafka Public
Forked from apache/kafka
Mirror of Apache Kafka
Java Apache License 2.0 Updated Jun 22, 2025 -
-
jmx_exporter Public
Forked from prometheus/jmx_exporter
A process for exposing JMX Beans via HTTP for Prometheus consumption
Java Apache License 2.0 Updated Jun 3, 2025 -
mysql-binlog-connector-java Public
Forked from osheroff/mysql-binlog-connector-java
MySQL Binary Log connector
-
paimon Public
Forked from apache/paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
-
doris-flink-connector Public
Forked from apache/doris-flink-connector
Flink Connector for Apache Doris
Java Apache License 2.0 Updated May 12, 2025 -
datafusion Public
Forked from apache/datafusion
Apache Arrow DataFusion SQL Query Engine
Rust Apache License 2.0 Updated May 7, 2025 -
kafka-connect-xml-converter Public
Forked from ibm-messaging/kafka-connect-xml-converter
A Kafka Connect plugin to make it easier to work with XML data in Kafka Connect pipelines
Java Apache License 2.0 Updated Apr 28, 2025 -
seatunnel Public
Forked from apache/seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Java Apache License 2.0 Updated Apr 25, 2025 -
tispark Public
Forked from pingcap/tispark
TiSpark is built for running Apache Spark on top of TiDB/TiKV
Scala Apache License 2.0 Updated Apr 23, 2025 -
datafusion-comet Public
Forked from apache/datafusion-comet
Apache DataFusion Comet Spark Accelerator
Rust Apache License 2.0 Updated Apr 21, 2025 -
polars Public
Forked from pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Rust Other Updated Apr 19, 2025 -
XChange Public
Forked from knowm/XChange
XChange is a Java library providing a streamlined API for interacting with 60+ Bitcoin and Altcoin exchanges providing a consistent interface for trading and accessing market data.
Java MIT License Updated Apr 13, 2025 -
opendal Public
Forked from apache/opendal
Apache OpenDAL: One Layer, All Storage.
Rust Apache License 2.0 Updated Mar 26, 2025 -
parquet-java Public
Forked from apache/parquet-java
Apache Parquet Java
Java Apache License 2.0 Updated Mar 15, 2025 -
spark Public
Forked from apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
Scala Apache License 2.0 Updated Mar 12, 2025 -
celeborn Public
Forked from apache/celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Java Apache License 2.0 Updated Jan 3, 2025 -
datafusion-orc Public
Forked from datafusion-contrib/datafusion-orc
Implementation of Apache ORC file format use Apache Arrow in-memory format
-
arrow-rs Public
Forked from apache/arrow-rs
Official Rust implementation of Apache Arrow
Rust Apache License 2.0 Updated Dec 24, 2024 -
orc-rust Public
Forked from datafusion-contrib/orc-rust
Rust implementation of Apache ORC
Rust Apache License 2.0 Updated Oct 29, 2024