Stars
A simple Java library for interacting with Ollama server.
The most widely used, high performance Minecraft server that aims to fix gameplay and mechanics inconsistencies
Flink Agents is an Agentic AI framework based on Apache Flink
🔍 Unified Search MCP Server - Search across Google Scholar, Web, and YouTube with a single query
A client for connecting and running DDLs on hive metastore.
A tool for visually designing and inspecting Elasticsearch index structures.
🔥 Seata is an easy-to-use, high-performance, open source distributed transaction solution.
The codebase for the book "AI-Powered Search" (Manning Publications, 2025)
Monolingual wordlists with pronunciation information in IPA
Mail Connector for Apache Beam / Google Cloud Dataflow
Orchestrate everything - from scripts to data, infra, AI, and business - as code, with UI and AI Copilot. Simple. Fast. Scalable.
🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
Elasticsearch in Action Book
Example on how to deploy Apache beam, Spark Cluster on Kubernetes and run Python code
An artifact of fully-specified annotations to power static-analysis checks, beginning with nullness analysis.
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
Apache Fluss is a streaming storage built for real-time analytics.
Integrates LLMs as PTransform in Apache Beam pipelines using LangChain
An Apache Beam source to connect and consume data from TREP using the Websocket API.
The Dataflow Solution Guides offer full end-to-end deployment for the most common streaming solutions to run on Dataflow.
Examples for High Performance Spark
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Python library for converting Python calculations into rendered latex.