Stars
Apache Fluss is a streaming storage built for real-time analytics.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Open-Source ClickHouse http proxy and load balancer
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Apache Spark - A unified analytics engine for large-scale data processing
flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去…
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
https://github.com/CyC2018/Interview-Notebook