An extensible distributed system for reliable nearline data streaming at scale

Java 950 140 Updated Nov 11, 2025

A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.

Java 302 74 Updated Oct 30, 2025

📚 List of awesome university courses for learning Computer Science!

64,610 8,280 Updated May 4, 2023

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,195 3,389 Updated Nov 22, 2025

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

66,843 6,696 Updated Oct 4, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 327,299 53,346 Updated Nov 3, 2025

Getting Started with Spring Boot 3:

Java 37,300 54,006 Updated Nov 22, 2025

Hadoop filesystem implementation for Aliyun OSS

Java 13 Updated Feb 14, 2016

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and bat…

Java 2,251 750 Updated Nov 20, 2025

Simple JVM Profiler Using StatsD and Other Metrics Backends

Java 333 85 Updated May 9, 2023

Sends stacktrace-level performance data from a JVM process to Riemann.

Clojure 294 19 Updated Sep 3, 2024

Chef cookbook to install Dr Elephant for Hadoop.

HTML 3 4 Updated Jul 13, 2021

Docker files for LinkedIn's Dr. Elephant https://github.com/linkedin/dr-elephant

Dockerfile 9 10 Updated Dec 24, 2018

Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark

Java 1,368 850 Updated Aug 22, 2023

Mirror of Apache Oozie

Java 726 475 Updated Jan 27, 2025