Python 240 84 Updated Dec 29, 2020

Custom Jupyter Notebook Themes

CSS 9,842 1,048 Updated Jun 22, 2025

A Python binding for crfsuite

Python 772 222 Updated Sep 5, 2025

Simple web service providing a word embedding model

Python 1,443 355 Updated May 1, 2023

Topic Modelling for Humans

Python 16,226 4,412 Updated Oct 16, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,111 28,874 Updated Oct 18, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,256 30,836 Updated Oct 18, 2025

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`

HTML 9,599 1,492 Updated Apr 15, 2023

A fast, robust Python library to check for offensive language in strings.

Python 651 119 Updated Jul 27, 2024

A universal Python library for detecting and filtering profanity

Python 80 28 Updated Nov 25, 2024

Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …

Java 2,939 631 Updated Oct 9, 2025

Spark: The Definitive Guide's Code Repository

Scala 3,042 2,870 Updated Aug 26, 2020

A board editor for the Halma game. Supports output monitoring/applying and game running.

C# 54 1 Updated Oct 25, 2019

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 323,162 52,707 Updated May 21, 2025

A list of upcoming hackathons from around the world.

HTML 493 287 Updated Dec 27, 2024

Code for: "And the bit goes down: Revisiting the quantization of neural networks"

Python 631 123 Updated Nov 9, 2020

Text and supporting code for Think Stats, 2nd Edition

Jupyter Notebook 4,157 11,400 Updated Jan 23, 2025

Sparkling Pandas

Python 364 79 Updated Jul 6, 2023

A Scala API for Cascading

Scala 3,523 706 Updated May 28, 2023

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 42,841 15,791 Updated Oct 18, 2025

Kafka Connect connector to stream data in real time from Twitter.

Java 127 82 Updated Dec 9, 2022

HOCON parser for Python

Python 522 119 Updated May 30, 2024

Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.

Go 16,370 5,729 Updated Oct 17, 2025

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 792 159 Updated Oct 3, 2025

An application observability facade for the most popular observability tools. Think SLF4J, but for observability.

Java 4,739 1,054 Updated Oct 17, 2025

Java client for InfluxDB

Java 1,199 476 Updated Aug 4, 2025

A Java library that implements application/problem+json

Java 927 92 Updated Oct 3, 2025

Qubole Sparklens tool for performance tuning Apache Spark

Scala 584 143 Updated Jun 26, 2024

Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.

Java 5,989 5,084 Updated Oct 18, 2025