- Mailchimp
- Atlanta, GA
- http://framebit.org/
- in/emilymaycurtin
Stars
A community-driven, not-so-curated list of projects you can bolt into your AwesomeWM configuration
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
A Kubernetes operator you should never run under any circumstances
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
Faster way to switch between clusters and namespaces in kubectl
mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model training", "model hosting" etc.
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Convert a Betterment .csv file to a Quicken-importable .ofx file
An EXIF-based photo assistant, organizer and workflow automation tool.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
🦄 The Enterprise™ programming language
Build reveal.js presentations in Scala
A more maintainable, easier to share version of the infamous http://mindprod.com/jgloss/unmain.html
Validate your Kubernetes configuration files; supports multiple Kubernetes versions
A convenient package for SparkPi. Useful for testing clusters and such
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A flexible instrumentation package for visualizing the internal operation of Apache Spark and related tools
CBT - fun, fast, intuitive, compositional, statically checked builds written in Scala
A list of Free Software network services and web applications which can be hosted on your own servers
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
apache-spark-on-k8s / spark
Forked from apache/spark
Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apa…
ecurtin / spark-bench
Forked from CODAIT/spark-bench
Benchmark Suite for Apache Spark
Base classes to use when writing tests with Spark
Class materials for a distributed systems lecture series