Skip to content
View dimanzt's full-sized avatar

Highlights

  • Pro

Block or report dimanzt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]

C++ 51 10 Updated Dec 5, 2024

Large Scale Failure recovery algorithms

Python 3 Updated Jan 26, 2017

Example of multi-process, multi-GPU training using Torch-parallel, nVidia-nccl, and nVidia-MPS

Lua 17 4 Updated Sep 22, 2016

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,774 4,644 Updated Nov 19, 2025

PEAKS: Power Efficiency Aware Kubernetes Scheduler

Jupyter Notebook 36 4 Updated Oct 23, 2024

GTNS is a discrete-event network simulator targeted primarily for research and educational use. GTNS is written in Visual C++ programming language and supports different network topologies. This si…

C++ 21 8 Updated Apr 13, 2021

Kubernetes training from basics to advanced

Go 62 6 Updated Nov 20, 2025

Locust load-testing tool on OpenFaaS

Python 9 2 Updated Nov 29, 2017

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…

Python 3,976 762 Updated Oct 28, 2025

A library for Partially Homomorphic Encryption in Python

Python 630 139 Updated Aug 4, 2023
Python 9 2 Updated Apr 19, 2022

Declarative cluster management using constraint programming, where constraints are described using SQL.

Java 102 19 Updated May 9, 2023

AWS virtual gpu device plugin provides capability to use smaller virtual gpus for your machine learning inference workloads

Jupyter Notebook 205 31 Updated Nov 22, 2023

Reference implementations of MLPerf® training benchmarks

Python 1,724 585 Updated Nov 5, 2025

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go 3,197 369 Updated Mar 20, 2025

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,628 2,255 Updated Nov 2, 2025

YOLO3D: End-to-end real-time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud (ECCV 2018)

Python 310 46 Updated Aug 7, 2020
Python 1 2 Updated Jun 15, 2023

Kubernetes networking based on Open vSwitch

Go 1,752 428 Updated Nov 22, 2025

Borg cluster traces from Google

TeX 1,005 206 Updated Aug 14, 2025

Microsoft Azure Traces

Jupyter Notebook 1,030 169 Updated Oct 20, 2025

P4_16 reference compiler

C++ 793 476 Updated Nov 20, 2025
C++ 2 1 Updated May 9, 2020

Sources and examples for ASPLOS20 paper

C++ 14 7 Updated Jul 21, 2020

Set of Experiments for Lambda NIC project

P4 1 2 Updated Apr 23, 2021
Shell 3 3 Updated Dec 10, 2021

Open API for IP Applications to Offload TCP/UDP Session Packet Processing to Hardware

C 21 19 Updated Apr 7, 2023
Next