Skip to content
View dzhulgakov's full-sized avatar

Block or report dzhulgakov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Optimizing inference proxy for LLMs

Python 3,264 261 Updated Dec 25, 2025

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,667 304 Updated May 21, 2025

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 555 34 Updated Oct 28, 2023

A list of startups that have employee-friendly terms for exercising your options past 90 days.

1,190 140 Updated Mar 7, 2025

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python 4,694 384 Updated Dec 17, 2025

Development repository for the Triton language and compiler

MLIR 18,083 2,495 Updated Jan 10, 2026

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 913 218 Updated Jan 7, 2026

miniz: Single C source file zlib-replacement library, originally from code.google.com/p/miniz

C++ 2,614 379 Updated Sep 21, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 96,506 26,479 Updated Jan 11, 2026

Write PyTorch code at the level of individual examples, then run it efficiently on minibatches.

Python 484 22 Updated Feb 12, 2022

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Jupyter Notebook 2,507 465 Updated Sep 12, 2025

Demo of running NNs across different frameworks

Jupyter Notebook 1,656 356 Updated Oct 8, 2022

The convertor/conversion of deep learning models for different deep learning frameworks/softwares.

3,247 482 Updated Jun 26, 2023

Original Python version of Intel® Nervana™ Graph

Python 214 38 Updated Oct 5, 2022