  • Intel
  • Portland, OR


Community maintained hardware plugin for vLLM on Intel Gaudi

Python 21 90 Updated Jan 7, 2026

Helper scripts to install pip, in a Python installation that doesn't have it.

Python 854 332 Updated Oct 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

8 15 Updated Jan 7, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,653 842 Updated Jan 7, 2026

Source for "Neural Magic Workshop: Hands-On AI Optimization with OpenShift AI" Lab

Jupyter Notebook 6 3 Updated Apr 13, 2025

This repository serves as a comprehensive collection of code examples, research papers, and practical resources from my Generative AI (GenAI) series published on MLWhiz.

Jupyter Notebook 10 Updated Jul 6, 2025

Shell 115 15 Updated Jan 7, 2026

Dynamically route user prompts to LoRA adapters or a base LLM using semantic evaluation on Red Hat OpenShift AI with LiteLLM and vLLM.

Python 2 Updated Aug 19, 2025

Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.

Python 144 139 Updated Sep 3, 2025

Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3

C++ 11 7 Updated Jun 17, 2025

The image registry operator installs+maintains the internal registry on a cluster

Go 66 134 Updated Jan 2, 2026

Model Context Protocol Servers

TypeScript 75,694 9,180 Updated Jan 6, 2026

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality

Python 4,513 353 Updated Aug 10, 2024

Tools and pipelines for automated LLM performance evaluation

Python 14 21 Updated Dec 18, 2025

Containerization and cloud native suite for OPEA

Go 1 Updated May 19, 2025

Containerization and cloud native suite for OPEA

Go 1 1 Updated Jul 23, 2025

This repository contains all the Helm charts to deploy the LLM service, the Llama Stack server, the pipeline server configuration, MinIO, and pgvector.

Python 10 22 Updated Dec 9, 2025

Intel® AI Assistant Builder

JavaScript 141 27 Updated Jan 6, 2026

With OpenVINO Test Drive, users can run large language models (LLMs) and models trained by Intel Geti on their devices, including AI PCs and Edge devices.

Dart 35 6 Updated Dec 15, 2025

Enable RHOAI User Workload Metrics for Single Serving Models

8 13 Updated Dec 2, 2025

This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow and PyTorch that have been optimized for Intel platforms. Sc…

Python 58 29 Updated Jan 7, 2026