Stars
Community maintained hardware plugin for vLLM on Intel Gaudi
Helper scripts to install pip in a Python installation that doesn't have it.
red-hat-data-services / vllm
Forked from opendatahub-io/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Supercharge Your LLM with the Fastest KV Cache Layer
Source for "Neural Magic Workshop: Hands-On AI Optimization with OpenShift AI" Lab
This repository serves as a comprehensive collection of code examples, research papers, and practical resources from my Generative AI (GenAI) series published on MLWhiz.
Dynamically route user prompts to LoRA adapters or a base LLM using semantic evaluation on Red Hat OpenShift AI with LiteLLM and vLLM.
Resources, demos, recipes, and more for working with LLMs on OpenShift with OpenShift AI or Open Data Hub.
Intel® Gaudi® Software is an implementation of the runtime and graph compiler for Gaudi3
The image registry operator installs and maintains the internal registry on a cluster
Model Context Protocol Servers
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
Tools and pipelines for automated LLM performance evaluation
rhai-code / GenAIInfra
Forked from edlee123/GenAIInfra
Containerization and cloud native suite for OPEA
edlee123 / GenAIInfra
Forked from opea-project/GenAIInfra
Containerization and cloud native suite for OPEA
This repository contains all the Helm charts to deploy the LLM service, the Llama Stack server, the pipeline server configuration, MinIO, and pgvector
Intel® AI Assistant Builder
With OpenVINO Test Drive, users can run large language models (LLMs) and models trained by Intel Geti on their devices, including AI PCs and Edge devices.
Enable RHOAI User Workload Metrics for Single Serving Models
This repository contains Dockerfiles, scripts, YAML files, Helm charts, etc., used to scale out AI containers with versions of TensorFlow and PyTorch that have been optimized for Intel platforms. Sc…