- Amazon
- Seattle, WA
- https://www.saidarahas.com
Starred repositories
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Exa MCP for web search and web crawling!
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
DoEKS is a tool to build, deploy and scale Data Platforms on Amazon EKS
Lightweight coding agent that runs in your terminal
Democratizing Reinforcement Learning for LLMs
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
Achieve state of the art inference performance with modern accelerators on Kubernetes
A best practices guide for day 2 operations, including operational excellence, security, reliability, performance efficiency, and cost optimization.
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
Gateway API Inference Extension
A universal scalable machine learning model deployment solution
Cost-efficient and pluggable infrastructure components for GenAI inference
This is the public roadmap for AWS container services (ECS, ECR, Fargate, and EKS).
This repository contains examples to help customers get started with the Amazon Bedrock service, covering all available foundation models.
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive AI systems.
📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
High-performance In-browser LLM Inference Engine