SRE
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
Here are 40 public repositories matching this topic...
Open source AI terminal and SSH Client for EC2, Database and Kubernetes.
-
Updated
Nov 27, 2025 - TypeScript
New Relic One quickstarts help accelerate your New Relic journey by providing immediate value for your specific use cases.
-
Updated
Nov 17, 2025 - TypeScript
Create custom DevOps AI agents that understand and manage your infrastructure.
-
Updated
Feb 27, 2025 - TypeScript
Collection of AWS Fault Injection Simulator (FIS) experiment templates deploy-able via the AWS CDK
-
Updated
Nov 9, 2022 - TypeScript
Configuration as code for the masses
-
Updated
Nov 20, 2021 - TypeScript
Everything you need to build, deploy, and collaborate with agents. Ride the llama, avoid the drama.
-
Updated
Nov 20, 2025 - TypeScript
InfraGPT is an AI SRE Copilot for the Cloud that provides infrastructure management agents through Slack integration. The system consists of multiple services that work together to deliver intelligent DevOps workflows.
-
Updated
Nov 21, 2025 - TypeScript
A prometheus exporter exposing metrics for the official MongoDB Node.js driver.
-
Updated
Nov 26, 2025 - TypeScript
Easlity generate Prometheus Alerts and SLO Rules
-
Updated
Nov 19, 2025 - TypeScript
A prometheus exporter for node-postgres
-
Updated
Nov 26, 2025 - TypeScript
SRE Agent for VS Code
-
Updated
Jan 29, 2025 - TypeScript
A prometheus exporter exposing metrics for KafkaJS
-
Updated
Nov 26, 2025 - TypeScript
Tool to coordinate on-call, incident and maintenance management
-
Updated
Dec 16, 2021 - TypeScript
GitHub Pages - Curriculum | Josenilto Luis | Profissional com experiência em implementações, virtualizações, configurações, gerenciamento de servidores Windows/Linux. Na área de desenvolvimento tenho sólido conhecimento na implementação de sistemas utilizando CMS.
-
Updated
Nov 26, 2025 - TypeScript
🔥GitHub Action to trigger alerts in incident.io.
-
Updated
Nov 25, 2025 - TypeScript
Errloom is an interactive learning platform that teaches developers how to debug real-world production outages.
-
Updated
Nov 24, 2025 - TypeScript
- Followers
- 144 followers
- Website
- github.com/topics/sre
- Wikipedia
- Wikipedia