Devopness - Painless essential DevOps to everyone
-
Updated
Jul 19, 2024 - TypeScript
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
Devopness - Painless essential DevOps to everyone
On-Call/DevOps Assistant - Get a head start on fixing alerts with AI investigation
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd.io/a4Zu_sH4TZGeih-xCimi3Q
Open source AI on-call developer 🧙♂️ Get relevant context & root cause analysis in seconds about production incidents and make on-call engineers 10x better 🏎️
Web UI for Jaeger
[FSE'24] BARO: Robust Root Cause Analysis for Microservices via Multivariate Bayesian Online Change Point Detection
An easy to use and powerful chaos engineering experiment toolkit.(阿里巴巴开源的一款简单易用、功能强大的混沌实验注入工具)
A Chaos Engineering Platform for Kubernetes.
Technical blogs on topics of Kubernetes, GitOps, CI/CD and SRE in general. Created with ❤️ using Markdown format.
👨💻 blog using github pages | About SRE | PT-BR 🇧🇷
Site Reliability Engineering Munich Meetup Page
A curated list of Site Reliability and Production Engineering Tools
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Chaos testing, network emulation, and stress testing tool for containers
This project is a proof-of-concept - which is a rewrite of my old college project - to demonstrate my skills as a DevOps Engineer before anything else after earning the Microsoft Certified: DevOps Engineer Expert certification
Noticias, Tutoriales, Información, Comunidad DevOps, Site Reliability Engineering (SRE) y Platform Engineering 🌎 🇨🇱 🇧🇷 🇪🇸
A collection of opensource runbooks / playbooks
A curated list of Site Reliability and Production Engineering resources.