site-reliability-engineering

Site reliability engineering (SRE) is a set of principles and practices that applies aspects of software engineering to infrastructure and operations problems. Its main goal is to create scalable and highly reliable software systems. SRE is closely related to DevOps, a set of practices that combines software development and IT operations, and it has also been described as a specific implementation of DevOps.

Here are 95 public repositories matching this topic...

at15 / sre-handbook

chiaen / sre-book-in-audio

vacovsky / poolse

jacob-hudson / ideal-enigma

jkpl / sre-env

MrSaints / terraform-provider-cabot

komlog-io / komlogd

bparli / goavail

lukebrady / resourced

dastergon / error-budget-calculator

dastergon / sreworkbook-templates-md

dastergon / CardsAgainstReliability

digitalascension / azure-tm-monitor

skyzyx / engineering-for-site-reliability

byn3 / holberton-system_engineering-devops

abrunner94 / maia

TheJokersThief / go-enc

gremlin / sre-tools

wh211212 / awesome-sre-cn

danrl / skinny