the.com/sre

software engineers who treat outages like crime scenes and uptime like religion.

means site reliability engineering: applying software engineering discipline to keep systems running, using code instead of manual ops to prevent things from breaking.

from coined at google around 2003 when ben treynor sloss was asked to run a production team and, being a software engineer, built one out of engineers instead of sysadmins.

the.com/
what’s happening now · the.com · generated