Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Why SRE documents matter
Nukala S., Rau V. Communications of the ACM61 (12):45-51,2018.Type:Article
Date Reviewed: Mar 4 2019

This article is about ensuring good documentation to help site reliability engineering (SRE) teams function effectively, scale seamlessly to accommodate new services, and ensure that web products and services run reliably. It focuses on moving from one-off team member skill-based performance to a process that relies on high-quality documentation for teams to scale up and execute efficiently and reliably.

There are several SRE tasks, and the article outlines a set of documents that need to be adequately maintained for effectively accomplishing each of these tasks. These include: i) “documents for new service onboarding,” which include templates for architecture and dependencies, capacity planning, failure modes, process and automation, and external dependencies; ii) “documents for running a service,” which include documents for service overviews, playbooks, postmortem, policies, and service-level agreements (SLAs), and a subset of documents for production products such as an about page, codelabs, frequently asked questions (FAQs), support, application programming interface (API) references, and concept, how-to, and developer guides; iii) “documents for reporting service states,” such as periodic service reviews and best practices reviews; iv) “documents for running SRE teams,” such as team sites and team charters; v) “documents for new SRE onboarding,” such as repository access and management; and vi) “documents for service decommissioning.”

The article is hence a holistic description of a set of documents that need to be created, maintained, and used over the life cycle of services or products (especially web products, as addressed here) to enable and ensure efficiency and reliability. It is a good reference for those interested in joining the SRE profession, as well as for project management and system quality professionals.

Reviewer:  Srini Ramaswamy Review #: CR146457 (1905-0176)
Bookmark and Share
  Featured Reviewer  
 
General (D.2.0 )
 
 
General (D.2.0 )
 
 
Software Engineering (D.2 )
 
Would you recommend this review?
yes
no
Other reviews under "General": Date
Development of distributed software
Shatz S. (ed), Macmillan Publishing Co., Inc., Indianapolis, IN, 1993. Type: Book (9780024096111)
Aug 1 1994
Fundamentals of software engineering
Ghezzi C., Jazayeri M., Mandrioli D., Prentice-Hall, Inc., Upper Saddle River, NJ, 1991. Type: Book (013820432)
Jul 1 1992
Software engineering
Sodhi J., TAB Books, Blue Ridge Summit, PA, 1991. Type: Book (9780830633425)
Feb 1 1992
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy