Hands-On Infrastructure Monitoring with Prometheus

Hands-On Infrastructure Monitoring with Prometheus : Implement and scale queries, dashboards, and alerting across machines and containers

4.71 (14 ratings by Goodreads)
By (author)  , By (author) 
4.71 (14 ratings by Goodreads)
Free delivery worldwide Expected delivery to the United States in 7-10 business days.

Not ordering to the United States? Click here.
Order now for expected delivery to the United States by Christmas Order now for expected delivery to the United States by Christmas


Build Prometheus ecosystems with metric-centric visualization, alerting, and querying

Key Features

Integrate Prometheus with Alertmanager and Grafana for building a complete monitoring system
Explore PromQL, Prometheus' functional query language, with easy-to-follow examples
Learn how to deploy Prometheus components using Kubernetes and traditional instances

Book DescriptionPrometheus is an open source monitoring system. It provides a modern time series database, a robust query language, several metric visualization possibilities, and a reliable alerting solution for traditional and cloud-native infrastructure.

This book covers the fundamental concepts of monitoring and explores Prometheus architecture, its data model, and how metric aggregation works. Multiple test environments are included to help explore different configuration scenarios, such as the use of various exporters and integrations. You'll delve into PromQL, supported by several examples, and then apply that knowledge to alerting and recording rules, as well as how to test them. After that, alert routing with Alertmanager and creating visualizations with Grafana is thoroughly covered. In addition, this book covers several service discovery mechanisms and even provides an example of how to create your own. Finally, you'll learn about Prometheus federation, cross-sharding aggregation, and also long-term storage with the help of Thanos.

By the end of this book, you'll be able to implement and scale Prometheus as a full monitoring system on-premises, in cloud environments, in standalone instances, or using container orchestration with Kubernetes.

What you will learn

Grasp monitoring fundamentals and implement them using Prometheus
Discover how to extract metrics from common infrastructure services
Find out how to take full advantage of PromQL
Design a highly available, resilient, and scalable Prometheus stack
Explore the power of Kubernetes Prometheus Operator
Understand concepts such as federation and cross-shard aggregation
Unlock seamless global views and long-term retention in cloud-native apps with Thanos

Who this book is forIf you're a software developer, cloud administrator, site reliability engineer, DevOps enthusiast or system admin looking to set up a fail-safe monitoring and alerting system for sustaining infrastructure security and performance, this book is for you. Basic networking and infrastructure monitoring knowledge will help you understand the concepts covered in this book.
show more

Product details

  • Paperback | 430 pages
  • 75 x 93 x 22.86mm | 752.96g
  • Birmingham, United Kingdom
  • English
  • 1789612349
  • 9781789612349
  • 1,590,150

Table of contents

Table of Contents

Monitoring Fundamentals
An Overview of the Prometheus Ecosystem
Setting Up a Test Environment
Prometheus Metrics Fundamentals
Running a Prometheus Server
Exporters and Integrations
Prometheus Query Language - PromQL
Troubleshooting and Validation
Defining Alerting and Recording Rules
Discovering and Creating Grafana Dashboards
Understanding and Extending Alertmanager
Choosing the Right Service Discovery
Scaling and Federating Prometheus
Integrating Long-Term Storage with Prometheus
show more

About Joel Bastos

Joel Bastos is an open source supporter and contributor, with a background in infrastructure security and automation. He is always striving for the standardization of processes, code maintainability, and code reusability. He has defined, led, and implemented critical, highly available, and fault-tolerant enterprise and web-scale infrastructures in several organizations, with Prometheus as the cornerstone. He has worked at two unicorn companies in Portugal and at one of the largest transaction-oriented gaming companies in the world. Previously, he has supported several governmental entities with projects such as the Public Key Infrastructure for the Portuguese citizen card. You can find his blog at blog.kintoandar.com and on Twitter with the handle @kintoandar. Pedro Araujo is a site reliability and automation engineer and has defined and implemented several standards for monitoring at scale. His contributions have been fundamental in connecting development teams to infrastructure. He is highly knowledgeable about infrastructure, but his passion is in the automation and management of large-scale, highly-transactional systems. Pedro has contributed to several open source projects, such as Riemann, OpenTSDB, Sensu, Prometheus, and Thanos. You can find him on Twitter with the handle @phcrva.
show more

Rating details

14 ratings
4.71 out of 5 stars
5 79% (11)
4 14% (2)
3 7% (1)
2 0% (0)
1 0% (0)
Book ratings by Goodreads
Goodreads is the world's largest site for readers with over 50 million reviews. We're featuring millions of their reader ratings on our book pages to help you find your new favourite book. Close X