Deep dive into postgres stats: Introduction

Everything you always wanted to know about PostgreSQL stats

This blog post is all about postgres activity stats. Stats are very important because they allow us to understand what’s going on inside the system. That said, stats also have some weak points, and that’s what I would like to discuss here.

In this blog series I will try to explain how to use stats effectively and how to detect and solve problems with their help. I will start with an overview and then move on to specific use cases, with recipes and built-in tools.

I hope my posts will give you some hands-on tools that help you use postgres stats, and will reassure you that stats aren’t as scary as they seem at first glance.

What is postgres? For people who are not familiar with postgres, it’s a bunch of processes in the "ps auxf" output. What they can see are operating system metrics, such as CPU usage, memory or swap consumed by these processes, but nothing more. This, however, is not sufficient to troubleshoot postgres effectively. To do that, one needs to know how postgres stats are collected throughout the postgres lifetime, and be able to use them properly.

If we look inside postgres and try to understand what it consists of, we see many subsystems that work together, and problems in one part may cause problems in the others. Generally, postgres can be split into two abstract parts: the first is servicing clients, and the second is background service operations, e.g. vacuums, checkpoints, write-ahead logs. Thus a slowdown in background operations will negatively affect client-servicing tasks, and vice versa. Postgres activity stats are used for observing, predicting or eliminating possible problems, and almost all postgres parts have specific stats that describe what’s going on there.

Despite these strengths, stats also have a few weak points. First, there are so many of them that one has to know which source to use in each particular case. Second, almost all stats are provided as permanently incrementing counters, and you often see values in the billions that tell you nothing on their own. Third, stats don’t provide history, so there is no built-in way to check what happened five, ten or thirty minutes ago. And last but not least, postgres doesn’t provide a handy tool for working with stats, so the end user has to rely on SQL clients such as psql.
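To make the point about ever-growing counters more concrete, here is a minimal Python sketch of one common way to make such values readable: take two snapshots some time apart and convert the difference into a per-second rate. The function name counter_rates and the sample numbers are made up for illustration; only the counter names xact_commit and xact_rollback come from the pg_stat_database view.

```python
from typing import Dict


def counter_rates(
    before: Dict[str, int], after: Dict[str, int], interval_seconds: float
) -> Dict[str, float]:
    """Turn two snapshots of cumulative counters into per-second rates."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    rates: Dict[str, float] = {}
    for name, new_value in after.items():
        if name not in before:
            continue  # no baseline for this counter, so no rate can be computed
        delta = new_value - before[name]
        if delta < 0:
            continue  # counter went backwards, most likely a stats reset
        rates[name] = delta / interval_seconds
    return rates


# Hypothetical values, shaped like two reads of pg_stat_database taken 60 s apart.
first = {"xact_commit": 1_000_000_000, "xact_rollback": 5_000}
second = {"xact_commit": 1_000_006_000, "xact_rollback": 5_030}

print(counter_rates(first, second, 60.0))
# prints {'xact_commit': 100.0, 'xact_rollback': 0.5}
```

A billion commits says little by itself, while "100 commits per second over the last minute" is something you can compare against yesterday. Storing such snapshots somewhere is also a simple way to work around the missing history.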

Despite all that, stats are the first thing that can help you troubleshoot postgres. Let’s take a closer look. Stats provide different types of information:

Events that happened in the postgres instance (counters), such as table or index operations, numbers of commits and rollbacks, block hits and reads, etc.
Properties of database objects (current values), e.g. start times and states of transactions or queries, and relation sizes.
Time spent on read and write operations (counters).

Stats tracking is enabled by default, and no additional configuration steps are needed (except for stats provided by contrib modules). The stats interface is based on functions and views, which are available by default in all existing and newly created databases. Thus stats can be retrieved with psql or any other client by running SELECT queries against these views.
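As a rough illustration of "any other client", the Python sketch below reads a few columns from the pg_stat_database view through a generic DB-API 2.0 cursor. The helper name fetch_database_stats is hypothetical, and the connection itself is assumed to be opened elsewhere with whatever driver you use; the view and the selected columns are standard postgres ones.

```python
from typing import Any, Dict, List

# A plain SELECT against one of the built-in stats views.
QUERY = (
    "SELECT datname, numbackends, xact_commit, xact_rollback, blks_read, blks_hit "
    "FROM pg_stat_database"
)


def fetch_database_stats(cursor: Any) -> List[Dict[str, Any]]:
    """Run the stats query on a DB-API 2.0 cursor and return one dict per database."""
    cursor.execute(QUERY)
    # cursor.description holds one entry per result column; item 0 is the column name.
    columns = [description[0] for description in cursor.description]
    return [dict(zip(columns, row)) for row in cursor.fetchall()]
```

With a driver such as psycopg2 (an assumption, any DB-API driver should behave the same way), you would pass in the cursor of an open connection and get back a list with one dictionary per database. The same SELECT can of course be typed directly into psql.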

As mentioned above, there are many stats functions and views, and in addition postgres has multiple subsystems. This information can be represented in the following diagram.

In the next post in this series I will focus on a particular stats view and explain what kinds of problems it helps to solve and how to do it.
