Quick 📸 on 🐘 wait events from pg_stat_activity

Franck Pachot - Dec 7 '23 - - Dev Community

For quick performance troubleshooting, here is a simple script that reads 150 samples from pg_stat_activity, every 100 milliseconds during 15 seconds, to sum the Active Session History per wait event and time buckets, like this:
Image description

I create the temporary tables to store the samples:


-- create temporary views and table to store samples
drop table if exists temp_ash;
drop view if exists  temp_psa;
create temporary view  temp_psa
 as select now() as ts, * from pg_stat_activity;
create temporary table temp_ash
 as select * from psa where false;

Enter fullscreen mode Exit fullscreen mode

I gather the 150 samples using \watch in psql:


-- take samples every 100ms during 15s
insert into temp_ash
 select * from temp_psa
\watch interval=0.1 count=150
--              ^^^       ^^^
--     frequency(s)   samples 

Enter fullscreen mode Exit fullscreen mode

I query the samples, using \crosstabview in psql:


-- query and group into 5 buckets ( = every 3 seconds for 15s gathered)
select ev as "state                              wait_event_type : wait_event"
 , bucket, to_char(count(*)/count(distinct ts),'99999D9') from (
 select ev, ts, to_char(max(ts)over(partition by bucket),'HH24:MI:SS') bucket from (
 select ts
 , format('%14s %-30s %10s:%-15s',backend_type, state, wait_event_type,wait_event) ev
 , ntile(5) over(order by ts) bucket from ash
--       ^
--       time buckets columns
 ) x ) x group by ev, bucket order by 3 desc,ev, bucket
\crosstabview

Enter fullscreen mode Exit fullscreen mode

This gives the following result:

     state                              wait_event_type : wait_event      | 19:24:16 | 19:24:19 | 19:24:07 | 19:24:13 | 19:24:10
--------------------------------------------------------------------------+----------+----------+----------+----------+----------
 client backend idle                               Client:ClientRead      |     83.0 |     83.0 |     82.0 |     80.0 |     79.0
                                                         :                |      4.0 |      4.0 |      4.0 |      4.0 |      4.0
 client backend active                                 IO:DataFileRead    |      2.0 |      3.0 |      2.0 |      3.0 |      2.0
 client backend active                             LWLock:buffer_mapping  |      1.0 |      2.0 |          |          |
 client backend active                                   :                |      1.0 |      1.0 |      1.0 |      1.0 |      1.0
 client backend active                             LWLock:buffer_io       |      1.0 |      1.0 |      1.0 |      1.0 |      1.0
 client backend idle in transaction                      :                |          |      1.0 |          |          |
 client backend idle in transaction (aborted)      Client:ClientRead      |          |          |      1.0 |      1.0 |      1.0
 client backend idle in transaction                Client:ClientRead      |      1.0 |      1.0 |      1.0 |      1.0 |      1.0
(9 rows)
Enter fullscreen mode Exit fullscreen mode

The Amazon Aurora documentation has a great description of PostgreSQL wait events. In the example above Client:ClientRead is the idle backend waiting from something to do (Oracle DBAs will recognize SQL*Net message from client) so I have mostly idle users here and a few ones are within a transaction. The numbers are the average number of session in this state during the time bucket.

Of course, you can gather samples multiple times, with different frequency, and change the number of columns (in ntile())

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .