  1. Apr 30, 2012
    • Converge all SQL-level statistics timing values to float8 milliseconds. · 809e7e21
      Tom Lane authored
      This patch adjusts the core statistics views to match the decision already
      taken for pg_stat_statements, that values representing elapsed time should
      be represented as float8 and measured in milliseconds.  By using float8,
      we are no longer tied to a specific maximum precision of timing data.
      (Internally, it's still microseconds, but we could now change that without
      needing changes at the SQL level.)
      
      The columns affected are
      pg_stat_bgwriter.checkpoint_write_time
      pg_stat_bgwriter.checkpoint_sync_time
      pg_stat_database.blk_read_time
      pg_stat_database.blk_write_time
      pg_stat_user_functions.total_time
      pg_stat_user_functions.self_time
      pg_stat_xact_user_functions.total_time
      pg_stat_xact_user_functions.self_time
      
      The first four of these are new in 9.2, so there is no compatibility issue
      from changing them.  The others require a release note comment that they
      are now double precision (and can show a fractional part) rather than
      bigint as before; also their underlying statistics functions now match
      the column definitions, instead of returning bigint microseconds.
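
      As a rough illustration (assuming a server from the release that includes
      this change, with track_io_timing enabled so the blk_*_time counters
      actually accumulate), the timing columns now read as double precision
      milliseconds and can show a fractional part:

          -- fractional milliseconds, no longer bigint microseconds
          SELECT checkpoint_write_time, checkpoint_sync_time
          FROM pg_stat_bgwriter;

          SELECT datname, blk_read_time, blk_write_time
          FROM pg_stat_database;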
      809e7e21
    • Rename I/O timing statistics columns to blk_read_time and blk_write_time. · 1dd89ead
      Tom Lane authored
      This seems more consistent with the pre-existing choices for names of
      other statistics columns.  Rename assorted internal identifiers to match.
      1dd89ead
  2. Apr 05, 2012
  3. Jan 26, 2012
  4. Jan 19, 2012
    • Separate state from query string in pg_stat_activity · 4f42b546
      Magnus Hagander authored
      This separates the state (running/idle/idle in transaction, etc.) into
      its own field ("state"), and leaves the query field containing just the
      query text.

      The query text will now mean "current query" when a query is running
      and "last query" in other states. Accordingly, the field has been
      renamed from current_query to query.

      Since backwards compatibility was broken anyway by that change, the procpid
      field has also been renamed to pid, along with the same field in
      pg_stat_replication for consistency.
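
      A minimal monitoring sketch against the renamed columns (hedged; the
      filter is only an example):

          SELECT pid, state, query
          FROM pg_stat_activity
          WHERE state <> 'idle';  -- idle sessions keep their last query text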
      
      Scott Mead and Magnus Hagander, review work from Greg Smith
      4f42b546
  5. Jan 02, 2012
  6. Nov 09, 2011
    • In COPY, insert tuples to the heap in batches. · d326d9e8
      Heikki Linnakangas authored
      This greatly reduces the WAL volume, especially when the table is narrow.
      The overhead of locking the heap page is also reduced. Reduced WAL traffic
      also makes it scale a lot better, if you run multiple COPY processes at
      the same time.
      d326d9e8
  7. Oct 21, 2011
    • Code review for pgstat_get_crashed_backend_activity patch. · f9c92a5a
      Tom Lane authored
      Avoid possibly dumping core when pgstat_track_activity_query_size has a
      less-than-default value; avoid uselessly searching for the query string
      of a successfully-exited backend; don't bother putting out an ERRDETAIL if
      we don't have a query to show; some other minor stylistic improvements.
      f9c92a5a
    • Try to log the current query string when a backend crashes. · c8e8b5a6
      Robert Haas authored
      To minimize risk inside the postmaster, we subject this feature
      to a number of significant limitations.  We very much wish to avoid
      doing any complex processing inside the postmaster, due to the
      possibility that the crashed backend has completely corrupted shared
      memory.  To that end, no encoding conversion is done; instead, we just
      replace anything that doesn't look like an ASCII character with a
      question mark.  We limit the amount of data copied to 1024 characters,
      and carefully sanity check the source of that data.  While these
      restrictions would doubtless be unacceptable in a general-purpose
      logging facility, even this limited facility seems like an improvement
      over the status quo ante.
      
      Marti Raudsepp, reviewed by PDXPUG and myself
      c8e8b5a6
  8. Sep 09, 2011
    • Move Timestamp/Interval typedefs and basic macros into datatype/timestamp.h. · a7801b62
      Tom Lane authored
      As per my recent proposal, this refactors things so that these typedefs and
      macros are available in a header that can be included in frontend-ish code.
      I also changed various headers that were undesirably including
      utils/timestamp.h to include datatype/timestamp.h instead.  Unsurprisingly,
      this showed that half the system was getting utils/timestamp.h by way of
      xlog.h.
      
      No actual code changes here, just header refactoring.
      a7801b62
  9. May 30, 2011
    • Fix VACUUM so that it always updates pg_class.reltuples/relpages. · b4b6923e
      Tom Lane authored
      When we added the ability for vacuum to skip heap pages by consulting the
      visibility map, we made it just not update the reltuples/relpages
      statistics if it skipped any pages.  But this could leave us with extremely
      out-of-date stats for a table that contains any unchanging areas,
      especially for TOAST tables which never get processed by ANALYZE.  In
      particular this could result in autovacuum making poor decisions about when
      to process the table, as in a recent report from Florian Helmberger.  And in
      general it's a bad idea to not update the stats at all.  Instead, use the
      previous values of reltuples/relpages as an estimate of the tuple density
      in unvisited pages.  This approach results in a "moving average" estimate
      of reltuples, which should converge to the correct value over multiple
      VACUUM and ANALYZE cycles even when individual measurements aren't very
      good.
      
      This new method for updating reltuples is used by both VACUUM and ANALYZE,
      with the result that we no longer need the grotty interconnections that
      caused ANALYZE to not update the stats depending on what had happened
      in the parent VACUUM command.
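
      Roughly (with illustrative names, not the actual internal variables), the
      estimate treats the unvisited pages as having the old tuple density:

          -- sketch of the moving-average estimate described above:
          --   old_density   = old_reltuples / old_relpages
          --   new_reltuples = scanned_tuples + old_density * (total_pages - scanned_pages)
          -- the resulting planner statistics are visible in pg_class:
          SELECT relname, reltuples, relpages
          FROM pg_class
          WHERE relname = 'my_table';  -- placeholder table name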
      
      Also, fix the logic for skipping all-visible pages during VACUUM so that it
      looks ahead rather than behind to decide what to do, as per a suggestion
      from Greg Stark.  This eliminates useless scanning of all-visible pages at
      the start of the relation or just after a not-all-visible page.  In
      particular, the first few pages of the relation will not be invariably
      included in the scanned pages, which seems to help in not overweighting
      them in the reltuples estimate.
      
      Back-patch to 8.4, where the visibility map was introduced.
      b4b6923e
  10. Apr 10, 2011
  11. Feb 17, 2011
  12. Feb 10, 2011
  13. Jan 03, 2011
  14. Jan 01, 2011
  15. Nov 15, 2010
    • Add new buffers_backend_fsync field to pg_stat_bgwriter. · 3134d886
      Robert Haas authored
      This new field counts the number of times that a backend which writes a
      buffer out to the OS must also fsync() it.  This happens when the
      bgwriter fsync request queue is full, and is generally detrimental to
      performance, so it's good to know when it's happening.  Along the way,
      log a new message at level DEBUG1 whenever we fail to hand off an fsync,
      so that the problem can also be seen in examination of log files
      (if the logging level is cranked up high enough).
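
      A hedged way to watch for this in the field: steadily growing values in
      the new column suggest the fsync request queue is overflowing.

          SELECT buffers_backend, buffers_backend_fsync
          FROM pg_stat_bgwriter;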
      
      Greg Smith, with minor tweaks by me.
      3134d886
  16. Oct 12, 2010
    • Remove some unnecessary tests of pgstat_track_counts. · f4d242ef
      Tom Lane authored
      We may as well make pgstat_count_heap_scan() and related macros just count
      whenever rel->pgstat_info isn't null.  Testing pgstat_track_counts buys
      nothing at all in the normal case where that flag is ON; and when it's OFF,
      the pgstat_info link will be null, so it's still a useless test.
      
      This change is unlikely to buy any noticeable performance improvement,
      but a cycle shaved is a cycle earned; and my investigations earlier today
      convinced me that we're down to the point where individual instructions in
      the inner execution loops are starting to matter.
      f4d242ef
  17. Sep 20, 2010
  18. Aug 21, 2010
  19. Aug 08, 2010
  20. Feb 26, 2010
  21. Jan 28, 2010
  22. Jan 19, 2010
  23. Jan 02, 2010
  24. Dec 30, 2009
    • Revise pgstat's tracking of tuple changes to improve the reliability of · 48c192c1
      Tom Lane authored
      decisions about when to auto-analyze.
      
      The previous code depended on n_live_tuples + n_dead_tuples - last_anl_tuples,
      where all three of these numbers could be bad estimates from ANALYZE itself.
      Even worse, in the presence of a steady flow of HOT updates and matching
      HOT-tuple reclamations, auto-analyze might never trigger at all, even if all
      three numbers are exactly right, because n_dead_tuples could hold steady.
      
      To fix, replace last_anl_tuples with an accurately tracked count of the total
      number of committed tuple inserts + updates + deletes since the last ANALYZE
      on the table.  This can still be compared to the same threshold as before, but
      it's much more trustworthy than the old computation.  Tracking this requires
      one more intra-transaction counter per modified table within backends, but no
      additional memory space in the stats collector.  There probably isn't any
      measurable speed difference; if anything it might be a bit faster than before,
      since I was able to eliminate some per-tuple arithmetic operations in favor of
      adding sums once per (sub)transaction.
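
      The per-table counters feeding these decisions can be watched at the SQL
      level (a hedged example; roughly, autoanalyze fires once the changes
      accumulated since the last analyze exceed autovacuum_analyze_threshold +
      autovacuum_analyze_scale_factor * reltuples):

          SELECT relname, n_live_tup, n_dead_tup, last_autoanalyze
          FROM pg_stat_user_tables
          ORDER BY n_dead_tup DESC;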
      
      Also, simplify the logic around pgstat vacuum and analyze reporting messages
      by not trying to fold VACUUM ANALYZE into a single pgstat message.
      
      The original thought behind this patch was to allow scheduling of analyzes
      on parent tables by artificially inflating their changes_since_analyze count.
      I've left that for a separate patch since this change seems to stand on its
      own merit.
      48c192c1
  25. Nov 29, 2009
  26. Jun 11, 2009
  27. Jan 04, 2009
  28. Jan 01, 2009
  29. Dec 17, 2008
    • Don't reset pg_class.reltuples and relpages in VACUUM, if any pages were · dcf84099
      Heikki Linnakangas authored
      skipped. We could update relpages anyway, but it seems better to only
      update it together with reltuples, because we use the reltuples/relpages
      ratio in the planner. Also don't update n_live_tuples in pgstat.
      
      ANALYZE in VACUUM ANALYZE now needs to update pg_class if the
      VACUUM phase didn't do so. Added some boolean-passing to let analyze_rel
      know if it should update pg_class or not.
      
      I also moved the relcache invalidation (to update rd_targblock) from
      vac_update_relstats to where RelationTruncate is called, because
      vac_update_relstats is not called for partial vacuums anymore. It's more
      obvious to send the invalidation close to the truncation that requires it.
      
      Per report by Ned T. Crigler.
      dcf84099
  30. Nov 03, 2008
    • Change the pgstat logic so that the stats collector writes the stats file only · 3c2313f4
      Tom Lane authored
      upon requests from backends, rather than on a fixed 500msec cycle.  (There's
      still throttling logic to ensure it writes no more often than once per
      500msec, though.)  This should result in a significant reduction in stats file
      write traffic in typical scenarios where the stats are demanded only
      infrequently.
      
      This approach also means that the former difficulty with changing
      stats_temp_directory on-the-fly has gone away, so remove the caution about
      that as well as the thrashing we did to minimize the trouble window.
      
      In passing, also fix pgstat_report_stat() so that we will send a stats
      message if we have function call stats but not table stats to report;
      this fixes a bug in the recent patch to support function-call stats.
      
      Martin Pihlak
      3c2313f4
  31. Aug 15, 2008
  32. Jun 30, 2008
    • Turn PGBE_ACTIVITY_SIZE into a GUC variable, track_activity_query_size. · 995fb742
      Heikki Linnakangas authored
      As the buffer could now be a lot larger than before, and copying it could
      thus be a lot more expensive than before, use strcpy instead of memcpy to
      copy the query string, as was already suggested in comments. Also, only copy
      the PgBackendStatus struct and string if the slot is in use.
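
      For reference (hedged): this is a postmaster-level setting, so it can only
      be changed in postgresql.conf followed by a server restart.

          SHOW track_activity_query_size;   -- default is 1024 bytes
          -- in postgresql.conf:  track_activity_query_size = 4096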
      
      Patch by Thomas Lee, with some changes by me.
      995fb742
  33. Jun 19, 2008
  34. May 15, 2008
  35. Apr 03, 2008
    • Teach ANALYZE to distinguish dead and in-doubt tuples, which it formerly · 51e1445f
      Tom Lane authored
      classed all as "dead"; also get it to count DEAD item pointers as dead rows,
      instead of ignoring them as before.  Also improve matters so that tuples
      previously inserted or deleted by our own transaction are handled nicely:
      the stats collector's live-tuple and dead-tuple counts will end up correct
      after our transaction ends, regardless of whether we end in commit or abort.
      
      While there's more work that could be done to improve the counting of in-doubt
      tuples in both VACUUM and ANALYZE, this commit is enough to alleviate some
      known bad behaviors in 8.3; and the other stuff that's been discussed seems
      like research projects anyway.
      
      Pavan Deolasee and Tom Lane
      51e1445f
  36. Mar 24, 2008
    • Adjust the recent patch for reporting of deadlocked queries so that we report · 9b8e1eb3
      Tom Lane authored
      query texts only to the server log.  This eliminates the issue of possible
      leaking of security-sensitive data in other sessions' queries.  Since the
      log is presumed secure, we can now log the queries of all sessions involved
      in the deadlock, whether or not they belong to the same user as the one
      reporting the failure.
      9b8e1eb3