Commits · 2d53003432f8560b9c3adf569118747c8ac8447d · Jakob Huber / postgres-lambda-diff

Oct 24, 2014
- Complain if too many options are passed to pg_controldata or pg_resetxlog. · 2d530034
  Heikki Linnakangas authored 10 years ago
  
Sep 25, 2014

Add -D option to specify data directory to pg_controldata and pg_resetxlog. · b0d81ade

Heikki Linnakangas authored 10 years ago

It was confusing that other commands, like initdb and postgres, take the
data directory as "-D datadir", while pg_controldata and pg_resetxlog took
just a plain path, without the "-D". With this patch, pg_controldata and
pg_resetxlog also accept "-D datadir".

Abhijit Menon-Sen, with minor kibitzing by me
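
As a rough illustration of the behavior described above, the following C sketch accepts the data directory either as "-D datadir" or as a plain path, and rejects leftover arguments. It is not the code from this commit: the helper name parse_datadir and the error text are assumptions made for the example; only getopt() itself is a real POSIX API.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <stdio.h>
#include <unistd.h>

/*
 * Hypothetical helper (not from the PostgreSQL tree): returns the data
 * directory named on the command line, or NULL on a usage error or when
 * no directory was given at all.
 */
static const char *
parse_datadir(int argc, char *argv[])
{
    const char *datadir = NULL;
    int         c;

    assert(argv != NULL);
    optind = 1;                 /* start scanning at argv[1] */

    while ((c = getopt(argc, argv, "D:")) != -1)
    {
        if (c != 'D')
            return NULL;        /* unknown option; getopt() already complained */
        datadir = optarg;
    }

    /* Old style: a plain path without "-D" is still accepted. */
    if (datadir == NULL && optind < argc)
        datadir = argv[optind++];

    /* Anything left over is reported instead of being silently ignored. */
    if (optind < argc)
    {
        fprintf(stderr, "too many command-line arguments (first is \"%s\")\n",
                argv[optind]);
        return NULL;
    }
    return datadir;
}
```

With this sketch, "prog -D /data" and "prog /data" both yield "/data", while "prog -D /a /b" is rejected, which is the kind of complaint the Oct 24, 2014 entry describes.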

Aug 09, 2014
- Small message fixes · f25e0bf5
  Peter Eisentraut authored 10 years ago
  
Jun 05, 2014

Add defenses against running with a wrong selection of LOBLKSIZE. · 5f93c378

Tom Lane authored 10 years ago

It's critical that the backend's idea of LOBLKSIZE match the way data has
actually been divided up in pg_largeobject. While we don't provide any
direct way to adjust that value, doing so is a one-line source code change
and various people have expressed interest recently in changing it. So,
just as with TOAST_MAX_CHUNK_SIZE, it seems prudent to record the value in
pg_control and cross-check that the backend's compiled-in setting matches
the on-disk data.

Also tweak the code in inv_api.c so that fetches from pg_largeobject
explicitly verify that the length of the data field is not more than
LOBLKSIZE. Formerly we just had Asserts() for that, which is no protection
at all in production builds. In some of the call sites an overlength data
value would translate directly to a security-relevant stack clobber, so it
seems worth one extra runtime comparison to be sure.

In the back branches, we can't change the contents of pg_control; but we
can still make the extra checks in inv_api.c, which will offer some amount
of protection against running with the wrong value of LOBLKSIZE.
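
The two defenses described in this message can be sketched in C roughly as follows. This is an illustration under stated assumptions, not the committed code: ControlFileSketch, loblksize_matches and lo_chunk_length_ok are hypothetical names, and the values assume the default 8 kB block size with LOBLKSIZE defined as a quarter of it.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define BLCKSZ      8192            /* assumed default page size */
#define LOBLKSIZE   (BLCKSZ / 4)    /* assumed compiled-in chunk size */

/* Hypothetical stand-in for the relevant part of pg_control. */
typedef struct ControlFileSketch
{
    uint32_t    loblksize;      /* value recorded when the cluster was created */
} ControlFileSketch;

/* Startup cross-check: does the on-disk value match this build? */
static bool
loblksize_matches(const ControlFileSketch *cf)
{
    assert(cf != NULL);
    return cf->loblksize == LOBLKSIZE;
}

/*
 * Runtime check on a chunk fetched from pg_largeobject. Unlike an
 * assertion, this comparison is still present in production builds.
 */
static bool
lo_chunk_length_ok(size_t datalen)
{
    return datalen <= LOBLKSIZE;
}
```

A caller would refuse to start when loblksize_matches() fails, and raise an error instead of copying the data when lo_chunk_length_ok() fails.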

May 28, 2014

Propagate system identifier generation improvement into pg_resetxlog. · 4bcb3946

Tom Lane authored 10 years ago

Commit 5035701e improved xlog.c's method for creating a database system
identifier, but I neglected to fix the copy of that code appearing in
pg_resetxlog.c.  Spotted by Andres Freund.

May 06, 2014

pgindent run for 9.4 · 0a783200

Bruce Momjian authored 10 years ago

This includes removing tabs after periods in C comments, which was
applied to back branches, so this change should not affect backpatching.

Mar 21, 2014
- Remove MinGW readdir/errno bug workaround fixed on 2003-10-10 · 1494931d
  Bruce Momjian authored 11 years ago
  
- Properly check for readdir/closedir() failures · 6f03927f
  Bruce Momjian authored 11 years ago
  
  Clear errno before calling readdir() and handle old MinGW errno bug while adding full test coverage for readdir/closedir failures. Backpatch through 8.4.
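
The readdir()/closedir() checking pattern mentioned in the entry above can be sketched like this. The function count_dir_entries is a hypothetical example, not code from the commit; it only shows why errno has to be cleared before each readdir() call.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <dirent.h>
#include <errno.h>
#include <stdio.h>

/*
 * Hypothetical example: count the entries of a directory, returning -1
 * if opendir(), readdir() or closedir() reports a failure.
 */
static int
count_dir_entries(const char *path)
{
    DIR        *dir;
    int         n = 0;

    assert(path != NULL);
    dir = opendir(path);
    if (dir == NULL)
    {
        perror("opendir");
        return -1;
    }

    /*
     * readdir() returns NULL both at end of directory and on error; only
     * errno tells the two apart, so it is cleared before every call.
     */
    while (errno = 0, readdir(dir) != NULL)
        n++;

    if (errno != 0)
    {
        perror("readdir");
        closedir(dir);
        return -1;
    }

    /* closedir() can fail too, so its result is checked as well. */
    if (closedir(dir) != 0)
    {
        perror("closedir");
        return -1;
    }
    return n;
}
```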
Mar 13, 2014
- C comments: remove odd blank lines after #ifdef WIN32 lines · 242c2737
  Bruce Momjian authored 11 years ago
  
  A few more
Feb 15, 2014

Centralize getopt-related declarations in a new header file pg_getopt.h. · 60ff2fdd

Tom Lane authored 11 years ago

We used to have externs for getopt() and its API variables scattered
all over the place.  Now that we find we're going to need to tweak the
variable declarations for Cygwin, it seems like a good idea to have
just one place to tweak.

In this commit, the variables are declared "#ifndef HAVE_GETOPT_H".
That may or may not work everywhere, but we'll soon find out.

Andres Freund

Jan 07, 2014

Update copyright for 2014 · 7e04792a

Bruce Momjian authored 11 years ago

Update all files in head, and files COPYRIGHT and legal.sgml in all back
branches.

Dec 20, 2013
- Rename wal_log_hintbits to wal_log_hints, per discussion on pgsql-hackers. · 961bf59f
  Fujii Masao authored 11 years ago
  
  Sawada Masahiko
Dec 13, 2013

Add GUC to enable WAL-logging of hint bits, even with checksums disabled. · 50e54709

Heikki Linnakangas authored 11 years ago

WAL records of hint bit updates are useful to tools that want to examine
which pages have been modified. In particular, this is required to make
the pg_rewind tool safe (without checksums).

This can also be used to test how much extra WAL-logging would occur if
you enabled checksums, without actually enabling them (which you can't
currently do without re-initdb'ing).

Sawada Masahiko, docs by Samrat Revagade. Reviewed by Dilip Kumar, with
further changes by me.

Dec 12, 2013
- Display old and new values in pg_resetxlog -n output. · 108e3992
  Heikki Linnakangas authored 11 years ago
  
  For extra clarity. Rajeev Rastogi, reviewed by Amit Kapila
Jul 07, 2013
- pg_resetxlog: Make --help consistent with man page · e714d031
  Peter Eisentraut authored 11 years ago
  
  Use "MXID" as placeholder for -m option, instead of just "XID".
Jul 04, 2013

Add new GUC, max_worker_processes, limiting number of bgworkers. · 6bc8ef0b

Robert Haas authored 11 years ago

In 9.3, there's no particular limit on the number of bgworkers;
instead, we just count up the number that are actually registered,
and use that to set MaxBackends.  However, that approach causes
problems for Hot Standby, which needs both MaxBackends and the
size of the lock table to be the same on the standby as on the
master, yet it may not be desirable to run the same bgworkers in
both places.  9.3 handles that by failing to notice the problem,
which will probably work fine in nearly all cases anyway, but is
not theoretically sound.

A further problem with simply counting the number of registered
workers is that new workers can't be registered without a
postmaster restart.  This is inconvenient for administrators,
since bouncing the postmaster causes an interruption of service.
Moreover, there are a number of applications for background
processes where, by necessity, the background process must be
started on the fly (e.g. parallel query).  While this patch
doesn't actually make it possible to register new background
workers after startup time, it's a necessary prerequisite.

Patch by me.  Review by Michael Paquier.

Jun 27, 2013

Update pg_resetxlog's documentation on multixacts · 9db4ad44

Alvaro Herrera authored 11 years ago

I added some more functionality to it in 0ac5ad51 but neglected to
add it to the docs.

Per Peter Eisentraut in message
1367112171.32604.4.camel@vanquo.pezone.net

May 29, 2013

pgindent run for release 9.3 · 9af4159f

Bruce Momjian authored 11 years ago

This is the first run of the Perl-based pgindent script.  Also update
pgindent instructions.

Apr 30, 2013

Record data_checksum_version in control file. · 44395174

Simon Riggs authored 11 years ago

The value is not used anywhere in the code, but recording it will
allow the checksum version to be changed later,
should that become necessary.

Mar 22, 2013

Allow I/O reliability checks using 16-bit checksums · 96ef3b8f

Simon Riggs authored 12 years ago

Checksums are set immediately prior to flush out of shared buffers
and checked when pages are read in again. Hint bit setting will
require full page write when block is dirtied, which causes various
infrastructure changes. Extensive comments, docs and README.

WARNING message thrown if checksum fails on non-all zeroes page;
ERROR thrown but can be disabled with ignore_checksum_failure = on.

Feature enabled by an initdb option, since transition from option off
to option on is long and complex and has not yet been implemented.
Default is not to use checksums.

Checksum used is WAL CRC-32 truncated to 16 bits.

Simon Riggs, Jeff Davis, Greg Smith
Wide input and assistance from many community members. Thank you.

Mar 17, 2013
- pg_resetxlog: Capitalize placeholder in --help output · d2bef5f7
  Peter Eisentraut authored 12 years ago
  
Feb 12, 2013

Create libpgcommon, and move pg_malloc et al to it · 8396447c

Alvaro Herrera authored 12 years ago

libpgcommon is a new static library to allow sharing code among the
various frontend programs and backend; this lets us eliminate duplicate
implementations of common routines.  We avoid libpgport, because that's
intended as a place for porting issues; per discussion, it seems better
to keep them separate.

The first use case, and the only implemented by this patch, is pg_malloc
and friends, which many frontend programs were already using.

At the same time, we can use this to provide palloc emulation functions
for the frontend; this way, some palloc-using files in the backend can
also be used by the frontend cleanly.  To do this, we change palloc() in
the backend to be a function instead of a macro on top of
MemoryContextAlloc().  This was previously believed to cause loss of
performance, but this implementation has been tweaked by Tom and Andres
so that on modern compilers it provides a slight improvement over the
previous one.

This lets us clean up some places that already relied on
localized hacks.

Most of the pg_malloc/palloc changes in this patch were authored by
Andres Freund. Zoltán Böszörményi also independently provided a form of
that.  libpgcommon infrastructure was authored by Álvaro.

Feb 11, 2013

Support unlogged GiST index. · 62401db4

Heikki Linnakangas authored 12 years ago

The reason this wasn't supported before was that GiST indexes need an
increasing sequence to detect concurrent page-splits. In a regular WAL-
logged GiST index, the LSN of the page-split record is used for that
purpose, and in a temporary index, we can get away with a backend-local
counter. Neither of those methods works for an unlogged relation.

To provide such an increasing sequence of numbers, create a "fake LSN"
counter that is saved and restored across shutdowns. On recovery, unlogged
relations are blown away, so the counter doesn't need to survive that
either.

Jeevan Chalke, based on discussions with Robert Haas, Tom Lane and me.

Include previous TLI in end-of-recovery and shutdown checkpoint records. · 7803e932

Heikki Linnakangas authored 12 years ago

This isn't used for anything but a sanity check at the moment, but it could
be highly valuable for debugging purposes. It could also be used to recreate
timeline history by traversing WAL, which seems useful.

Jan 23, 2013

Improve concurrency of foreign key locking · 0ac5ad51

Alvaro Herrera authored 12 years ago

This patch introduces two additional lock modes for tuples: "SELECT FOR
KEY SHARE" and "SELECT FOR NO KEY UPDATE".  These don't block each
other, in contrast with already existing "SELECT FOR SHARE" and "SELECT
FOR UPDATE".  UPDATE commands that do not modify the values stored in
the columns that are part of the key of the tuple now grab a SELECT FOR
NO KEY UPDATE lock on the tuple, allowing them to proceed concurrently
with tuple locks of the FOR KEY SHARE variety.

Foreign key triggers now use FOR KEY SHARE instead of FOR SHARE; this
means the concurrency improvement applies to them, which is the whole
point of this patch.

The added tuple lock semantics require some rejiggering of the multixact
module, so that the locking level that each transaction is holding can
be stored alongside its Xid.  Also, multixacts now need to persist
across server restarts and crashes, because they can now represent not
only tuple locks, but also tuple updates.  This means we need more
careful tracking of lifetime of pg_multixact SLRU files; since they now
persist longer, we require more infrastructure to figure out when they
can be removed.  pg_upgrade also needs to be careful to copy
pg_multixact files over from the old server to the new, or at least part
of multixact.c state, depending on the versions of the old and new
servers.

Tuple time qualification rules (HeapTupleSatisfies routines) need to be
careful not to consider tuples with the "is multi" infomask bit set as
being only locked; they might need to look up MultiXact values (i.e.
possibly do pg_multixact I/O) to find out the Xid that updated a tuple,
whereas they previously were assured to only use information readily
available from the tuple header.  This is considered acceptable, because
the extra I/O would involve cases that would previously cause some
commands to block waiting for concurrent transactions to finish.

Another important change is the fact that locking tuples that have
previously been updated causes the future versions to be marked as
locked, too; this is essential for correctness of foreign key checks.
This also causes additional WAL-logging (there was previously a single
WAL record for a locked tuple; now there are as many as there are updated
copies of the tuple).

With all this in place, contention related to tuples being checked by
foreign key rules should be much reduced.

As a bonus, the old behavior that a subtransaction grabbing a stronger
tuple lock than the parent (sub)transaction held on a given tuple and
later aborting caused the weaker lock to be lost, has been fixed.

Many new spec files were added for isolation tester framework, to ensure
overall behavior is sane.  There's probably room for several more tests.

There were several reviewers of this patch; in particular, Noah Misch
and Andres Freund spent considerable time in it.  Original idea for the
patch came from Simon Riggs, after a problem report by Joel Jacobson.
Most code is from me, with contributions from Marti Raudsepp, Alexander
Shulgin, Noah Misch and Andres Freund.

This patch was discussed in several pgsql-hackers threads; the most
important start at the following message-ids:
	AANLkTimo9XVcEzfiBR-ut3KVNDkjm2Vxh+t8kAmWjPuv@mail.gmail.com
	1290721684-sup-3951@alvh.no-ip.org
	1294953201-sup-2099@alvh.no-ip.org
	1320343602-sup-2290@alvh.no-ip.org
	1339690386-sup-8927@alvh.no-ip.org
	4FE5FF020200002500048A3D@gw.wicourts.gov
	4FEAB90A0200002500048B7D@gw.wicourts.gov

Jan 01, 2013

Update copyrights for 2013 · bd61a623

Bruce Momjian authored 12 years ago

Fully update git head, and update back branches in ./COPYRIGHT and
legal.sgml files.

Dec 04, 2012

Track the timeline associated with minRecoveryPoint, for more sanity checks. · 5ce108bf

Heikki Linnakangas authored 12 years ago

This allows recovery to notice certain incorrect recovery scenarios.
If a server has recovered to point X on timeline 5, and you restart
recovery, it had better be on timeline 5 when it reaches point X again, not
on some timeline with a higher ID. This can happen e.g. if a standby server
is shut down, a new timeline appears in the WAL archive, and the standby
server is restarted. It will try to follow the new timeline, which is wrong
because some WAL on the old timeline was already replayed before shutdown.

Requires an initdb (or at least pg_resetxlog), because this adds a field to
the control file.

Nov 22, 2012

Fix pg_resetxlog to use correct path to postmaster.pid. · 455b8887

Tom Lane authored 12 years ago

Since we've already chdir'd into the data directory, the file should
be referenced as just "postmaster.pid", without prefixing the directory
path. This is harmless in the normal case where an absolute PGDATA path
is used, but quite dangerous if a relative path is specified, since the
program might then fail to notice an active postmaster.

Reported by Hari Babu. This got broken in my commit
eb5949d1, so patch all active versions.

Jun 26, 2012

Fix pg_upgrade, broken by the xlogid/segno -> 64-bit int refactoring. · 038f3a05

Heikki Linnakangas authored 12 years ago

The xlogid + segno representation of a particular WAL segment doesn't make
much sense in pg_resetxlog anymore, now that we don't use that anywhere
else. Use the WAL filename instead, since that's a convenient way to name a
particular WAL segment.

I did this partially for pg_resetxlog in the original xlogid/segno -> uint64
patch, but I neglected pg_upgrade and the docs. This should now be more
complete.
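
To make the "WAL filename names a segment" idea concrete, here is a minimal C sketch that converts between a 64-bit segment number and a 24-hex-digit file name (timeline, then high part, then low part). It is an illustration, not the pg_resetxlog code: the function names are hypothetical, and the 16 MB segment size is an assumed default.

```c
#include <assert.h>
#include <inttypes.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

#define WAL_SEG_SIZE    (UINT64_C(16) * 1024 * 1024)    /* assumed default */
#define SEGS_PER_XLOGID (UINT64_C(0x100000000) / WAL_SEG_SIZE)
#define WAL_FNAME_LEN   24

/* Build the file name for segment "segno" on timeline "tli". */
static void
wal_file_name(char *buf, size_t buflen, uint32_t tli, uint64_t segno)
{
    assert(buf != NULL && buflen > WAL_FNAME_LEN);
    snprintf(buf, buflen, "%08" PRIX32 "%08" PRIX32 "%08" PRIX32,
             tli,
             (uint32_t) (segno / SEGS_PER_XLOGID),
             (uint32_t) (segno % SEGS_PER_XLOGID));
}

/*
 * Recover timeline and segment number from a file name. This is a
 * lenient sketch: it checks the length and field count only.
 */
static bool
wal_file_name_parse(const char *name, uint32_t *tli, uint64_t *segno)
{
    uint32_t    log;
    uint32_t    seg;

    assert(name != NULL && tli != NULL && segno != NULL);
    if (strlen(name) != WAL_FNAME_LEN)
        return false;
    if (sscanf(name, "%8" SCNx32 "%8" SCNx32 "%8" SCNx32,
               tli, &log, &seg) != 3)
        return false;
    *segno = (uint64_t) log * SEGS_PER_XLOGID + seg;
    return true;
}
```

Under these assumptions, segment 256 on timeline 1 maps to 000000010000000100000000, and parsing that name gives back timeline 1 and segment 256.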

Jun 25, 2012
- Unbreak pg_resetxlog -l. · a6427f1f
  Robert Haas authored 12 years ago
  
  Fujii Masao
- Fix warning for 64-bit literal on 32-bit build. · 5c7f954d
  Kevin Grittner authored 12 years ago
  
Jun 24, 2012

Replace XLogRecPtr struct with a 64-bit integer. · 0ab9d1c4

Heikki Linnakangas authored 12 years ago

This simplifies code that needs to do arithmetic on XLogRecPtrs.

To avoid changing on-disk format of data pages, the LSN on data pages is
still stored in the old format. That should keep pg_upgrade happy. However,
we have XLogRecPtrs embedded in the control file, and in the structs that
are sent over the replication protocol, so this change breaks compatibility
of pg_basebackup and server. I didn't do anything about this in this patch;
per discussion on -hackers, the right thing to do would be to change the
replication protocol to be architecture-independent, so that you could use
a newer version of pg_receivexlog, for example, against an older server
version.
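
The simplification in arithmetic can be illustrated with a small C sketch. The type and function names below (OldXLogRecPtr, XLogRecPtr64, recptr_from_old and so on) are hypothetical stand-ins chosen for the example; the sketch only assumes that the old form was a pair of 32-bit halves.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed shape of the old two-part representation. */
typedef struct OldXLogRecPtr
{
    uint32_t    xlogid;         /* high 32 bits */
    uint32_t    xrecoff;        /* low 32 bits */
} OldXLogRecPtr;

/* New representation: a plain 64-bit byte position. */
typedef uint64_t XLogRecPtr64;

static XLogRecPtr64
recptr_from_old(OldXLogRecPtr p)
{
    return ((uint64_t) p.xlogid << 32) | p.xrecoff;
}

/* Splitting is still needed wherever the old layout stays on disk. */
static OldXLogRecPtr
recptr_to_old(XLogRecPtr64 p)
{
    OldXLogRecPtr r;

    r.xlogid = (uint32_t) (p >> 32);
    r.xrecoff = (uint32_t) p;
    return r;
}

/* With a single integer, distance is one subtraction, with no carry logic. */
static uint64_t
recptr_distance(XLogRecPtr64 start, XLogRecPtr64 end)
{
    assert(end >= start);
    return end - start;
}
```

For example, the distance from (xlogid 0, xrecoff 0xFFFFFFF0) to (xlogid 1, xrecoff 0x10) is 0x20 bytes, computed without any special handling of the 4 GB boundary.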

Allow WAL record header to be split across pages. · 061e7efb

Heikki Linnakangas authored 12 years ago

This saves a few bytes of WAL space, but the real motivation is to make it
predictable how much WAL space a record requires, as it no longer depends
on whether we need to waste the last few bytes at end of WAL page because
the header doesn't fit.

The total length field of WAL record, xl_tot_len, is moved to the beginning
of the WAL record header, so that it is still always found on the first page
where a WAL record begins.

Bump WAL version number again as this is an incompatible change.

Don't waste the last segment of each 4GB logical log file. · dfda6eba

Heikki Linnakangas authored 12 years ago

The comments claimed that wasting the last segment made it easier to do
calculations with XLogRecPtrs, because you don't have problems representing
last-byte-position-plus-1 that way. In my experience, however, it only made
things more complicated, because there were two ways to represent the
boundary at the beginning of a logical log file: xlogid = n+1 and xrecoff = 0,
or xlogid = n and xrecoff = 4GB - XLOG_SEG_SIZE. Some functions were
picky about which representation was used.

Also, use a 64-bit segment number instead of the log/seg combination, to
point to a certain WAL segment. We assume that all platforms have a working
64-bit integer type nowadays.
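
A short C sketch of the 64-bit segment numbering may help here. It is not taken from the patch: the helper names are hypothetical and the 16 MB segment size is an assumed default. With one integer per segment and every segment in use, each boundary has exactly one representation.

```c
#include <assert.h>
#include <stdint.h>

#define WAL_SEG_SIZE    (UINT64_C(16) * 1024 * 1024)    /* assumed default */
#define SEGS_PER_4GB    (UINT64_C(0x100000000) / WAL_SEG_SIZE)

/* Segment containing the given 64-bit byte position. */
static uint64_t
segno_of(uint64_t pos)
{
    return pos / WAL_SEG_SIZE;
}

/* Offset of the byte position within its segment. */
static uint32_t
offset_in_segment(uint64_t pos)
{
    return (uint32_t) (pos % WAL_SEG_SIZE);
}

/* First byte position of a segment; the assert guards against overflow. */
static uint64_t
segment_start(uint64_t segno)
{
    assert(segno <= UINT64_MAX / WAL_SEG_SIZE);
    return segno * WAL_SEG_SIZE;
}
```

Under these assumptions a 4 GB range holds 256 segments, and the position 0x100000000 is simply the start of segment 256, with offset 0.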

This is an incompatible change in WAL format, so bumping WAL version number.

Jun 18, 2012

Make documentation of --help and --version options more consistent · bb7520cc

Peter Eisentraut authored 12 years ago

Before, some places didn't document the short options (-? and -V),
some documented both, some documented nothing, and they were listed in
various orders. Now this is hopefully more consistent and complete.

Jun 07, 2012
- Message style improvements · 5d0109bd
  Peter Eisentraut authored 12 years ago
  
May 18, 2012
- Realign some --help output to have better spacing between columns · 2273a503
  Peter Eisentraut authored 12 years ago
  
Jan 25, 2012

Allow pg_basebackup from standby node with safety checking. · 8366c780

Simon Riggs authored 13 years ago

Base backup follows recommended procedure, plus goes to great
lengths to ensure that partial page writes are avoided.

Jun Ishizuka and Fujii Masao, with minor modifications

Jan 02, 2012
- Update copyright notices for year 2012. · e126958c
  Bruce Momjian authored 13 years ago
  
Aug 17, 2011
- Teach pg_controldata and pg_resetxlog about the new backupEndRequired field · a1a847d3
  Heikki Linnakangas authored 13 years ago
  
  in control file.