Skip to content
Snippets Groups Projects
  1. Nov 29, 2012
    • Tom Lane's avatar
      Fix assorted bugs in CREATE/DROP INDEX CONCURRENTLY. · 94c014b5
      Tom Lane authored
      Commit 8cb53654, which introduced DROP
      INDEX CONCURRENTLY, managed to break CREATE INDEX CONCURRENTLY via a poor
      choice of catalog state representation.  The pg_index state for an index
      that's reached the final pre-drop stage was the same as the state for an
      index just created by CREATE INDEX CONCURRENTLY.  This meant that the
      (necessary) change to make RelationGetIndexList ignore about-to-die indexes
      also made it ignore freshly-created indexes; which is catastrophic because
      the latter do need to be considered in HOT-safety decisions.  Failure to
      do so leads to incorrect index entries and subsequently wrong results from
      queries depending on the concurrently-created index.
      
      To fix, make the final state be indisvalid = true and indisready = false,
      which is otherwise nonsensical.  This is pretty ugly but we can't add
      another column without forcing initdb, and it's too late for that in 9.2.
      (There's a cleaner fix in HEAD.)
      
      In addition, change CREATE/DROP INDEX CONCURRENTLY so that the pg_index
      flag changes they make without exclusive lock on the index are made via
      heap_inplace_update() rather than a normal transactional update.  The
      latter is not very safe because moving the pg_index tuple could result in
      concurrent SnapshotNow scans finding it twice or not at all, thus possibly
      resulting in index corruption.  This is a pre-existing bug in CREATE INDEX
      CONCURRENTLY, which was copied into the DROP code.
      
      In addition, fix various places in the code that ought to check to make
      sure that the indexes they are manipulating are valid and/or ready as
      appropriate.  These represent bugs that have existed since 8.2, since
      a failed CREATE INDEX CONCURRENTLY could leave a corrupt or invalid
      index behind, and we ought not try to do anything that might fail with
      such an index.
      
      Also fix RelationReloadIndexInfo to ensure it copies all the pg_index
      columns that are allowed to change after initial creation.  Previously we
      could have been left with stale values of some fields in an index relcache
      entry.  It's not clear whether this actually had any user-visible
      consequences, but it's at least a bug waiting to happen.
      
      In addition, do some code and docs review for DROP INDEX CONCURRENTLY;
      some cosmetic code cleanup but mostly addition and revision of comments.
      
      Portions of this need to be back-patched even further, but I'll work
      on that separately.
      
      Problem reported by Amit Kapila, diagnosis by Pavan Deolasee,
      fix by Tom Lane and Andres Freund.
      94c014b5
  2. Nov 28, 2012
  3. Nov 26, 2012
    • Tom Lane's avatar
      Revert patch for taking fewer snapshots. · 786afc1c
      Tom Lane authored
      This reverts commit d573e239, "Take fewer
      snapshots".  While that seemed like a good idea at the time, it caused
      execution to use a snapshot that had been acquired before locking any of
      the tables mentioned in the query.  This created user-visible anomalies
      that were not present in any prior release of Postgres, as reported by
      Tomas Vondra.  While this whole area could do with a redesign (since there
      are related cases that have anomalies anyway), it doesn't seem likely that
      any future patch would be reasonably back-patchable; and we don't want 9.2
      to exhibit a behavior that's subtly unlike either past or future releases.
      Hence, revert to prior code while we rethink the problem.
      786afc1c
    • Tom Lane's avatar
      Fix SELECT DISTINCT with index-optimized MIN/MAX on inheritance trees. · eea6ada9
      Tom Lane authored
      In a query such as "SELECT DISTINCT min(x) FROM tab", the DISTINCT is
      pretty useless (there being only one output row), but nonetheless it
      shouldn't fail.  But it could fail if "tab" is an inheritance parent,
      because planagg.c's code for fixing up equivalence classes after making the
      index-optimized MIN/MAX transformation wasn't prepared to find child-table
      versions of the aggregate expression.  The least ugly fix seems to be
      to add an option to mutate_eclass_expressions() to skip child-table
      equivalence class members, which aren't used anymore at this stage of
      planning so it's not really necessary to fix them.  Since child members
      are ignored in many cases already, it seems plausible for
      mutate_eclass_expressions() to have an option to ignore them too.
      
      Per bug #7703 from Maxim Boguk.
      
      Back-patch to 9.1.  Although the same code exists before that, it cannot
      encounter child-table aggregates AFAICS, because the index optimization
      transformation cannot succeed on inheritance trees before 9.1 (for lack
      of MergeAppend).
      eea6ada9
  4. Nov 23, 2012
    • Heikki Linnakangas's avatar
      pg_stat_replication.sync_state was displayed incorrectly at page boundary. · 3f7b04d6
      Heikki Linnakangas authored
      XLogRecPtrIsInvalid() only checks the xrecoff field, which is correct when
      checking if a WAL record could legally begin at the given position, but WAL
      sending can legally be paused at a page boundary, in which case xrecoff is
      0. Use XLByteEQ(..., InvalidXLogRecPtr) instead, which checks that both
      xlogid and xrecoff are 0.
      
      9.3 doesn't have this problem because XLogRecPtr is now a single 64-bit
      integer, so XLogRecPtrIsInvalid() does the right thing. Apply to 9.2, and
      9.1 where pg_stat_replication view was introduced.
      
      Kyotaro HORIGUCHI, reviewed by Fujii Masao.
      3f7b04d6
  5. Nov 22, 2012
    • Tom Lane's avatar
      Fix pg_resetxlog to use correct path to postmaster.pid. · 430b47f3
      Tom Lane authored
      Since we've already chdir'd into the data directory, the file should
      be referenced as just "postmaster.pid", without prefixing the directory
      path.  This is harmless in the normal case where an absolute PGDATA path
      is used, but quite dangerous if a relative path is specified, since the
      program might then fail to notice an active postmaster.
      
      Reported by Hari Babu.  This got broken in my commit
      eb5949d1, so patch all active versions.
      430b47f3
    • Heikki Linnakangas's avatar
      Avoid bogus "out-of-sequence timeline ID" errors in standby-mode. · dda8b87b
      Heikki Linnakangas authored
      When startup process opens a WAL segment after replaying part of it, it
      validates the first page on the WAL segment, even though the page it's
      really interested in later in the file. As part of the validation, it checks
      that the TLI on the page header is >= the TLI it saw on the last page it
      read. If the segment contains a timeline switch, and we have already
      replayed it, and then re-open the WAL segment (because of streaming
      replication got disconnected and reconnected, for example), the TLI check
      will fail when the first page is validated. Fix that by relaxing the TLI
      check when re-opening a WAL segment.
      
      Backpatch to 9.0. Earlier versions had the same code, but before standby
      mode was introduced in 9.0, recovery never tried to re-read a segment after
      partially replaying it.
      
      Reported by Amit Kapila, while testing a new feature.
      dda8b87b
  6. Nov 21, 2012
    • Tom Lane's avatar
      Don't launch new child processes after we've been told to shut down. · 8af60da9
      Tom Lane authored
      Once we've received a shutdown signal (SIGINT or SIGTERM), we should not
      launch any more child processes, even if we get signals requesting such.
      The normal code path for spawning backends has always understood that,
      but the postmaster's infrastructure for hot standby and autovacuum didn't
      get the memo.  As reported by Hari Babu in bug #7643, this could lead to
      failure to shut down at all in some cases, such as when SIGINT is received
      just before the startup process sends PMSIGNAL_RECOVERY_STARTED: we'd
      launch a bgwriter and checkpointer, and then those processes would have no
      idea that they ought to quit.  Similarly, launching a new autovacuum worker
      would result in waiting till it finished before shutting down.
      
      Also, switch the order of the code blocks in reaper() that detect startup
      process crash versus shutdown termination.  Once we've sent it a signal,
      we should not consider that exit(1) is surprising.  This is just a cosmetic
      fix since shutdown occurs correctly anyway, but better not to log a phony
      complaint about startup process crash.
      
      Back-patch to 9.0.  Some parts of this might be applicable before that,
      but given the lack of prior complaints I'm not going to worry too much
      about older branches.
      8af60da9
  7. Nov 20, 2012
    • Tom Lane's avatar
      Improve handling of INT_MIN / -1 and related cases. · 278b6059
      Tom Lane authored
      Some platforms throw an exception for this division, rather than returning
      a necessarily-overflowed result.  Since we were testing for overflow after
      the fact, an exception isn't nice.  We can avoid the problem by treating
      division by -1 as negation.
      
      Add some regression tests so that we'll find out if any compilers try to
      optimize away the overflow check conditions.
      
      Back-patch of commit 1f7cb5c3.
      
      Per discussion with Xi Wang, though this is different from the patch he
      submitted.
      278b6059
  8. Nov 18, 2012
    • Tom Lane's avatar
      Limit values of archive_timeout, post_auth_delay, auth_delay.milliseconds. · 83d48a81
      Tom Lane authored
      The previous definitions of these GUC variables allowed them to range
      up to INT_MAX, but in point of fact the underlying code would suffer
      overflows or other errors with large values.  Reduce the maximum values
      to something that won't misbehave.  There's no apparent value in working
      harder than this, since very large delays aren't sensible for any of
      these.  (Note: the risk with archive_timeout is that if we're late
      checking the state, the timestamp difference it's being compared to
      might overflow.  So we need some amount of slop; the choice of INT_MAX/2
      is arbitrary.)
      
      Per followup investigation of bug #7670.  Although this isn't a very
      significant fix, might as well back-patch.
      83d48a81
    • Tom Lane's avatar
      Fix syslogger to not fail when log_rotation_age exceeds 2^31 milliseconds. · 89067bc1
      Tom Lane authored
      We need to avoid calling WaitLatch with timeouts exceeding INT_MAX.
      Fortunately a simple clamp will do the trick, since no harm is done if
      the wait times out before it's really time to rotate the log file.
      Per bug #7670 (probably bug #7545 is the same thing, too).
      
      In passing, fix bogus definition of log_rotation_age's maximum value in
      guc.c --- it was numerically right, but only because MINS_PER_HOUR and
      SECS_PER_MINUTE have the same value.
      
      Back-patch to 9.2.  Before that, syslogger wasn't using WaitLatch.
      89067bc1
  9. Nov 16, 2012
    • Tom Lane's avatar
      Improve check_partial_indexes() to consider join clauses in proof attempts. · f0461cd8
      Tom Lane authored
      Traditionally check_partial_indexes() has only looked at restriction
      clauses while trying to prove partial indexes usable in queries.  However,
      join clauses can also be used in some cases; mainly, that a strict operator
      on "x" proves an "x IS NOT NULL" index predicate, even if the operator is
      in a join clause rather than a restriction clause.  Adding this code fixes
      a regression in 9.2, because previously we would take join clauses into
      account when considering whether a partial index could be used in a
      nestloop inner indexscan path.  9.2 doesn't handle nestloop inner
      indexscans in the same way, and this consideration was overlooked in the
      rewrite.  Moving the work to check_partial_indexes() is a better solution
      anyway, since the proof applies whether or not we actually use the index
      in that particular way, and we don't have to do it over again for each
      possible outer relation.  Per report from Dave Cramer.
      f0461cd8
  10. Nov 14, 2012
    • Tom Lane's avatar
      Fix the int8 and int2 cases of (minimum possible integer) % (-1). · 3b4db79e
      Tom Lane authored
      The correct answer for this (or any other case with arg2 = -1) is zero,
      but some machines throw a floating-point exception instead of behaving
      sanely.  Commit f9ac414c dealt with this
      in int4mod, but overlooked the fact that it also happens in int8mod
      (at least on my Linux x86_64 machine).  Protect int2mod as well; it's
      not clear whether any machines fail there (mine does not) but since the
      test is so cheap it seems better safe than sorry.  While at it, simplify
      the original guard in int4mod: we need only check for arg2 == -1, we
      don't need to check arg1 explicitly.
      
      Xi Wang, with some editing by me.
      3b4db79e
  11. Nov 13, 2012
    • Tom Lane's avatar
      Fix memory leaks in record_out() and record_send(). · 5355e39c
      Tom Lane authored
      record_out() leaks memory: it fails to free the strings returned by the
      per-column output functions, and also is careless about detoasted values.
      This results in a query-lifespan memory leakage when returning composite
      values to the client, because printtup() runs the output functions in the
      query-lifespan memory context.  Fix it to handle these issues the same way
      printtup() does.  Also fix a similar leakage in record_send().
      
      (At some point we might want to try to run output functions in
      shorter-lived memory contexts, so that we don't need a zero-leakage policy
      for them.  But that would be a significantly more invasive patch, which
      doesn't seem like material for back-patching.)
      
      In passing, use appendStringInfoCharMacro instead of appendStringInfoChar
      in the innermost data-copying loop of record_out, to try to shave a few
      cycles from this function's runtime.
      
      Per trouble report from Carlos Henrique Reimer.  Back-patch to all
      supported versions.
      5355e39c
    • Simon Riggs's avatar
      Skip searching for subxact locks at commit. · 31541778
      Simon Riggs authored
      At commit all standby locks are released
      for the top-level transaction, so searching
      for locks for each subtransaction is both
      pointless and costly (N^2) in the presence
      of many AccessExclusiveLocks.
      31541778
    • Simon Riggs's avatar
      Clarify docs on hot standby lock release · f66e7ab6
      Simon Riggs authored
      Andres Freund and Simon Riggs
      f66e7ab6
    • Tom Lane's avatar
      Fix multiple problems in WAL replay. · 8805ff65
      Tom Lane authored
      Most of the replay functions for WAL record types that modify more than
      one page failed to ensure that those pages were locked correctly to ensure
      that concurrent queries could not see inconsistent page states.  This is
      a hangover from coding decisions made long before Hot Standby was added,
      when it was hardly necessary to acquire buffer locks during WAL replay
      at all, let alone hold them for carefully-chosen periods.
      
      The key problem was that RestoreBkpBlocks was written to hold lock on each
      page restored from a full-page image for only as long as it took to update
      that page.  This was guaranteed to break any WAL replay function in which
      there was any update-ordering constraint between pages, because even if the
      nominal order of the pages is the right one, any mixture of full-page and
      non-full-page updates in the same record would result in out-of-order
      updates.  Moreover, it wouldn't work for situations where there's a
      requirement to maintain lock on one page while updating another.  Failure
      to honor an update ordering constraint in this way is thought to be the
      cause of bug #7648 from Daniel Farina: what seems to have happened there
      is that a btree page being split was rewritten from a full-page image
      before the new right sibling page was written, and because lock on the
      original page was not maintained it was possible for hot standby queries to
      try to traverse the page's right-link to the not-yet-existing sibling page.
      
      To fix, get rid of RestoreBkpBlocks as such, and instead create a new
      function RestoreBackupBlock that restores just one full-page image at a
      time.  This function can be invoked by WAL replay functions at the points
      where they would otherwise perform non-full-page updates; in this way, the
      physical order of page updates remains the same no matter which pages are
      replaced by full-page images.  We can then further adjust the logic in
      individual replay functions if it is necessary to hold buffer locks
      for overlapping periods.  A side benefit is that we can simplify the
      handling of concurrency conflict resolution by moving that code into the
      record-type-specfic functions; there's no more need to contort the code
      layout to keep conflict resolution in front of the RestoreBkpBlocks call.
      
      In connection with that, standardize on zero-based numbering rather than
      one-based numbering for referencing the full-page images.  In HEAD, I
      removed the macros XLR_BKP_BLOCK_1 through XLR_BKP_BLOCK_4.  They are
      still there in the header files in previous branches, but are no longer
      used by the code.
      
      In addition, fix some other bugs identified in the course of making these
      changes:
      
      spgRedoAddNode could fail to update the parent downlink at all, if the
      parent tuple is in the same page as either the old or new split tuple and
      we're not doing a full-page image: it would get fooled by the LSN having
      been advanced already.  This would result in permanent index corruption,
      not just transient failure of concurrent queries.
      
      Also, ginHeapTupleFastInsert's "merge lists" case failed to mark the old
      tail page as a candidate for a full-page image; in the worst case this
      could result in torn-page corruption.
      
      heap_xlog_freeze() was inconsistent about using a cleanup lock or plain
      exclusive lock: it did the former in the normal path but the latter for a
      full-page image.  A plain exclusive lock seems sufficient, so change to
      that.
      
      Also, remove gistRedoPageDeleteRecord(), which has been dead code since
      VACUUM FULL was rewritten.
      
      Back-patch to 9.0, where hot standby was introduced.  Note however that 9.0
      had a significantly different WAL-logging scheme for GIST index updates,
      and it doesn't appear possible to make that scheme safe for concurrent hot
      standby queries, because it can leave inconsistent states in the index even
      between WAL records.  Given the lack of complaints from the field, we won't
      work too hard on fixing that branch.
      8805ff65
  12. Nov 12, 2012
    • Tom Lane's avatar
      Check for stack overflow in transformSetOperationTree(). · 454edf1d
      Tom Lane authored
      Since transformSetOperationTree() recurses, it can be driven to stack
      overflow with enough UNION/INTERSECT/EXCEPT clauses in a query.  Add a
      check to ensure it fails cleanly instead of crashing.  Per report from
      Matthew Gerber (though it's not clear whether this is the only thing
      going wrong for him).
      
      Historical note: I think the reasoning behind not putting a check here in
      the beginning was that the check in transformExpr() ought to be sufficient
      to guard the whole parser.  However, because transformSetOperationTree()
      recurses all the way to the bottom of the set-operation tree before doing
      any analysis of the statement's expressions, that check doesn't save it.
      454edf1d
  13. Nov 09, 2012
    • Peter Eisentraut's avatar
      XSLT stylesheet: Add slash to directory name · 9b06d91a
      Peter Eisentraut authored
      Some versions of the XSLT stylesheets don't handle the missing slash
      correctly (they concatenate directory and file name without the slash).
      This might never have worked correctly.
      9b06d91a
    • Tom Lane's avatar
      Fix WaitLatch() to return promptly when the requested timeout expires. · 8354ce92
      Tom Lane authored
      If the sleep is interrupted by a signal, we must recompute the remaining
      time to wait; otherwise, a steady stream of non-wait-terminating interrupts
      could delay return from WaitLatch indefinitely.  This has been shown to be
      a problem for the autovacuum launcher, and there may well be other places
      now or in the future with similar issues.  So we'd better make the function
      robust, even though this'll add at least one gettimeofday call per wait.
      
      Back-patch to 9.2.  We might eventually need to fix 9.1 as well, but the
      code is quite different there, and the usage of WaitLatch in 9.1 is so
      limited that it's not clearly important to do so.
      
      Reported and diagnosed by Jeff Janes, though I rewrote his patch rather
      heavily.
      8354ce92
  14. Nov 08, 2012
  15. Nov 07, 2012
  16. Nov 06, 2012
  17. Nov 05, 2012
    • Tom Lane's avatar
      Fix handling of inherited check constraints in ALTER COLUMN TYPE. · 329057fd
      Tom Lane authored
      This case got broken in 8.4 by the addition of an error check that
      complains if ALTER TABLE ONLY is used on a table that has children.
      We do use ONLY for this situation, but it's okay because the necessary
      recursion occurs at a higher level.  So we need to have a separate
      flag to suppress recursion without making the error check.
      
      Reported and patched by Pavan Deolasee, with some editorial adjustments by
      me.  Back-patch to 8.4, since this is a regression of functionality that
      worked in earlier branches.
      329057fd
  18. Nov 02, 2012
  19. Nov 01, 2012
    • Tom Lane's avatar
      Limit the number of rel sets considered in consider_index_join_outer_rels. · ca2d6a6c
      Tom Lane authored
      In bug #7626, Brian Dunavant exposes a performance problem created by
      commit 3b8968f2: that commit attempted to
      consider *all* possible combinations of indexable join clauses, but if said
      clauses join to enough different relations, there's an exponential increase
      in the number of outer-relation sets considered.
      
      In Brian's example, all the clauses come from the same equivalence class,
      which means it's redundant to use more than one of them in an indexscan
      anyway.  So we can prevent the problem in this class of cases (which is
      probably the majority of real examples) by rejecting combinations that
      would only serve to add a known-redundant clause.
      
      But that still leaves us exposed to exponential growth of planning time
      when the query has a lot of non-equivalence join clauses that are usable
      with the same index.  I chose to prevent such cases by setting an upper
      limit on the number of relation sets considered, equal to ten times the
      number of index clauses considered so far.  (This sliding limit still
      allows new relsets to be added on as we move to additional index columns,
      which is probably more important than considering even more combinations of
      clauses for the previous column.)  This should keep the amount of work done
      roughly linear rather than exponential in the apparent query complexity.
      This part of the fix is pretty ad-hoc; but without a clearer idea of
      real-world cases for which this would result in markedly inferior plans,
      it's hard to see how to do better.
      ca2d6a6c
  20. Oct 31, 2012
    • Tom Lane's avatar
      Document that TCP keepalive settings read as 0 on Unix-socket connections. · ec397c90
      Tom Lane authored
      Per bug #7631 from Rob Johnson.  The code is operating as designed, but the
      docs didn't explain it.
      ec397c90
    • Alvaro Herrera's avatar
      Fix ALTER EXTENSION / SET SCHEMA · fb3590dd
      Alvaro Herrera authored
      In its original conception, it was leaving some objects into the old
      schema, but without their proper pg_depend entries; this meant that the
      old schema could be dropped, causing future pg_dump calls to fail on the
      affected database.  This was originally reported by Jeff Frost as #6704;
      there have been other complaints elsewhere that can probably be traced
      to this bug.
      
      To fix, be more consistent about altering a table's subsidiary objects
      along the table itself; this requires some restructuring in how tables
      are relocated when altering an extension -- hence the new
      AlterTableNamespaceInternal routine which encapsulates it for both the
      ALTER TABLE and the ALTER EXTENSION cases.
      
      There was another bug lurking here, which was unmasked after fixing the
      previous one: certain objects would be reached twice via the dependency
      graph, and the second attempt to move them would cause the entire
      operation to fail.  Per discussion, it seems the best fix for this is to
      do more careful tracking of objects already moved: we now maintain a
      list of moved objects, to avoid attempting to do it twice for the same
      object.
      
      Authors: Alvaro Herrera, Dimitri Fontaine
      Reviewed by Tom Lane
      fb3590dd
  21. Oct 26, 2012
    • Tom Lane's avatar
      Prefer actual constants to pseudo-constants in equivalence class machinery. · 7e951ba6
      Tom Lane authored
      generate_base_implied_equalities_const() should prefer plain Consts over
      other em_is_const eclass members when choosing the "pivot" value that
      all the other members will be equated to.  This makes it more likely that
      the generated equalities will be useful in constraint-exclusion proofs.
      Per report from Rushabh Lathia.
      7e951ba6
    • Tom Lane's avatar
      In pg_dump, dump SEQUENCE SET items in the data not pre-data section. · 725fa25e
      Tom Lane authored
      Represent a sequence's current value as a separate TableDataInfo dumpable
      object, so that it can be dumped within the data section of the archive
      rather than in pre-data.  This fixes an undesirable inconsistency between
      the meanings of "--data-only" and "--section=data", and also fixes dumping
      of sequences that are marked as extension configuration tables, as per a
      report from Marko Kreen back in July.  The main cost is that we do one more
      SQL query per sequence, but that's probably not very meaningful in most
      databases.
      
      Back-patch to 9.1, since it has the extension configuration issue even
      though not the --section switch.
      725fa25e
  22. Oct 24, 2012
    • Tom Lane's avatar
      Prevent parser from believing that views have system columns. · 1dec7c7c
      Tom Lane authored
      Views should not have any pg_attribute entries for system columns.
      However, we forgot to remove such entries when converting a table to a
      view.  This could lead to crashes later on, if someone attempted to
      reference such a column, as reported by Kohei KaiGai.
      
      This problem is corrected properly in HEAD (by removing the pg_attribute
      entries during conversion), but in the back branches we need to defend
      against existing mis-converted views.  This fix costs us an extra syscache
      lookup per system column reference, which is annoying but probably not
      really measurable in the big scheme of things.
      1dec7c7c
  23. Oct 22, 2012
    • Kevin Grittner's avatar
      Correct predicate locking for DROP INDEX CONCURRENTLY. · 523ecaf4
      Kevin Grittner authored
      For the non-concurrent case there is an AccessExclusiveLock lock
      on both the index and the heap at a time during which no other
      process is using either, before which the index is maintained and
      used for scans, and after which the index is no longer used or
      maintained.  Predicate locks can safely be moved from the index to
      the related heap relation under the protection of these locks.
      This was done prior to the introductin of DROP INDEX CONCURRENTLY
      and continues to be done for non-concurrent index drops.
      
      For concurrent index drops, the predicate locks must be moved when
      there are no index scans in progress on that index and no more can
      subsequently start, and before heap inserts stop maintaining the
      index.  As long as these conditions are guaranteed when the
      TransferPredicateLocksToHeapRelation() function is called,
      stronger locks are not needed for correctness.
      
      Kevin Grittner based on questions by Tom Lane in reviewing the
      DROP INDEX CONCURRENTLY patch and in cooperation with Andres
      Freund and Simon Riggs.
      
      Back-patch of commit 4c9d0901
      523ecaf4
  24. Oct 20, 2012
    • Tom Lane's avatar
      Fix pg_dump's handling of DROP DATABASE commands in --clean mode. · a4ef1f09
      Tom Lane authored
      In commit 4317e024, I accidentally broke
      this behavior while rearranging code to ensure that --create wouldn't
      affect whether a DATABASE entry gets put into archive-format output.
      Thus, 9.2 would issue a DROP DATABASE command in --clean mode, which is
      either useless or dangerous depending on the usage scenario.
      It should not do that, and no longer does.
      
      A bright spot is that this refactoring makes it easy to allow the
      combination of --clean and --create to work sensibly, ie, emit DROP
      DATABASE then CREATE DATABASE before reconnecting.  Ordinarily we'd
      consider that a feature addition and not back-patch it, but it seems
      silly to not include the extra couple of lines required in the 9.2
      version of the code.
      
      Per report from Guillaume Lelarge, though this is slightly more extensive
      than his proposed patch.
      a4ef1f09
    • Tom Lane's avatar
      Fix UtilityContainsQuery() to handle CREATE TABLE AS EXECUTE correctly. · 12b721a7
      Tom Lane authored
      The code seems to have been written to handle the pre-parse-analysis
      representation, where an ExecuteStmt would appear directly under
      CreateTableAsStmt.  But in reality the function is only run on
      already-parse-analyzed statements, so there will be a Query node in
      between.  We'd not noticed the bug because the function is generally
      not used at all except in extended query protocol.
      
      Per report from Robert Haas and Rushabh Lathia.
      12b721a7
  25. Oct 19, 2012
    • Tom Lane's avatar
      Fix hash_search to avoid corruption of the hash table on out-of-memory. · 8a3249f1
      Tom Lane authored
      An out-of-memory error during expand_table() on a palloc-based hash table
      would leave a partially-initialized entry in the table.  This would not be
      harmful for transient hash tables, since they'd get thrown away anyway at
      transaction abort.  But for long-lived hash tables, such as the relcache
      hash, this would effectively corrupt the table, leading to crash or other
      misbehavior later.
      
      To fix, rearrange the order of operations so that table enlargement is
      attempted before we insert a new entry, rather than after adding it
      to the hash table.
      
      Problem discovered by Hitoshi Harada, though this is a bit different
      from his proposed patch.
      8a3249f1
    • Tom Lane's avatar
      Fix ruleutils to print "INSERT INTO foo DEFAULT VALUES" correctly. · 645984e4
      Tom Lane authored
      Per bug #7615 from Marko Tiikkaja.  Apparently nobody ever tried this
      case before ...
      645984e4
    • Simon Riggs's avatar
      Fix orphan on cancel of drop index concurrently. · c5675357
      Simon Riggs authored
      Canceling DROP INDEX CONCURRENTLY during
      wait could allow an orphaned index to be
      left behind which could not be dropped.
      
      Backpatch to 9.2
      
      Andres Freund, tested by Abhijit Menon-Sen
      c5675357
  26. Oct 18, 2012
    • Andrew Dunstan's avatar
      Use a more portable platform test. · f61013a4
      Andrew Dunstan authored
      f61013a4
    • Heikki Linnakangas's avatar
      Further tweaking of the readfile() function in pg_ctl. · e7bab081
      Heikki Linnakangas authored
      Don't leak a file descriptor if the file is empty or we can't read its size.
      
      Expect there to be a newline at the end of the last line, too. If there
      isn't, ignore anything after the last newline. This makes it a tiny bit
      more robust in case the file is appended to concurrently, so that we don't
      return the last line if it hasn't been fully written yet. And this makes
      the code a bit less obscure, anyway. Per Tom Lane's suggestion.
      
      Backpatch to all supported branches.
      e7bab081
Loading