Обсуждение: Upgrade streaming replication and log-shipping standby servers

Поиск
Список
Период
Сортировка

Upgrade streaming replication and log-shipping standby servers

От
Victor Sudakov
Дата:
Dear Colleagues,

When upgrading to a new major version, pg_upgrade documentation
https://www.postgresql.org/docs/current/pgupgrade.html#PGUPGRADE-STEP-REPLICAS
states that pg_upgrade can be used on a master server. After upgrading
the PostgreSQL version on standbys, however, one must either pull the
data from scratch with pg_basebackup, or run rsync from master to
standby.

Is running pg_upgrade on replicas not supported and why?

-- 
Victor Sudakov,  VAS4-RIPE, VAS47-RIPN
2:5005/49@fidonet http://vas.tomsk.ru/



Re: Upgrade streaming replication and log-shipping standby servers

От
Greg Spiegelberg
Дата:
On Mon, Jun 15, 2020 at 4:46 AM Victor Sudakov <vas@sibptus.ru> wrote:
Dear Colleagues,

When upgrading to a new major version, pg_upgrade documentation
https://www.postgresql.org/docs/current/pgupgrade.html#PGUPGRADE-STEP-REPLICAS
states that pg_upgrade can be used on a master server. After upgrading
the PostgreSQL version on standbys, however, one must either pull the
data from scratch with pg_basebackup, or run rsync from master to
standby.

Is running pg_upgrade on replicas not supported and why?


Hi Victor,

I agree it would be extraordinarily convenient if pg_upgrade could operate on standbys.

Via experience and a quick code read of pg_upgrade source confirms that the new, initialized and as yet unmodified cluster is modified in preparation for an upgrade.  Some of those new cluster modifications include execution of analyze, freezing rows, resetting WAL, modification of controldata(?) and restoration of the old schema.  These and others are evident running the utility in verbose mode. Setting aside a standby must be promoted for these modifications, there is no current way to guarantee these modifications will be consistently done on both primary and all standbys. A standby inconsistent with the primary is no standby at all.

Perhaps not the right place but I would argue that IF pg_upgrade could 
1) pause after all these new cluster modifications at or soon after "Restoring database schemas in the new cluster",
2) permit standbys to replicate the new cluster,
3) continue on primary, and
4) run on standbys picking up from where it left off at step 2 perhaps in coordination with primary
then it would be very nice but I'm making assumptions and painting with a wide brush.  :)

-Greg 

Re: Upgrade streaming replication and log-shipping standby servers

От
Victor Sudakov
Дата:
Greg Spiegelberg wrote:
> 
> >
> > When upgrading to a new major version, pg_upgrade documentation
> >
> > https://www.postgresql.org/docs/current/pgupgrade.html#PGUPGRADE-STEP-REPLICAS
> > states that pg_upgrade can be used on a master server. After upgrading
> > the PostgreSQL version on standbys, however, one must either pull the
> > data from scratch with pg_basebackup, or run rsync from master to
> > standby.
> >
> > Is running pg_upgrade on replicas not supported and why?
> >
> >
> Hi Victor,
> 
> I agree it would be extraordinarily convenient if pg_upgrade could operate
> on standbys.
> 
> Via experience and a quick code read of pg_upgrade source confirms that the
> new, initialized and as yet unmodified cluster is modified in preparation
> for an upgrade.  Some of those new cluster modifications include execution
> of analyze, freezing rows, resetting WAL, modification of
> controldata(?) and restoration of the old schema.  These and others are
> evident running the utility in verbose mode. Setting aside a standby must
> be promoted for these modifications, there is no current way to
> guarantee these modifications will be consistently done on both primary and
> all standbys. A standby inconsistent with the primary is no standby at all.

Hi Greg,

Thank you for the detailed explanation.
> 
> Perhaps not the right place but I would argue that IF pg_upgrade could
> 1) pause after all these new cluster modifications at or soon after
> "Restoring database schemas in the new cluster",
> 2) permit standbys to replicate the new cluster,
> 3) continue on primary, and
> 4) run on standbys picking up from where it left off at step 2 perhaps in
> coordination with primary
> then it would be very nice but I'm making assumptions and painting with a
> wide brush.  :)
> 

Sounds very tricky.

-- 
Victor Sudakov,  VAS4-RIPE, VAS47-RIPN
2:5005/49@fidonet http://vas.tomsk.ru/



Re: Upgrade streaming replication and log-shipping standby servers

От
Bruce Momjian
Дата:
On Tue, Jun 16, 2020 at 05:48:10AM -0600, Greg Spiegelberg wrote:
> On Mon, Jun 15, 2020 at 4:46 AM Victor Sudakov <vas@sibptus.ru> wrote:
> 
>     Dear Colleagues,
> 
>     When upgrading to a new major version, pg_upgrade documentation
>     https://www.postgresql.org/docs/current/pgupgrade.html#
>     PGUPGRADE-STEP-REPLICAS
>     states that pg_upgrade can be used on a master server. After upgrading
>     the PostgreSQL version on standbys, however, one must either pull the
>     data from scratch with pg_basebackup, or run rsync from master to
>     standby.
> 
>     Is running pg_upgrade on replicas not supported and why?
> 
> 
> 
> Hi Victor,
> 
> I agree it would be extraordinarily convenient if pg_upgrade could operate on
> standbys.

Uh, pg_upgrade is already documented as being able to upgrade standby
servers:

    https://www.postgresql.org/docs/12/pgupgrade.html
    
    8. Prepare for standby server upgrades

Am I missing something?  The sync is not sync'ing the file contents,
just the hard links.  You can see the details in the docs:

    f. Run rsync
    
    When using link mode, standby servers can be quickly upgraded using
    rsync. To accomplish this, from a directory on the primary server that
    is above the old and new database cluster directories, run this on the
    primary for each standby server:
    
    rsync --archive --delete --hard-links --size-only --no-inc-recursive
    old_cluster new_cluster remote_dir
    
    where old_cluster and new_cluster are relative to the current directory
    on the primary, and remote_dir is above the old and new cluster
    directories on the standby. The directory structure under the specified
    directories on the primary and standbys must match. Consult the rsync
    manual page for details on specifying the remote directory, e.g.
    
    rsync --archive --delete --hard-links --size-only --no-inc-recursive
    /opt/PostgreSQL/9.5 \
          /opt/PostgreSQL/9.6 standby.example.com:/opt/PostgreSQL
    
    You can verify what the command will do using rsync's --dry-run option.
    While rsync must be run on the primary for at least one standby, it is
    possible to run rsync on an upgraded standby to upgrade other standbys,
    as long as the upgraded standby has not been started.
    
    What this does is to record the links created by pg_upgrade's link mode
    that connect files in the old and new clusters on the primary server. It
    then finds matching files in the standby's old cluster and creates links
    for them in the standby's new cluster. Files that were not linked on the
    primary are copied from the primary to the standby. (They are usually
    small.) This provides rapid standby upgrades. Unfortunately, rsync
    needlessly copies files associated with temporary and unlogged tables
    because these files don't normally exist on standby servers.

-- 
  Bruce Momjian  <bruce@momjian.us>        https://momjian.us
  EnterpriseDB                             https://enterprisedb.com

  The usefulness of a cup is in its emptiness, Bruce Lee




Re: Upgrade streaming replication and log-shipping standby servers

От
Greg Spiegelberg
Дата:
On Tue, Jun 16, 2020 at 11:00 AM Bruce Momjian <bruce@momjian.us> wrote:

Uh, pg_upgrade is already documented as being able to upgrade standby
servers:

        https://www.postgresql.org/docs/12/pgupgrade.html


I stand corrected.  Next time I shall RT-Whole-FM.
Thanks Bruce.

-Greg 

Re: Upgrade streaming replication and log-shipping standby servers

От
Bruce Momjian
Дата:
On Tue, Jun 16, 2020 at 11:28:27AM -0600, Greg Spiegelberg wrote:
> On Tue, Jun 16, 2020 at 11:00 AM Bruce Momjian <bruce@momjian.us> wrote:
> 
> 
>     Uh, pg_upgrade is already documented as being able to upgrade standby
>     servers:
> 
>             https://www.postgresql.org/docs/12/pgupgrade.html
> 
> 
> 
> I stand corrected.  Next time I shall RT-Whole-FM.

It is a long, complex set of instructions, and people who try to
short-circuit it can end up not using pg_upgrade to its full potential.

-- 
  Bruce Momjian  <bruce@momjian.us>        https://momjian.us
  EnterpriseDB                             https://enterprisedb.com

  The usefulness of a cup is in its emptiness, Bruce Lee