Home > mailing lists

Re: COMMIT NOWAIT Performance Option - Mailing list pgsql-hackers

From	Gregory Stark
Subject	Re: COMMIT NOWAIT Performance Option
Date	February 28, 2007 17:13:14
Msg-id	87d53uugt7.fsf@stark.xeocode.com Whole thread Raw
In response to	Re: COMMIT NOWAIT Performance Option ("Jonah H. Harris" <jonah.harris@gmail.com>)
Responses	Re: COMMIT NOWAIT Performance Option Re: COMMIT NOWAIT Performance Option
List	pgsql-hackers

Tree view

"Jonah H. Harris" <jonah.harris@gmail.com> writes:

> Which is, of course, how everyone else does it.  

I happen to agree with your conclusion but this line of argument is
exceptionally unconvincing. In fact in this crowd you'll tend to turn people
off and lose people if you say things like that rather than convince anyone of
anything.

> Even pages from the last checkpoint would be a killer.

Hm that's an interesting thought. We only really have to check pages that
would have received a full page write since the last checkpoint. So if we made
turning full page writes off still record the page ids of the pages it *would*
have written then we just need the code that normally replays full page writes
to check the checksum if the page data isn't available.

I can't see how that would be a killer. No matter how large a system you're
talking about you're going to tune checkpoints to be occurring at about the
same interval anyways. So the amount of time the wal replay checksum checking
takes will be more or less constant.

In fact we're already reading in most, if not all, of those pages anyways
since we're replaying wal records that touch them after all. Would we even
have to do anything extra? If we check checksums whenever we read in a page
surely the wal replay code would automatically detect any torn pages without
any special attention.

That also makes it clear just how awful full page writes are for scalability.
As you scale up the system but try to keep checkpoint intervals constant
you're less and less likely to ever see the same page twice between two
checkpoints. So as you scale the system up more and more of the wal will
consist of full page writes.

> All of the databases (Oracle, SQL Server, DB2) have a way to perform a
> database corruption check which does go out and verify all checksums.

Which is pretty poor design. If we implemented a fsck-like tool I would be far
more interested in checking things like "tuples don't overlap" or "hint bits
are set correctly" and so on. Checksums do nothing to protect against software
failures which is the only kind of failure with a good rationale for being in
an external tool.

--  Gregory Stark EnterpriseDB          http://www.enterprisedb.com

pgsql-hackers by date:

From: Tom Lane
Date: 28 February 2007, 17:04:49
Subject: Re: Compilation errors

From: Oleg Bartunov
Date: 28 February 2007, 17:24:13
Subject: Re: SOC & user quotas

Re: COMMIT NOWAIT Performance Option - Mailing list pgsql-hackers

Previous

Next