Re: Nested loops overpriced - Mailing list pgsql-performance

From Tom Lane
Subject Re: Nested loops overpriced
Date
Msg-id 14164.1178812025@sss.pgh.pa.us
Whole thread Raw
In response to Re: Nested loops overpriced  (Peter Eisentraut <peter_e@gmx.net>)
List pgsql-performance
Peter Eisentraut <peter_e@gmx.net> writes:
> Am Mittwoch, 9. Mai 2007 19:40 schrieb Tom Lane:
>> Hmmm ... I see at least part of the problem, which is that email_header
>> is joined twice in this query, which means that it's counted twice in
>> figuring the total volume of pages competing for cache space. So the
>> thing thinks cache space is oversubscribed nearly 3X when in reality
>> the database is fully cached.

> I should add that other, similar queries in this database that do not
> involve joining the same table twice produce seemingly optimal plans.
> (It picks hash joins which are actually faster than nested loops.)

It strikes me that in a situation like this, where the same table is
being scanned twice by concurrent indexscans, we ought to amortize the
fetches across *both* scans rather than treating them independently;
so there are actually two different ways in which we're being too
pessimistic about the indexscanning cost.

Difficult to see how to fix that in the current planner design however;
since it's a bottom-up process, we have to cost the individual scans
without any knowledge of what approach will be chosen for other scans.

            regards, tom lane

pgsql-performance by date:

Previous
From: Peter Eisentraut
Date:
Subject: Re: Nested loops overpriced
Next
From: Bill Moran
Date:
Subject: Re: [PERFORM] REVISIT specific query (not all) on Pg8 MUCH slowerthan Pg7