Re: Proposed Patch to Improve Performance of Multi-Batch Hash Join for Skewed Data Sets - Mailing list pgsql-hackers

From Robert Haas
Subject Re: Proposed Patch to Improve Performance of Multi-Batch Hash Join for Skewed Data Sets
Msg-id 603c8f070902241918k5274a862ua8b206db145912af@mail.gmail.com
In response to Re: Proposed Patch to Improve Performance of Multi-Batch Hash Join for Skewed Data Sets  ("Lawrence, Ramon" <ramon.lawrence@ubc.ca>)
List pgsql-hackers
> Joshua sent us some preliminary data with this query and others and
> indicated that we could post it.  He wanted time to clean it up
> and re-run some experiments, but the data is generally good and the
> algorithm performs as expected.  I have attached this data to the
> post.  Note that the last set of data (although labelled as Z7) is
> actually an almost zero skew database and represents the worst-case
> for the algorithm (for most queries the optimization is not even used).

Sadly, there seem to be a number of cases in the Z7 database where the
optimization makes things significantly worse (specifically, queries
2, 3, and 7, but especially query 3).  Have you investigated what is
going on there?  I had thought that we had sufficient safeguards in
place to prevent this optimization from kicking in in cases where it
doesn't help, but it seems not.  There will certainly be real-world
databases that are more like Z7 than Z1.

...Robert

