Home > mailing lists

Re: WIP: bloom filter in Hash Joins with batches - Mailing list pgsql-hackers

From	Tomas Vondra
Subject	Re: WIP: bloom filter in Hash Joins with batches
Date	December 17, 2015 15:35:00
Msg-id	5672D617.9060601@2ndquadrant.com Whole thread Raw
In response to	Re: WIP: bloom filter in Hash Joins with batches ("Shulgin, Oleksandr" <oleksandr.shulgin@zalando.de>)
Responses	Re: WIP: bloom filter in Hash Joins with batches
List	pgsql-hackers

Tree view

Hi,

On 12/17/2015 10:50 AM, Shulgin, Oleksandr wrote:
> On Tue, Dec 15, 2015 at 11:30 PM, Tomas Vondra
> <tomas.vondra@2ndquadrant.com <mailto:tomas.vondra@2ndquadrant.com>> wrote:
>
>> Attached is a spreadsheet with results for various work_mem
>> values, and also with a smaller data set (just 30M rows in the fact
>> table), which easily fits into memory. Yet it shows similar gains,
>> shaving off ~40% in the best case, suggesting that this is not just
>> thanks to reduction of I/O when forcing the temp files to disk.
>
> A neat idea! Have you possibly tried to also collect statistics
> about actual false-positive rates and filter allocation sizes in
> every of the collected data points?

The patch counts and prints the total number of lookups, and the number 
of eliminated rows. The bloom filter is currently sized for 5% false 
positives rate, and the numbers I've seen match that.

I think ultimately we'll need to measure the false positive rate, so 
that we can use it to dynamically disable the bloom filter if it gets 
inefficient. Also maybe put some of that into EXPLAIN ANALYZE.

regards

-- 
Tomas Vondra                  http://www.2ndQuadrant.com
PostgreSQL Development, 24x7 Support, Remote DBA, Training & Services

pgsql-hackers by date:

From: Andres Freund
Date: 17 December 2015, 15:14:26
Subject: Re: Patch: fix lock contention for HASHHDR.mutex

From: Dean Rasheed
Date: 17 December 2015, 15:38:16
Subject: Re: Inaccurate results from numeric ln(), log(), exp() and pow()

Re: WIP: bloom filter in Hash Joins with batches - Mailing list pgsql-hackers

Previous

Next