Home > mailing lists
Join optimization for inheritance tables - Mailing list pgsql-hackers

From	Nedyalko Borisov
Subject	Join optimization for inheritance tables
Date	June 26, 2009 14:20:25
Msg-id	4A4500AA.3050206@asterdata.com Whole thread Raw
Responses	Re: Join optimization for inheritance tables
List	pgsql-hackers
Tree view
Hi all,

We are working with Aster for the summer and we would like to bounce some ideas that we are having for some possible
PostgreSQLextensions. In order to describe our ideas we will use the following example:
 

create table msg(  msg_id int,  msg text );

create table receiver(  msg_id int,  user_id int,  ts timestamp );


create table msg_100(  check ( 1 <= msg_id and msg_id < 100 )  ) inherits (msg);

create table msg_200(  check ( 100 <= msg_id and msg_id < 200 )  ) inherits (msg);

create table msg_300(  check ( 200 <= msg_id and msg_id < 300 )  ) inherits (msg);


create table receiver_100(  check ( 1 <= msg_id and msg_id < 100 )  ) inherits (receiver);

create table receiver_200(  check ( 100 <= msg_id and msg_id < 200 )  ) inherits (receiver);

create table receiver_300(  check ( 200 <= msg_id and msg_id < 300 )  ) inherits (receiver);


When we are issuing queries on one of the parent tables, like,
   SELECT * FROM msg WHERE msg_id BETWEEN 50 AND 70;

PostgreSQL is smart enough to filter out child tables with check constraints that are refuted by the filter conditions.
Inthis example, the optimizer will pick a plan that only considers the parent table 'msg' and one of the child tables
'msg_100':
   Result      ->  Append            ->  Seq Scan on msg                  Filter: ((msg_id >= 50) AND (msg_id <= 70))
        ->  Seq Scan on msg_100 msg                  Filter: ((msg_id >= 50) AND (msg_id <= 70))
 

Plan costs are removed for simplicity of the presentation.

Now, if we issue a join query between the two parent tables, like,
   SELECT * FROM msg m JOIN receiver r ON m.msg_id = r.msg_id;

the execution plan will be:
   Merge Join      Merge Cond: (m.msg_id = r.msg_id)      ->  Sort            Sort Key: m.msg_id            ->  Append
               ->  Seq Scan on msg m                  ->  Seq Scan on msg_100 m                  ->  Seq Scan on
msg_200m                  ->  Seq Scan on msg_300 m      ->  Sort            Sort Key: r.msg_id            ->  Append
              ->  Seq Scan on receiver r                  ->  Seq Scan on receiver_100 r                  ->  Seq Scan
onreceiver_200 r                  ->  Seq Scan on receiver_300 r
 


During the planning phase, the optimizer treats an entire hierarchy as a single entity. Hence, it first considers the
mostefficient way to create the append paths for the two hierarchies, and then the best way to join them. However,
thereare some optimizations that are possible here, similar to the table filtering described above. In particular,
insteadof joining the two appends, we could "push down" the join to the child relations - that is, create pairwise
joinsbetween the children and then append the join results together.
 

Based on the check conditions of the children and the join predicate, it is possible to filter out joins that cannot
produceany results. For example, joining 'msg_100' with 'receiver_300' is redundant since the check constraints of
thesetwo tables do not overlap. Tuples in 'msg_100' have 'msg_id' between 1 and 100, whereas tuples in 'receiver_300'
have'msg_id' between 200 and 300. Therefore, no tuples can be produce from this join.
 

A plan with such optimizations could be:
 Result   ->  Append        ->  Hash Join              Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq
Scanon msg msg              ->  Hash                    ->  Seq Scan on receiver receiver        ->  Hash Join
   Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq Scan on msg msg              ->  Hash
  ->  Seq Scan on receiver_100 receiver        ->  Hash Join              Hash Cond: (msg.msg_id = receiver.msg_id)
        ->  Seq Scan on msg msg              ->  Hash                    ->  Seq Scan on receiver_200 receiver
-> Hash Join              Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq Scan on msg msg
-> Hash                    ->  Seq Scan on receiver_300 receiver        ->  Hash Join              Hash Cond:
(msg.msg_id= receiver.msg_id)              ->  Seq Scan on msg_100 msg              ->  Hash                    ->  Seq
Scanon receiver receiver        ->  Hash Join              Hash Cond: (msg.msg_id = receiver.msg_id)              ->
SeqScan on msg_100 msg              ->  Hash                    ->  Seq Scan on receiver_100 receiver        ->  Hash
Join             Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq Scan on msg_200 msg              ->
Hash                   ->  Seq Scan on receiver receiver        ->  Hash Join              Hash Cond: (msg.msg_id =
receiver.msg_id)             ->  Seq Scan on msg_200 msg              ->  Hash                    ->  Seq Scan on
receiver_200receiver        ->  Hash Join              Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq
Scanon msg_300 msg              ->  Hash                    ->  Seq Scan on receiver receiver        ->  Hash Join
       Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq Scan on msg_300 msg              ->  Hash
          ->  Seq Scan on receiver_300 receiver
 

The parent tables don't have any check constraints in the above example and hence will need to join with all the child
relations.However, based on usage and feedback among Aster's customers, we believe that it is common practice not to
storeany data in the parent tables. Therefore, we were thinking of creating an "Empty Check Constraint". This
constraintwill enforce that a particular table is empty and will prevent the insertion of any rows into it. If the
parenttables have the empty check constraint, then the optimizer will be able to further eliminate some of the child
joins.

A plan with all optimizations and features suggested could be:
 Result   ->  Append        ->  Hash Join              Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq
Scanon msg_100 msg              ->  Hash                    ->  Seq Scan on receiver_100 receiver        ->  Hash Join
           Hash Cond: (msg.msg_id = receiver.msg_id)              ->  Seq Scan on msg_200 msg              ->  Hash
              ->  Seq Scan on receiver_200 receiver        ->  Hash Join              Hash Cond: (msg.msg_id =
receiver.msg_id)             ->  Seq Scan on msg_300 msg              ->  Hash                    ->  Seq Scan on
receiver_300receiver
 


In summary, we are making two suggestions:
1. Extend the optimizer to consider joins between child tables when hierarchies are joined together.
2. Add the "Empty Check Constraint", which would enforce that a particular table is to remain empty.

We've created some basic implementation, just to verify the benefits of these ideas. The initial results seem promising
aswe observed decreased execution time and for large data set the query execution time could be several times faster
thanbefore.
 
We would greatly appreciate your comments, thoughts or suggestions that you might have regarding these ideas.

Regards,
Nedyalko Borisov and Herodotos Herodotou
pgsql-hackers by date:
From: Kris Jurka
Date: 26 June 2009, 14:05:39
Subject: Re: gettext version problem exposed by buildfarm failures on "canary"
From: "Kevin Grittner"
Date: 26 June 2009, 17:13:28
Subject: Re: 8.4 open item: copy performance regression?
Join optimization for inheritance tables - Mailing list pgsql-hackers

Previous

Next