Thread: left outer join vs subplan
Hi! I found two queries which do the same thing but differ wildly in execution time: about 10^3 times in the test suite, and up to 10^5 times on real data. This is observed on 8.1-current, 8.2-current and CVS HEAD. Interestingly, even without the LIMIT clause the two queries take approximately the same time, yet their estimated costs differ by a factor of 30. Is there any way to tweak pgsql so that it produces a more reasonable plan for the first query? These queries are auto-generated, so they may be more complex in practice; I chose the simplest example.

First query:

explain analyze
select *
from
    a
    left outer join (
        select b.id, sum(b.val)
        from b
        group by b.id
    ) bagg
    on bagg.id = a.id
where
    a.id > 10000
order by a.addon, a.id
limit 100;

 Limit  (cost=9923.36..9923.61 rows=100 width=20) (actual time=2232.437..2233.273 rows=100 loops=1)
   ->  Sort  (cost=9923.36..10031.41 rows=43221 width=20) (actual time=2232.428..2232.709 rows=100 loops=1)
         Sort Key: a.addon, a.id
         Sort Method:  top-N heapsort  Memory: 24kB
         ->  Merge Right Join  (cost=0.00..8271.48 rows=43221 width=20) (actual time=313.198..2052.559 rows=40000 loops=1)
               Merge Cond: (b.id = a.id)
               ->  GroupAggregate  (cost=0.00..5725.41 rows=53292 width=12) (actual time=0.266..1422.522 rows=50000 loops=1)
                     ->  Index Scan using bidx on b  (cost=0.00..4309.26 rows=150000 width=12) (actual time=0.217..547.402 rows=150000 loops=1)
               ->  Index Scan using a1idx on a  (cost=0.00..1256.90 rows=40551 width=8) (actual time=0.171..155.073 rows=40000 loops=1)
                     Index Cond: (a.id > 10000)
 Total runtime: 2233.940 ms

Second query:

explain analyze
select
    a.id,
    (
        select sum(b.val)
        from b
        where b.id = a.id
    ) as val
from a
where id > 10000
order by a.addon, a.id
limit 100;

 Limit  (cost=0.00..839.04 rows=100 width=8) (actual time=0.339..7.436 rows=100 loops=1)
   ->  Index Scan using a2idx on a  (cost=0.00..340241.08 rows=40551 width=8) (actual time=0.332..6.865 rows=100 loops=1)
         Index Cond: (id > 10000)
         SubPlan
           ->  Aggregate  (cost=8.33..8.34 rows=1 width=8) (actual time=0.048..0.051 rows=1 loops=100)
                 ->  Index Scan using bidx on b  (cost=0.00..8.32 rows=3 width=8) (actual time=0.016..0.027 rows=3 loops=100)
                       Index Cond: (id = $0)

How to reproduce:

select generate_series as id, (random()*100)::int as addon into a from generate_series(1,50000);
create unique index a1idx on a (id);
create unique index a2idx on a (addon, id);
select id, random() as val into b from generate_series(1,50000) as id, generate_series(1,3) as foo;
create index bidx on b (id);
vacuum analyze;

--
Teodor Sigaev                                   E-mail: teodor@sigaev.ru
                                                   WWW: http://www.sigaev.ru/
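A possible stop-gap while the planner cannot do this itself (a hand-written sketch, not taken from the thread): copy the outer range predicate into the aggregated subquery. Since bagg rows only ever join to rows with a.id > 10000, restricting b the same way cannot change the result, and it shrinks the GroupAggregate input, although it does nothing for the LIMIT problem:

explain analyze
select *
from
    a
    left outer join (
        select b.id, sum(b.val)
        from b
        where b.id > 10000   -- copied by hand from the filter on a.id
        group by b.id
    ) bagg
    on bagg.id = a.id
where
    a.id > 10000
order by a.addon, a.id
limit 100;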
On Wed, 2007-09-05 at 20:11 +0400, Teodor Sigaev wrote:

> I found two queries which do the same thing but differ wildly in execution
> time: about 10^3 times in the test suite, and up to 10^5 times on real data.
> This is observed on 8.1-current, 8.2-current and CVS HEAD. Interestingly,
> even without the LIMIT clause the two queries take approximately the same
> time, yet their estimated costs differ by a factor of 30. Is there any way
> to tweak pgsql so that it produces a more reasonable plan for the first
> query?

Times I get are:
Q1: ~950ms
Q2: ~5ms

> These queries are auto-generated, so they may be more complex in practice;
> I chose the simplest example.

I think we need to build up a library of autogenerated queries, so we can do things which address multiple use cases. Can you tell us more about who/what generated it, so we can research?

The query formulation does seem a fairly common one.

> First query:
>
> explain analyze
> select *
> from
>     a
>     left outer join (
>         select b.id, sum(b.val)
>         from b
>         group by b.id
>     ) bagg
>     on bagg.id = a.id
> where
>     a.id > 10000
> order by a.addon, a.id
> limit 100;
>
>  Limit  (cost=9923.36..9923.61 rows=100 width=20) (actual time=2232.437..2233.273 rows=100 loops=1)
>    ->  Sort  (cost=9923.36..10031.41 rows=43221 width=20) (actual time=2232.428..2232.709 rows=100 loops=1)
>          Sort Key: a.addon, a.id
>          Sort Method:  top-N heapsort  Memory: 24kB
>          ->  Merge Right Join  (cost=0.00..8271.48 rows=43221 width=20) (actual time=313.198..2052.559 rows=40000 loops=1)
>                Merge Cond: (b.id = a.id)
>                ->  GroupAggregate  (cost=0.00..5725.41 rows=53292 width=12) (actual time=0.266..1422.522 rows=50000 loops=1)
>                      ->  Index Scan using bidx on b  (cost=0.00..4309.26 rows=150000 width=12) (actual time=0.217..547.402 rows=150000 loops=1)
>                ->  Index Scan using a1idx on a  (cost=0.00..1256.90 rows=40551 width=8) (actual time=0.171..155.073 rows=40000 loops=1)
>                      Index Cond: (a.id > 10000)
>  Total runtime: 2233.940 ms

The value of sum(b.val) is never used in the query, so the aggregate itself could be discarded. I suspect there are other conditions you aren't showing us that would make this impossible?

The aggregate prevents the condition bagg.id = a.id from being pushed down so that we know b.id = a.id. If we knew that then we could use b.id = ? as an index condition to retrieve the rows.

Since we can't use the best technique, we use another. That then hits a third optimization problem. When an IndexScan is used to enforce order, we don't estimate how much of the table needs to be scanned before we start hitting rows. In the example you give we need to scan 65% of the table using an IndexScan before we hit any rows. So we would probably be better off doing a Sort<-SeqScan to apply the condition.

I think we need to do all 3 eventually.

--
 Simon Riggs
 2ndQuadrant  http://www.2ndQuadrant.com
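To make the first point concrete (a hand-worked sketch of the equivalence, not something the planner currently derives): because the subquery is grouped by b.id, the left join can neither filter nor duplicate rows of a, so a variant that never references the aggregate column is equivalent to scanning a alone:

-- the aggregate column is not selected anywhere
select a.*
from a
left outer join (
    select b.id, sum(b.val)
    from b
    group by b.id
) bagg on bagg.id = a.id
where a.id > 10000
order by a.addon, a.id
limit 100;

-- semantically the same as simply:
select a.*
from a
where a.id > 10000
order by a.addon, a.id
limit 100;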
2007/9/6, Simon Riggs <simon@2ndquadrant.com>:

> The query formulation does seem a fairly common one.
>
> > First query:
> >
> > explain analyze
> > select *
> > from
> >     a
> >     left outer join (
> >         select b.id, sum(b.val)
> >         from b
> >         group by b.id
> >     ) bagg
> >     on bagg.id = a.id
> > where
> >     a.id > 10000
> > order by a.addon, a.id
> > limit 100;
>
> The value of sum(b.val) is never used in the query, so the aggregate
> itself could be discarded. I suspect there are other conditions you
> aren't showing us that would make this impossible?

The value of sum(b.val) is being output by the "select *", so saying it's never used is an oversimplification. But it is indeed not used in any join or filter. That should be enough to allow the optimization...

> The aggregate prevents the condition bagg.id = a.id from being pushed
> down so that we know b.id = a.id. If we knew that then we could use
> b.id = ? as an index condition to retrieve the rows.

That's exactly the point... But if we all can see it, maybe it's possible to code it?

Cheers, Filip Rembiałkowski
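For what it's worth, the equivalence of the two formulations can be spot-checked on the test data (a diagnostic sketch; the alias val is added here only for the comparison, and since val is random() the floating-point summation order could in principle produce tiny differences):

-- expected to return zero rows if both formulations agree
select a.id, bagg.val
from a
left outer join (
    select b.id, sum(b.val) as val
    from b
    group by b.id
) bagg on bagg.id = a.id
where a.id > 10000
except
select a.id, (select sum(b.val) from b where b.id = a.id) as val
from a
where a.id > 10000;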
> I think we need to build up a library of autogenerated queries, so we
> can do things which address multiple use cases. Can you tell us more
> about who/what generated it, so we can research?

Sorry, I can't publish much information; it is one of the biggest Russian software companies, and it is trying to migrate from MS SQL to PostgreSQL. MS SQL can optimize such queries into a form similar to the second query.

> The query formulation does seem a fairly common one.

It seems so to me too. Instead of a SUM aggregate it can be MIN/MAX/AVG etc., or a more complex subquery. But pgsql usually optimizes non-aggregate subqueries rather well.

> The value of sum(b.val) is never used in the query, so the aggregate
> itself could be discarded. I suspect there are other conditions you
> aren't showing us that would make this impossible?

No - it's select *, i.e. all fields from the a and bagg tables.

> The aggregate prevents the condition bagg.id = a.id from being pushed
> down so that we know b.id = a.id. If we knew that then we could use
> b.id = ? as an index condition to retrieve the rows.

In this case it is safe to push down the clause b.id = a.id. BTW, does pgsql understand that the query 'select id, sum() ... group by id' produces a result with unique id?

> Since we can't use the best technique, we use another. That then hits a
> third optimization problem. When an IndexScan is used to enforce order,
> we don't estimate how much of the table needs to be scanned before we
> start hitting rows. In the example you give we need to scan 65% of the

Why 65%? a.addon has only 100 unique values, and among the first 100 tuples in the index (a2idx) there will be about 80 tuples with id > 10000.

> table using an IndexScan before we hit any rows. So we would probably be
> better off doing a Sort<-SeqScan to apply the condition.

That's true for the query without the LIMIT clause (the second query is about 10% slower than the first one - both without LIMIT).

--
Teodor Sigaev                                   E-mail: teodor@sigaev.ru
                                                   WWW: http://www.sigaev.ru/
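The ~80 figure above is easy to check on the test data (a diagnostic sketch): count how many of the first 100 rows in (addon, id) order pass the filter. With ids 1..50000 and a cutoff of 10000, roughly 80% of any prefix of the a2idx order should qualify:

select sum(case when id > 10000 then 1 else 0 end) as passing
from (
    select id
    from a
    order by addon, id
    limit 100
) as first_hundred;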