Home > mailing lists

Re: Low perfomance SUM and Group by large databse - Mailing list pgsql-performance

From	Robert Haas
Subject	Re: Low perfomance SUM and Group by large databse
Date	July 2, 2010 14:21:55
Msg-id	AANLkTimmthOhlu2nflXje4noQouyiRwgZvkq60PRAJOB@mail.gmail.com Whole thread Raw
In response to	Re: Low perfomance SUM and Group by large databse ("Sergio Charpinel Jr." <sergiocharpinel@gmail.com>)
List	pgsql-performance

Tree view

On Tue, Jun 29, 2010 at 7:59 AM, Sergio Charpinel Jr.
<sergiocharpinel@gmail.com> wrote:
> One more question about two specifics query behavior: If I add "AND (ip_dst
> = x.x.x.x)", it uses another plan and take a much more time. In both of
> them, I'm using WHERE clause. Why this behavior?

With either query, the planner is choosing to scan backward through
the acct_2010_26_pk index to get the rows in descending order by the
"bytes" column.  It keeps scanning until it finds 50 rows that match
the WHERE clause.  With just the critieria on stamp_inserted, matches
are pretty common, so it doesn't have to scan very far before finding
50 suitable rows.  But when you add the ip_dst = 'x.x.x.x' criterion,
suddenly a much smaller percentage of the rows match and so it has to
read much further into the index before it finds 50 that do.

A second index on just the ip_dst column might help a lot - then it
could consider index-scanning for the matching rows and sorting them
afterwards.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise Postgres Company

pgsql-performance by date:

From: "Benjamin Krajmalnik"
Date: 02 July 2010, 13:40:17
Subject: Re: Question about partitioned query behavior

From: Rajesh Kumar Mallah
Date: 02 July 2010, 16:22:43
Subject: Re: how to (temporarily) disable/minimize benefits of disk block cache or postgresql shared buffer

Re: Low perfomance SUM and Group by large databse - Mailing list pgsql-performance

Previous

Next