Home > mailing lists

Re: parallel query evaluation - Mailing list pgsql-performance

From	Tom Lane
Subject	Re: parallel query evaluation
Date	November 10, 2012 15:32:42
Msg-id	9130.1352561545@sss.pgh.pa.us Whole thread Raw
In response to	parallel query evaluation (Oliver Seidel <postgresql@os10000.net>)
List	pgsql-performance

Tree view

Oliver Seidel <postgresql@os10000.net> writes:
> I have
>              create table x ( att bigint, val bigint, hash varchar(30)
> );
> with 693million rows.  The query

>              create table y as select att, val, count(*) as cnt from x
> group by att, val;

> ran for more than 2000 minutes and used 14g memory on an 8g physical
> RAM machine

What was the plan for that query?  What did you have work_mem set to?

I can believe such a thing overrunning memory if the planner chose to
use a hash-aggregation plan instead of sort-and-unique, but it would
only do that if it had made a drastic underestimate of the number of
groups implied by the GROUP BY clause.  Do you have up-to-date
statistics for the source table?

            regards, tom lane

pgsql-performance by date:

From: Petr Praus
Date: 10 November 2012, 14:28:17
Subject: Re: Re: Increasing work_mem and shared_buffers on Postgres 9.2 significantly slows down queries

From: Rafał Rzepecki
Date: 12 November 2012, 07:11:11
Subject: Planner sometimes doesn't use a relevant index with IN (subquery) condition

Re: parallel query evaluation - Mailing list pgsql-performance

Previous

Next