Re: Index creation time and distribution - Mailing list pgsql-performance

From Tom Lane
Subject Re: Index creation time and distribution
Msg-id 29529.1211462076@sss.pgh.pa.us
In response to Index creation time and distribution  ("Guillaume Smet" <guillaume.smet@gmail.com>)
List pgsql-performance
"Guillaume Smet" <guillaume.smet@gmail.com> writes:
> I ran into a performance problem this morning when we imported a
> dump into an 8.1 database.
> The table has 5 million rows, and when the dump creates an index
> on a specific text column called clazz it takes 27 minutes, while on
> the other columns it only takes a couple of seconds:
> The only odd thing about this column is that 4.7 million rows
> have the exact same value.

Do you have maintenance_work_mem set large enough that the index
creation sort is done in-memory?  8.1 depends on the platform's qsort
and a lot of them are kinda pessimal for input like this.

8.2 (which uses our own qsort) seems to perform better in a quick
test.
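
As a rough illustration of why a column dominated by one value can hurt a sort (this is a sketch, not the platform's qsort and not PostgreSQL's code), the Python below counts comparisons for two quicksort variants on a scaled-down input with the same skew as the table above: 94% of the keys identical. The two-way (Lomuto-style) partition stands in, as an assumption, for a qsort that handles equal keys badly; the three-way partition stands in for one that groups equal keys together. The function names and the input generator are hypothetical.

```python
import random


def make_skewed_input(n=2000, dup_count=1880, seed=0):
    """Build n text keys, dup_count of which are identical (hypothetical data)."""
    values = ["same"] * dup_count + [f"k{i:05d}" for i in range(n - dup_count)]
    random.Random(seed).shuffle(values)
    return values


def count_comparisons_two_way(values):
    """Quicksort with a two-way partition; returns (sorted list, comparisons).

    Keys equal to the pivot all land on one side, so a run of identical
    keys shrinks by only one element per partitioning pass.
    """
    a = list(values)
    comparisons = 0
    stack = [(0, len(a) - 1)]  # explicit stack avoids Python's recursion limit
    while stack:
        lo, hi = stack.pop()
        if lo >= hi:
            continue
        mid = (lo + hi) // 2
        a[mid], a[hi] = a[hi], a[mid]  # use the middle element as pivot
        pivot = a[hi]
        i = lo
        for j in range(lo, hi):
            comparisons += 1
            if a[j] < pivot:
                a[i], a[j] = a[j], a[i]
                i += 1
        a[i], a[hi] = a[hi], a[i]  # put the pivot in its final position
        stack.append((lo, i - 1))
        stack.append((i + 1, hi))
    return a, comparisons


def count_comparisons_three_way(values):
    """Quicksort with a three-way partition; returns (sorted list, comparisons).

    Each element examination is counted as one three-way comparison
    (less / equal / greater). Keys equal to the pivot are grouped in the
    middle and never partitioned again.
    """
    a = list(values)
    comparisons = 0
    stack = [(0, len(a) - 1)]
    while stack:
        lo, hi = stack.pop()
        if lo >= hi:
            continue
        pivot = a[(lo + hi) // 2]
        lt, i, gt = lo, lo, hi
        while i <= gt:
            comparisons += 1
            if a[i] < pivot:
                a[lt], a[i] = a[i], a[lt]
                lt += 1
                i += 1
            elif a[i] > pivot:
                a[i], a[gt] = a[gt], a[i]
                gt -= 1
            else:
                i += 1
        stack.append((lo, lt - 1))
        stack.append((gt + 1, hi))
    return a, comparisons


if __name__ == "__main__":
    data = make_skewed_input()
    _, two_way = count_comparisons_two_way(data)
    _, three_way = count_comparisons_three_way(data)
    print(f"two-way partition:   {two_way} comparisons")
    print(f"three-way partition: {three_way} comparisons")
```

On the run of identical keys the two-way variant does work proportional to the square of the run length, while the three-way variant handles the whole run in a single pass. Whether a given platform's qsort behaves like the first variant depends on its implementation.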

            regards, tom lane
