Re: Parallel Scaling of a pgplsql problem - Mailing list pgsql-performance

From Greg Spiegelberg
Subject Re: Parallel Scaling of a pgplsql problem
Date
Msg-id CAEtnbpU3=saTXKcmr5n3uZ9Z+OTWjWkt7MsdRxiyOst1F0ckKQ@mail.gmail.com
In response to Parallel Scaling of a pgplsql problem  (Venki Ramachandran <venki_ramachandran@yahoo.com>)
List pgsql-performance
On Wed, Apr 25, 2012 at 12:52 PM, Venki Ramachandran <venki_ramachandran@yahoo.com> wrote:

Now I have to run the same PL/pgSQL code on all possible combinations of employees, and with 542 employees that is about 300,000 unique pairs.

So (300000 * 40)/(1000 * 60 * 60) = 3.33 hours, and I have to rank them and show the result on a screen. No user wants to wait 3 hours; they can probably wait 10 minutes (even that is too much for a UI application). How do I solve this scaling problem? Can I have multiple parallel sessions, each with multiple processes that handle one pair each at 40 ms, and then collate the results? Does PostgreSQL or PL/pgSQL have any parallel computing capability?
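For reference, the arithmetic above can be checked with a few lines of Python. This is only a back-of-the-envelope sketch: the names `serial_hours` and `sessions_needed` are made up for illustration, and the session estimate assumes perfectly linear scaling with no contention, which a real server is unlikely to deliver.

```python
from math import comb

EMPLOYEES = 542        # from the original post
MS_PER_PAIR = 40       # from the original post
TARGET_MINUTES = 10    # the wait the poster considers barely tolerable

# Exact pair counts, for comparison with the rounded 300,000 figure.
unordered_pairs = comb(EMPLOYEES, 2)          # (a, b) same as (b, a)
ordered_pairs = EMPLOYEES * (EMPLOYEES - 1)   # (a, b) distinct from (b, a)


def serial_hours(pairs, ms_per_pair=MS_PER_PAIR):
    """Wall-clock hours if every pair runs one after another."""
    return pairs * ms_per_pair / (1000 * 60 * 60)


def sessions_needed(pairs, target_minutes=TARGET_MINUTES, ms_per_pair=MS_PER_PAIR):
    """Sessions required to finish in time, ASSUMING ideal linear scaling."""
    total_ms = pairs * ms_per_pair
    budget_ms = target_minutes * 60 * 1000
    return (total_ms + budget_ms - 1) // budget_ms  # integer ceiling


if __name__ == "__main__":
    print(unordered_pairs, ordered_pairs)
    print(round(serial_hours(300_000), 2))
    print(sessions_needed(300_000))
```

Under those assumptions, the 10-minute target would take on the order of 20 fully independent sessions, which is why reducing the work per request is worth considering before adding parallelism.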

Interesting problem.  

How frequently does the data change?  Hourly, daily, monthly?
How granular are the time frames in the typical query?  Seconds, minutes, hours, days, weeks?

I'm thinking that if you can prepare the data ahead of time, as it changes, via a trigger or client-side code, then your problem will go away pretty quickly.
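To illustrate the idea of preparing results as the data changes, here is a rough client-side sketch in Python. Everything in it is hypothetical: `score_pair` is a stand-in for the real 40 ms PL/pgSQL computation, `PairCache` is an invented name, and the sketch assumes the pair result is symmetric and depends only on the two employees involved (it ignores the time-frame question entirely).

```python
from itertools import combinations


def score_pair(a, b, data):
    """Hypothetical placeholder for the real per-pair computation."""
    return abs(data[a] - data[b])


class PairCache:
    """Keeps every pair's result ready so the UI only has to sort and display."""

    def __init__(self, data):
        self.data = dict(data)
        # One-time full computation over all unordered pairs.
        self.scores = {
            pair: score_pair(*pair, self.data)
            for pair in combinations(sorted(self.data), 2)
        }

    def update(self, employee, value):
        """Refresh only the pairs that involve the changed employee.

        Returns the number of pairs recomputed.
        """
        self.data[employee] = value
        refreshed = 0
        for other in self.data:
            if other == employee:
                continue
            pair = tuple(sorted((employee, other)))
            self.scores[pair] = score_pair(*pair, self.data)
            refreshed += 1
        return refreshed

    def ranked(self):
        """All pairs, highest score first."""
        return sorted(self.scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    cache = PairCache({"a": 1, "b": 4, "c": 6})
    print(cache.update("a", 10))   # pairs recomputed for one change
    print(cache.ranked()[0])       # current top-ranked pair
```

With 542 employees, a change to one employee touches 541 pairs, roughly 541 * 40 ms or about 22 seconds of work done in the background, instead of the full 3.33 hours at request time. A trigger that maintains a results table would follow the same pattern on the server side.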

-Greg
