Thread: autoscale cluster

autoscale cluster

From
Jesus arteche
Date:
hi everyone,

I'm looking for info about autoscale a cluster. I mean...with amazon you can generate automatically virtual machine as far as you need...if you configure that when the machine get 90% busy a new one will be created. The thing is that i'd like to do something like that for my database...whem my datbase get 90% busy...automatically another machine is created to share the charge.


I dont know if it is possible..can anyone give some piece of advice???

thanks

Re: autoscale cluster

From
David Fetter
Date:
On Wed, Apr 14, 2010 at 06:24:00PM +0200, Jesus arteche wrote:
> hi everyone,
>
> I'm looking for info about autoscale a cluster.

Reassess this goal in the cold light of reason.

First, find out what trade-offs people make in order to get this
effect.  In the unlikely event that, after finding out how the trick
is accomplished, you still have this goal, you'll know how.

Cheers,
David.
--
David Fetter <david@fetter.org> http://fetter.org/
Phone: +1 415 235 3778  AIM: dfetter666  Yahoo!: dfetter
Skype: davidfetter      XMPP: david.fetter@gmail.com
iCal: webcal://www.tripit.com/feed/ical/people/david74/tripit.ics

Remember to vote!
Consider donating to Postgres: http://www.postgresql.org/about/donate

Re: autoscale cluster

From
John R Pierce
Date:
Jesus arteche wrote:
> hi everyone,
>
> I'm looking for info about autoscale a cluster. I mean...with amazon
> you can generate automatically virtual machine as far as you need...if
> you configure that when the machine get 90% busy a new one will be
> created. The thing is that i'd like to do something like that for my
> database...whem my datbase get 90% busy...automatically another
> machine is created to share the charge.
>
>
> I dont know if it is possible..can anyone give some piece of advice???

this raises far more questions than answers.   a few that spring to mind...

where will the data that these database nodes are operating on be
stored?   will parts of it be on each node, or will each node have a
complete replica of the data?

if the data is partitioned, how will a node efficiently do a query that
requires joining data on different nodes?   how will the clients know
which node to query?

if the data is replicated, what happens when two nodes try and do a
transaction that effects the same data?

if the data is stored on a shared file system (NAS, etc), how will the
nodes keep their in-memory caches coherent and keep from stepping on the
same blocks?

databases strive to be ACID, which stands for Atomicity, Consistency,
Isolation, and Durability

distributed multi-host databases like you describe are very very hard to
implement without violating one or another of those constraints.