Thread: service outage on panora.postgresql.org
Hi all!

We are currently experiencing an outage of one of our VM hosts (panora.postgresql.org), which is affecting the following production services (among some internal systems):

brekka.postgresql.org (aka buildfarm.postgresql.org)

We are working on the issue, but atm I have no ETA for a solution because it is not entirely clear what the actual issue is (other than that the box has no working network atm).

Stefan
On 11/13/21 10:46 PM, Stefan Kaltenbrunner wrote:
> Hi all!
>
> We are currently experiencing an outage of one of our
> VM hosts (panora.postgresql.org), which is affecting the following
> production services (among some internal systems):
>
> brekka.postgresql.org (aka buildfarm.postgresql.org)
>
> We are working on the issue, but atm I have no ETA for a solution
> because it is not entirely clear what the actual issue is (other than
> that the box has no working network atm).

To be more specific on this: those systems lost IPv4 connectivity but are still reachable over IPv6. However, all services are impaired because they cannot connect to any IPv4 host (including DNS).

Stefan
On 11/13/21 11:29 PM, Stefan Kaltenbrunner wrote:
> On 11/13/21 10:46 PM, Stefan Kaltenbrunner wrote:
>> [...]
>
> To be more specific on this: those systems lost IPv4 connectivity but
> are still reachable over IPv6. However, all services are impaired
> because they cannot connect to any IPv4 host (including DNS).

Services should be back for now. The root cause is still somewhat unclear, and we will likely need a few more reboots of the box in the next few days to nail this down...

Sorry for the inconvenience :/

Stefan
On 11/14/21 9:52 AM, Stefan Kaltenbrunner wrote:
> On 11/13/21 11:29 PM, Stefan Kaltenbrunner wrote:
>> [...]
>
> Services should be back for now. The root cause is still somewhat
> unclear, and we will likely need a few more reboots of the box in the
> next few days to nail this down...
>
> Sorry for the inconvenience :/

We have found the root cause of this issue: it is caused by https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=949062

We have locally backported the fixes to the affected package, and the service should now be stable again.

Also, special thanks to the great support team from Equinix Metal, who helped in diagnosing the issue!

Stefan