Re: Add progressive backoff to XactLockTableWait functions - Mailing list pgsql-hackers

From Fujii Masao
Subject Re: Add progressive backoff to XactLockTableWait functions
Date
Msg-id e2436db2-ba65-4194-b47f-6565c49b850d@oss.nttdata.com
Whole thread Raw
In response to Re: Add progressive backoff to XactLockTableWait functions  (Xuneng Zhou <xunengzhou@gmail.com>)
Responses Re: Add progressive backoff to XactLockTableWait functions
List pgsql-hackers

On 2025/07/04 17:57, Xuneng Zhou wrote:
> Hi,
>
> On Thu, Jul 3, 2025 at 9:30 AM Xuneng Zhou <xunengzhou@gmail.com <mailto:xunengzhou@gmail.com>> wrote:
>
>     Hi,
>
>
>          >>> On 2025-07-02 22:55:16 +0900, Fujii Masao wrote:
>          >>>> On 2025/06/24 1:32, Xuneng Zhou wrote:
>          >>>>> 3. The proposed solution
>          >>>>>
>          >>>>> If the above analysis is sound, one potential fix would be to add
>          >>>>> separate branching for standby in XactLockTableWait. However, this seems
>          >>>>> inconsistent with the function's definition—there's simply no lock entry
>          >>>>> in the lock table for waiting. We could implement a new function for
>          >>>>> this logic,
>          >>>>
>          >>>> To be honest, I'm fine with v3, since it only increases the sleep time
>          >>>> after 5000 loop iterations, which has negligible performance impact.
>          >>>
>          >>> I think this is completely the wrong direction. We should make
>          >>> XactLockTableWait() on standbys, not make the polling smarter.
>          >>
>          >> On standby, XactLockTableWait() can enter a busy loop with 1ms sleeps.
>          >
>          > Right.
>          >
>          >> But are you suggesting that this doesn't need to be addressed?
>          >
>          > No.
>          >
>          >> Or do you have another idea for how to handle it?
>          >
>          > We have all the information to make it work properly on standby. I've not find through the code to figure
outnot, but that's what needs to happen, instead on putting on another layer of hacks. 
>
>         Sorry, maybe I failed to get your point...
>         Could you explain your idea or reasoning in a bit more detail?
>
>
>     My take is that XactLockTableWait() isn’t really designed to work on standby. Instead of adding another layer on
toplike v4 did, maybe we can tweak the function itself to support standby. One possible approach could be to add a
checklike RecoveryInProgress() to handle the logic when running on a standby. 
>
>
> After thinking about this further, Andres's suggestion might be replacing polling(whether smart or not) with
event-drivenlike waiting in XactLockTableWait. To achieve this, implementing a new notification mechanism for standby
serversseems to be required. From what I can observe, the codebase appears to lack IPC infrastructure for waiting on
remotetransaction completion and receiving notifications when those transactions finish. I'm not familiar with this
area,so additional inputs would be very helpful here. 

Your guess might be right, or maybe not. It's hard for me to say for sure.
It seems better to wait for Andres to explain his idea in more detail,
rather than trying to guess...

Regards,

--
Fujii Masao
NTT DATA Japan Corporation




pgsql-hackers by date:

Previous
From: Fujii Masao
Date:
Subject: Re: Assertion failure during initdb with transaction_timeout set
Next
From: Álvaro Herrera
Date:
Subject: Re: [HACKERS] pg_upgrade failed with error - ERROR: column "a" in child table must be marked NOT NULL