Home > mailing lists

pgsql: Use a hash table to de-duplicate column names in ruleutils.c. - Mailing list pgsql-committers

From	Tom Lane
Subject	pgsql: Use a hash table to de-duplicate column names in ruleutils.c.
Date	September 10, 2024 23:49:13
Msg-id	E1so7nV-000UaJ-8F@gemulon.postgresql.org Whole thread Raw
List	pgsql-committers

Tree view

Use a hash table to de-duplicate column names in ruleutils.c.

Commit 8004953b5 added a hash table to avoid O(N^2) cost in choosing
unique relation aliases while deparsing a view or rule.  It did
nothing about the similar O(N^2) (maybe worse) costs of choosing
unique column aliases within each RTE.  However, that's now
demonstrably a bottleneck when deparsing CHECK constraints for wide
tables, so let's use a similar hash table to handle those.

The extra cost of setting up the hash table will not be repaid unless
the table has many columns.  I've set this up so that we use the brute
force method if there are less than 32 columns.  The exact cutoff is
not too critical, but this value seems good because it results in both
code paths getting exercised by existing regression-test cases.

Patch by me; thanks to David Rowley for review.

Discussion: https://postgr.es/m/2885468.1722291250@sss.pgh.pa.us

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/52c707483ce4d0161127e4958d981d1b5655865e

Modified Files
--------------
src/backend/utils/adt/ruleutils.c | 188 +++++++++++++++++++++++++++++++++-----
1 file changed, 165 insertions(+), 23 deletions(-)

pgsql-committers by date:

From: Tom Lane
Date: 10 September 2024, 23:20:39
Subject: pgsql: Fix some whitespace issues in XMLSERIALIZE(... INDENT).

From: Noah Misch
Date: 11 September 2024, 01:24:31
Subject: pgsql: Optimize pg_visibility with read streams.

pgsql: Use a hash table to de-duplicate column names in ruleutils.c. - Mailing list pgsql-committers

Previous

Next