Re: PG 18 release notes draft committed - Mailing list pgsql-hackers

From Peter Geoghegan
Subject Re: PG 18 release notes draft committed
Date
Msg-id CAH2-Wzk1aYZpsXRP1iz7iBweg55dZPicPBUL50SmtzkF-Rm9ig@mail.gmail.com
Whole thread Raw
In response to Re: PG 18 release notes draft committed  (Bruce Momjian <bruce@momjian.us>)
Responses Re: PG 18 release notes draft committed
List pgsql-hackers
On Wed, Sep 17, 2025 at 6:58 AM Bruce Momjian <bruce@momjian.us> wrote:
> > In light of all this, I propose that we change the current feature
> > description, from:
> >
> > "This allows multi-column btree indexes to be used by queries that
> > only equality-reference the second or later indexed columns."
> >
> > to:
> >
> > "This allows multi-column btree indexes to be used by queries that
> > only specify conditions on the second or later indexed columns."
>
> I think your new text is inaccurate because you state here that the
> first column can be referenced and skip-scan still be used:

It's true that that's also possible. Skip scan will always "fill-in"
*any* index column that lacks a = condition that comes from the query
itself, by adding a skip array for that column during nbtree
preprocessing (note that a column that has a < or a > condition counts
as not having an = condition, and so will get its own skip array). A
skip array is an artificial "= ANY(<all possible column values>)"
condition/constraint that makes up for the fact that the user's query
did not provide us with a conventional = condition/constraint. nbtree
preprocessing always does this to the extent required to enable
repositioning the scan using *all* of the keys that actually come from
the query itself.

So we'll reliably read only those B-Tree leaf pages that might have
matching index tuples, no matter the details -- even with absurdly
complicated/unrealistic index scans. Note that adding a skip array
doesn't necessarily make us skip (skipping only happens when the scan
finds that it actually allows us to skip over something, otherwise we
just step to the next page on the leaf level as before). Skipping is
purely a runtime choice (adding skip arrays merely enables us to make
this choice, but it still has to make sense for us to do it).

To be clear, I don't think that the release notes need to go into
anything like this level of detail. My point is just that there are
just about no limitations. This isn't useful because we actually
expect users to have really complicated index scans, with varied
operators/conditions on only a subset of index columns (though some
may); it's useful because users don't have to think about it at all.

> I think we need to highlight new cases where indexes can now be used by
> skip scan:
>
> *  missing early indexed column references
> *  early indexed column references that use non-equality comparisons and
>    the comparisons are not sufficiently restrictive on their own to use
>    the index.

There isn't that much difference between these 2 things, from an
implementation point of view. They're both cases where a query/scan
initially lacks an = condition on a column where it'd be useful to
have one, so as to be able to use at least one later condition that
comes from the query to reposition the scan.

If there is an inequality condition on a column that gets a skip array
(because it didn't initially have a = condition), then the skip array
will only generate values that satisfy the inequalities. For example,
a skip array on "x" will only generate the values 1 and 2 given a qual
such as "WHERE x BETWEEN 1 AND 2 AND y = 66". The runtime behavior is
very similar to what we'd get in Postgres 17 for a qual "WHERE x =
ANY('{1, 2}') AND y = 66".

> And, at the same time, not fall into the trip of saying the later column
> references must be equality-only.

Right. This later condition could be a simple scalar equality or
inequality condition, it could be an IN() condition, or a row compare
inequality -- it could be anything.

> Here is what I have so far:
>
>         Previously, multi-column btree indexes could only be used by
>         queries that either equality-referenced the first indexed column
>         or referenced that column in a restrictive-enough way for index
>         lookups to be efficient.

It's hard to pin this down, since it has always been possible to make
*some* use of index columns, even in these tricky cases (we at least
didn't have to go to the heap to eliminate non-matching rows). Skip
scan just makes the B-Tree code find the most efficient way of
*navigating through the index*, by skipping over groups of
provably-irrelevant index leaf pages. This makes it much more likely
that the optimizer will actually choose such an index scan in the
first place.

I suggest the following alternative, which has the merit of being a
bit less verbose:

"Skip scan allows B-Tree index scans to find the most efficient way of
navigating through a multicolumn index when one or more of its columns
initially lacks a = condition and comes before a column that is
directly used by the query. Such an index scan can now be broken down
into multiple "index searches" by generating an implementation level =
condition on any underspecified columns. This allows the scan to skip
over irrelevant sections of the index, though only when the generated
= condition is on a column that has relatively few distinct values."

Note that "index searches" is a term that now appears in EXPLAIN ANALYZE output.

> I apologize for people who got the wrong impression of the feature and I
> hope they see this email thread or the updated text.

It's tough to get this right. There are many ways in which this could
be unnecessarily confusing, or misleading.

--
Peter Geoghegan



pgsql-hackers by date:

Previous
From: Robert Haas
Date:
Subject: Re: Schedule for PG 18 RC and GA releases
Next
From: Sami Imseih
Date:
Subject: Re: pg_stat_statements: faster search by queryid