On Sun, May 4, 2025 at 01:31:51AM +0300, Alexander Borisov wrote:
> Hi hackers,
>
> > I will continue improving it until beta 1, and until the final release.
> > I will probably add markup in 1-3 weeks. Let the feedback begin. ;-)
> >
> > You can see the most current HTML-built version here:
> >
> > https://momjian.us/pgsql_docs/release-18.html
> >
>
>
> I'm not sure, but I'll ask.
>
> I have significantly improved the handling of Unicode Case in
> PostgreSQL.
> The improvements affect important functions such as lower(), upper(),
> casefold().
> Specifically, the patch has significantly reduced the size of Unicode
> Case tables (and consequently the size of the object file).
> We got a significant speed gain:
> ASCII by ≈10%
> Cyrillic by ≈80%
> Unicode in general by ≈30%
>
> But, unfortunately, I didn't see any mention of this improvement in the
> release notes.
> Hence the question to the community - are such improvements worth
> mentioning?
>
> I'm just new to the community, and want to understand.
>
> Commit: https://git.postgresql.org/gitweb/?p=postgresql.git;a=commitdiff;h=27bdec06841d1bb004ca7627eac97808b08a7ac7
>
> I am now actively working on a major improvement to Unicode
> Normalization Forms.

Given the performance numbers above, which were not in the commit, maybe
I should add this improvement to the case folding item and add his name
as a co-author.

--
Bruce Momjian <bruce@momjian.us> https://momjian.us
EDB https://enterprisedb.com
Do not let urgent matters crowd out time for investment in the future.