Re: POSIX regex performance bug in 7.3 Vs. 7.2 - Mailing list pgsql-hackers

From Tom Lane
Subject Re: POSIX regex performance bug in 7.3 Vs. 7.2
Date
Msg-id 15066.1044377958@sss.pgh.pa.us
Whole thread Raw
In response to Re: POSIX regex performance bug in 7.3 Vs. 7.2  (Neil Conway <neilc@samurai.com>)
Responses Re: POSIX regex performance bug in 7.3 Vs. 7.2
Re: POSIX regex performance bug in 7.3 Vs. 7.2
List pgsql-hackers
Neil Conway <neilc@samurai.com> writes:
> Given that this problem isn't a regression, I don't think we need to
> delay 7.3.2 to fix it (of course, a fix for 7.3.3 and 7.4 is essential,
> IMHO).

No, I've had to abandon my original thought that it was a localized bug,
so it's not going to be fixed in 7.3.2.

The real problem is simply that we're up against design limitations of
the existing regex package, which was never designed for wider-than-8-bit
character sets.  It's been rather crudely hacked while it was in our
hands (Henry Spencer would probably disown the code if he saw it now ;-))
so that it sorta kinda does MULTIBYTE, but it's slow and I don't think
it's complete either.

I'm about to go off and look at whether we can absorb the Tcl regex
package, which is Spencer's new baby.  That will not be a solution for
7.3.anything, but it could be an answer for 7.4.
        regards, tom lane


pgsql-hackers by date:

Previous
From: Neil Conway
Date:
Subject: Re: POSIX regex performance bug in 7.3 Vs. 7.2
Next
From: Tom Lane
Date:
Subject: Re: [GENERAL] HELP NEEDED: Recreating DROP columns