Re: invalid UTF-8 via pl/perl - Mailing list pgsql-hackers

From Andrew Dunstan
Subject Re: invalid UTF-8 via pl/perl
Date
Msg-id 4B40F60D.5050200@dunslane.net
Whole thread Raw
In response to Re: invalid UTF-8 via pl/perl  (Andrew Dunstan <andrew@dunslane.net>)
Responses Re: invalid UTF-8 via pl/perl
Re: invalid UTF-8 via pl/perl
Re: invalid UTF-8 via pl/perl
List pgsql-hackers

I wrote:
>
> I think the attached patch plugs the direct SPI holes as well.

There are two issues with this patch. First, how far if at all should it 
be backpatched? All the way, or 8.3, where we tightened the encoding 
rules, or not at all?

Second, It produces errors like this:
   andrew=# select 'a' || invalid_utf_seq() || 'b';   ERROR:  invalid byte sequence for encoding "UTF8": 0xd0   HINT:
Thiserror can also happen if the byte sequence does not   match the encoding expected by the server, which is
controlledby   "client_encoding".   CONTEXT:  PL/Perl function "invalid_utf_seq"   andrew=#
 


That hint seems rather misleading. I'm not sure what we can do about it 
though. If we set the noError param on pg_verifymbstr() we would miss 
the error message that actually identified the bad data, so that doesn't 
seem like a good plan.

cheers

andrew



pgsql-hackers by date:

Previous
From: Andrew Dunstan
Date:
Subject: Re: PATCH: Add hstore_to_json()
Next
From: "David E. Wheeler"
Date:
Subject: Re: PATCH: Add hstore_to_json()