Re: [GENERAL] Text search dictionary vs. the C locale - Mailing list pgsql-general
From | Tom Lane
---|---
Subject | Re: [GENERAL] Text search dictionary vs. the C locale
Date |
Msg-id | 28705.1499011609@sss.pgh.pa.us
In response to | [GENERAL] Text search dictionary vs. the C locale (twoflower <standa.kurik@gmail.com>)
Responses |
Re: [GENERAL] Text search dictionary vs. the C locale
Re: [GENERAL] Text search dictionary vs. the C locale
Re: [GENERAL] Text search dictionary vs. the C locale
twoflower <standa.kurik@gmail.com> writes:
> I am having problems creating an Ispell-based text search dictionary for
> Czech language.
>
> Issuing the following command:
>
> create text search dictionary czech_ispell (
>     template = ispell,
>     dictfile = czech_ispell,
>     affFile = czech_ispell
> );
>
> ends with
>
> ERROR: syntax error
> CONTEXT: line 252 of configuration file
> "/usr/share/postgresql/9.6/tsearch_data/czech_ispell.affix": " . > TŘIA
>
> The dictionary files are in UTF-8. The database cluster was initialized with
> initdb --locale=C --encoding=UTF8

Presumably the problem is that the dictionary file parsing functions
reject anything that doesn't satisfy t_isalpha() (unless it matches
t_isspace()) and in C locale that's not going to accept very much.

I wonder why we're doing it like that. It seems like it'd often be
useful to load dictionary files that don't match the database's
prevailing locale. Do we really need the t_isalpha tests, or would
it be good enough to assume that anything that isn't t_isspace is
part of a word?

regards, tom lane
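
To make the two policies discussed above concrete, here is a rough Python model of them. It is only a sketch and not PostgreSQL source code: it assumes that under the C locale classification works byte by byte and recognizes only ASCII letters and ASCII whitespace. The names `c_locale_isalpha`, `c_locale_isspace` and `split_words` are hypothetical stand-ins, not the real t_isalpha()/t_isspace() functions.

```python
# Illustrative model only; this is NOT how PostgreSQL is implemented.
# Assumption: in the C locale only ASCII letters count as alphabetic
# and only ASCII whitespace counts as space, checked byte by byte.

ASCII_LETTERS = frozenset(b"ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz")
ASCII_SPACES = frozenset(b" \t\n\v\f\r")


def c_locale_isalpha(byte):
    """Hypothetical stand-in for an alphabetic test under the C locale."""
    return byte in ASCII_LETTERS


def c_locale_isspace(byte):
    """Hypothetical stand-in for a whitespace test under the C locale."""
    return byte in ASCII_SPACES


def split_words(line, require_alpha):
    """Split a UTF-8 line into words.

    require_alpha=True  models the strict rule: every byte must be a
                        letter or whitespace, otherwise it is an error.
    require_alpha=False models the relaxed rule: anything that is not
                        whitespace is part of a word.
    """
    words = []
    current = bytearray()
    for byte in line.encode("utf-8"):
        if c_locale_isspace(byte):
            # Whitespace ends the word collected so far, if any.
            if current:
                words.append(current.decode("utf-8"))
                current.clear()
        elif require_alpha and not c_locale_isalpha(byte):
            raise ValueError("syntax error at byte 0x%02X" % byte)
        else:
            current.append(byte)
    if current:
        words.append(current.decode("utf-8"))
    return words
```

With the strict rule, `split_words("TŘIA", require_alpha=True)` raises an error at the first byte of "Ř", because that byte is not an ASCII letter. With the relaxed rule the same word passes through intact, since UTF-8 multibyte sequences never contain ASCII whitespace bytes.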