Solved: questions about tsearch2 (for czech language) - Mailing list pgsql-general
From | Pavel Stehule |
---|---|
Subject | Solved: questions about tsearch2 (for czech language) |
Date | |
Msg-id | Pine.LNX.4.44.0312230858550.3176-100000@kix.fsv.cvut.cz |
In response to | Re: questions about tsearch2 (for czech language) (Oleg Bartunov <oleg@sai.msu.su>) |
Responses | Re: Solved: questions about tsearch2 (for czech language) |
List | pgsql-general |
Oleg,

you are right. After restarting the postmaster, everything works fine.

tsearch2=# select to_tsvector('default_czech','Jmenuji se Pavel Stěhule');
            to_tsvector
------------------------------------
 'pavel':3 'stěhule':4 'jmenovat':1

Thank you very much
Pavel Stehule

On Mon, 22 Dec 2003, Oleg Bartunov wrote:

> Pavel,
>
> did you restart the psql session after modifying the tsearch2 configuration?
> btw, there is a czech dictionary available from
> http://lingucomponent.openoffice.org/download_dictionary.html
> We have a utility to convert myspell dicts to ispell ones. It's included
> in 7.5 development. A patch for 7.4 can be downloaded from
> http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/
>
> Also, historically, we use the openfts mailing list for discussion of
> tsearch2.
>
> Oleg
> On Mon, 22 Dec 2003, Pavel Stehule wrote:
>
> > > > result. Why? Do I have a problem with my configuration?
> > >
> > > did you specify stop words in the dictionary configuration?
> > >
> > > select * from pg_ts_dict;
> > >
> > tsearch2=# select * from pg_ts_dict where dict_name ='cz_ispell';
> > -[ RECORD 1 ]---+---------------------------------------------------------
> > dict_name       | cz_ispell
> > dict_init       | 173405
> > dict_initoption | DictFile="/usr/lib/ispell/czech",AffFile="/usr/lib/ispell/czech.aff",StopFile="/usr/local/pgsql/share/contrib/czech.stop"
> > dict_lexize     | 173406
> > dict_comment    |
> >
> > [postgres@usop root]$ cat /usr/local/pgsql/share/contrib/czech.stop|grep -e "^[sv]."
> > se
> > sem
> > si
> > svůj
> > ve
> > vám
> > váš
> > viz
> > vy
> >
> > > >
> > > > 2. I use a small Czech dictionary. I need words that aren't in the
> > > > dictionary (in my sample, Stěhule) to be kept rather than erased.
> > > > Can I set this somewhere? I tried adding the simple dict to the
> > > > cfg map, but without success.
> > > >
> > >
> > > Example, please! What do you mean by 'erase words'?
> > >
> > >
> > > > tsearch2=# select * from ts_debug('jmenuji se Pavel Stěhule a bydlím ve Skalici.');
> > > >     ts_name    | tok_type | description |  token  |     dict_name      | tsvector
> > > > ---------------+----------+-------------+---------+--------------------+----------
> > > >  default_czech | word     | Word        | Stěhule | {cz_ispell,simple} |
> > > >  default_czech | lword    | Latin word  | a       | {cz_ispell,simple} |
> > > >  default_czech | word     | Word        | bydlím  | {cz_ispell,simple} | 'bydlet'
> > > >
> > >
> > >
> > If tsearch doesn't find a word in the dictionary, it erases the word
> > from the result. True? My surname, for example, isn't in the dictionary,
> > but I want to keep this word in the result (tsvector).
> >
> > I use
> >
> > tsearch2=# select version();
> >                                 version
> > -------------------------------------------------------------------------------------------------------
> >  PostgreSQL 7.4RC2 on i686-pc-linux-gnu, compiled by GCC gcc (GCC) 3.3 20030715 (Red Hat Linux 3.3-14)
> >
> >
>
>       Regards,
>               Oleg
> _____________________________________________________________
> Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
> Sternberg Astronomical Institute, Moscow University (Russia)
> Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
> phone: +007(095)939-16-83, +007(095)939-23-83
>
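
For readers following the thread, the behaviour discussed above (a token is offered to each dictionary of the configuration in order, a stop word is dropped but still uses up a position, and a word unknown to cz_ispell falls through to simple) can be modelled roughly as below. This is only a toy sketch in Python, not tsearch2 code: the function names mirror the dictionary names in the thread, and the word lists are small stand-ins taken from the output quoted above, not the real czech.stop or ispell files.

```python
# Toy model of a dictionary chain as discussed in the thread.
# NOT tsearch2 source code; the word lists are illustrative stand-ins.

STOP_WORDS = {"se", "ve"}  # two entries from the czech.stop excerpt above
ISPELL_LEMMAS = {"jmenuji": "jmenovat", "bydlím": "bydlet"}  # from the quoted output


def cz_ispell(token):
    """Return [] for a stop word, [lemma] for a known word, None if unknown."""
    word = token.lower()
    if word in STOP_WORDS:
        return []
    if word in ISPELL_LEMMAS:
        return [ISPELL_LEMMAS[word]]
    return None


def simple(token):
    """Accept every token and return it lowercased."""
    return [token.lower()]


def to_tsvector(text, dictionaries):
    """Map each lexeme to its positions; stop words still use up a position."""
    vector = {}
    for position, token in enumerate(text.split(), start=1):
        for dictionary in dictionaries:
            lexemes = dictionary(token)
            if lexemes is None:
                continue  # unknown to this dictionary, ask the next one
            for lexeme in lexemes:
                vector.setdefault(lexeme, []).append(position)
            break  # the first dictionary that recognizes the token wins
    return vector


# With simple as a fallback, the surname is kept.
print(to_tsvector("Jmenuji se Pavel Stěhule", [cz_ispell, simple]))
# {'jmenovat': [1], 'pavel': [3], 'stěhule': [4]}

# With cz_ispell alone, words unknown to it disappear.
print(to_tsvector("Jmenuji se Pavel Stěhule", [cz_ispell]))
# {'jmenovat': [1]}
```

In this model the first call yields the same lexemes and positions as the to_tsvector output at the top of the message, which is consistent with the {cz_ispell,simple} mapping shown by ts_debug.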