Re: Full text: Ispell dictionary - Mailing list pgsql-general

From Oleg Bartunov
Subject Re: Full text: Ispell dictionary
Date
Msg-id CAF4Au4xc82caKcj3i1m2G=9E8f5N9vLMQk1Woqqhb2ZaovP1UQ@mail.gmail.com
Whole thread Raw
In response to Full text: Ispell dictionary  (Tim van der Linden <tim@shisaa.jp>)
Responses Re: Full text: Ispell dictionary
List pgsql-general
Yes, it's normal for ispell dictionary, think about morphological dictionary.

On Fri, May 2, 2014 at 11:54 AM, Tim van der Linden <tim@shisaa.jp> wrote:
> Good morning/afternoon all
>
> I am currently writing a few articles about PostgreSQL's full text capabilities and have a question about the Ispell
dictionarywhich I cannot seem to find an answer to. It is probably a very simple issue, so forgive my ignorance. 
>
> In one article I am explaining about dictionaries and I have setup a sample configuration which maps most token
categoriesto only use a Ispell dictionary (timusan_ispell) which has a default configuration: 
>
> CREATE TEXT SEARCH DICTIONARY timusan_ispell (
>         TEMPLATE = ispell,
>         DictFile = en_us,
>         AffFile = en_us,
>         StopWords = english
> );
>
> When I run a simple query like "SELECT to_tsvector('timusan-ispell','smiling')" I get back the following tsvector:
>
> 'smile':1 'smiling':1
>
> As you can see I get two lexemes with the same pointer.
> The question here is: why does this happen?
>
> Is it normal behavior for the Ispell dictionary to emit multiple lexemes for a single token? And if so, is this
efficient?I mean, why could it not simply save one lexeme 'smile' which (same as the snowball dictionary) would match
'smiling'as well if later matched with the accompanying tsquery? 
>
> Thanks!
>
> Cheers,
> Tim
>
>
> --
> Sent via pgsql-general mailing list (pgsql-general@postgresql.org)
> To make changes to your subscription:
> http://www.postgresql.org/mailpref/pgsql-general


pgsql-general by date:

Previous
From: Shaun Thomas
Date:
Subject: Re: Backups over slave instead master?
Next
From: gvim
Date:
Subject: Timeouts after upgrade from 9.0 to 9.3