Re: Fastest Index/Algorithm to find similar sentences - Mailing list pgsql-general

From Amit Langote
Subject Re: Fastest Index/Algorithm to find similar sentences
Date
Msg-id CA+HiwqGXXsX1OdZKv7m4241GiyYg4bDU4rXtaDCW-Ac36ab7ww@mail.gmail.com
Whole thread Raw
In response to Fastest Index/Algorithm to find similar sentences  ("Janek Sendrowski" <janek12@web.de>)
Responses Re: Fastest Index/Algorithm to find similar sentences
List pgsql-general
On Fri, Jul 26, 2013 at 7:54 AM, Janek Sendrowski <janek12@web.de> wrote:
> Hi,
>
> I'm searching for an algorithm/Index to find similar sentences in a database.
>
> The Fulltextsearch is not really suitable because it doesn't have a tolerance.
>
> The Levenshtein-distance ist to slow.
>
> I also tried pg_trgm module, which works with tri-grams, but it's also very slow with 100.000+ rows.
>
> I hope someone can help, I can't really find sth. which is fast enough.
>

Have you tried pg_bigm (a bi-gram based implementation)? It's still in
development phase, but you could give it a try and see if it can
perform better where pg_trgm can not.


--
Amit Langote


pgsql-general by date:

Previous
From: John R Pierce
Date:
Subject: Re: Tablespace on Postgrsql
Next
From: Samrat Revagade
Date:
Subject: Re: Speed up Switchover