b6b06530d654f0436bfbaefc1e988d53fff0cbee |
|
01-Jun-2015 |
Timo Sirainen <tss@iki.fi> |
lib-fts: tokenizers - Fixed removal of trailing character in truncated tokens.
If the token is truncated, we don't want to remove the trailing character
since it's not actually there.
Also we don't want to remove trailing apostrophes from a truncated word,
because they're not actually at the end of the (untruncated) token there.
This doesn't make a big difference, but it's slightly more correct. |
e3f8b4fd89a11442c3208cd3210cfaccd4835386 |
|
01-Jun-2015 |
Timo Sirainen <tss@iki.fi> |
lib-fts: simple tokenizer cleanup - make prev_letter updating more explicit.
It was now hidden inside one of the functions, which didn't make the
prev_letter very consistent when a word break was found. It didn't actually
matter what the prev_letter was at that point, but now the behavior is more
consistent. |