3ec8b0d282d46d1f698b1f2aa27922cb8f26cb97 |
|
17-Nov-2015 |
Teemu Huovila <teemu.huovila@dovecot.fi> |
lib-fts: Add Norwegian.
Norwegian has two main written standards, Bokmål (nb) and Nynorsk (nn).
They are detected separately by libexttextcat, but the stemmer only
knows Norwegian. Thus they are treated as a single language,
Norwegian (no). This may also make more sense in everyday use,
where Norwegian text can mix both writing styles.
Caveat: The default normalizer filter does not modify U+00F8
(Latin Small Letter O with Stroke). In some configurations it
might be desirable to rewrite it to, e.g., "o". The same goes for
the uppercase version (U+00D8). This can be done by passing a
modified "id" setting to the normalizer filter. |
48afa4224df2a6bcfe75fec11a59c224426dcdc1 |
|
17-Nov-2015 |
Teemu Huovila <teemu.huovila@dovecot.fi> |
lib-fts: Add comment to language names.
Languages are defined by their ISO 639-1 code, which is two letters long.
It is possible that some languages with only a three-letter code, i.e.
an ISO 639-2 code, could be added in the future. |