Jehan
|
e6e51d9fe8
|
src: all language models now rebuilt after the fix.
|
2022-12-15 14:31:55 +01:00 |
|
Jehan
|
6bb1b3e101
|
scripts: all language models rebuilt with the new ratio data.
|
2022-12-14 20:16:44 +01:00 |
|
Jehan
|
eb8308d50a
|
src, script: regenerate all existing language models.
This ensures that a generic language model working with UTF-8 exists
for all 26 supported models, which had only single-byte encoding support
until now.
|
2022-12-14 00:23:13 +01:00 |
|
Jehan
|
fbd2efdbe9
|
LangModels: Romanian support added.
Encodings: ISO-8859-2, ISO-8859-16, Windows-1250 and IBM852.
Test texts from https://ro.wikipedia.org/wiki/Danemarca
|
2016-09-28 19:57:50 +02:00 |
|