uchardet/script/BuildLangModelLogs
Jehan 923d264470 LangModels: add Danish support (Windows-1252, ISO-8859-1 and ISO-8859-15).
Test for ISO-8859-1 is disabled for now since the difference is not big
enough, as for characters used in Danish, between ISO-8859-1 and
ISO-8859-15. Therefore the first to be declared "wins".
Let's see to improve this later.
Test contents from:
https://da.wikipedia.org/wiki/Eurosymbol
https://da.wikipedia.org/wiki/Dansk_%28sprog%29
2016-02-19 19:10:41 +01:00
..
LangArabicModel.log LangModels: add Arabic support. 2015-12-13 18:42:16 +01:00
LangDanishModel.log LangModels: add Danish support (Windows-1252, ISO-8859-1 and ISO-8859-15). 2016-02-19 19:10:41 +01:00
LangEsperantoModel.log LangModels: add Esperanto ISO-8859-3 language model. 2015-12-04 01:35:56 +01:00
LangFrenchModel.log Adding French Windows-1252 support. 2015-12-03 21:22:30 +01:00
LangGermanModel.log LangModels: adding German models for ISO-8859-1 and Windows-1252. 2015-12-03 23:58:41 +01:00
LangGreekModel.log LangModels: retraining Greek models with my training script. 2015-12-13 18:02:11 +01:00
LangHungarianModel.log BuildLangModel: forgot to add charset/language files. 2015-12-12 18:18:08 +01:00
LangSpanishModel.log LangModels: adding Spanish support. 2015-12-12 18:54:35 +01:00
LangThaiModel.log BuildLangModel: forgot to add logs for Thai models generation. 2015-12-04 03:26:52 +01:00
LangTurkishModel.log LangModels: adding Turkish models for ISO-8859-3 and ISO-8859-9. 2015-12-04 02:35:09 +01:00
LangVietnameseModel.log LangModels: add VISCII encoding support and retrain Vietnamese model. 2016-02-13 03:51:18 +01:00