mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-07 17:26:41 +08:00
Right now, each time we add new language or new charset support, we have too many pieces of code not to forget to edit. The script script/BuildLangModel.py will now take care of the main parts: listing the sequence models, listing the generic language models and computing the numbers for each listing. Furthermore the script will now end with a TODO list of the parts which are still to be done manually (2 functions to edit and a CMakeLists). Finally the script now allows to give a list of languages to edit rather of having to run it with languages one by one. It also allows 2 special code: "none", which will retrain none of the languages, but will re-generate only the new generated listings; and "all" which will retrain all models (useful in particulare when we change the model formats or usage and want to regenerate everything).
37 lines
108 B
Plaintext
37 lines
108 B
Plaintext
ar
|
|
be
|
|
bg
|
|
cs
|
|
da
|
|
de
|
|
el
|
|
en
|
|
eo
|
|
es
|
|
et
|
|
fi
|
|
fr
|
|
ga
|
|
he
|
|
hi
|
|
hr
|
|
hu
|
|
it
|
|
lt
|
|
lv
|
|
mk
|
|
mt
|
|
no
|
|
pl
|
|
pt
|
|
ro
|
|
ru
|
|
sk
|
|
sl
|
|
sr
|
|
sv
|
|
th
|
|
tr
|
|
uk
|
|
vi
|