mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-06 16:56:40 +08:00
Now making sure that we have a generic language model working with UTF-8 for all 26 supported models which had single-byte encoding support until now.
148 lines
4.5 KiB
Plaintext
148 lines
4.5 KiB
Plaintext
= Logs of language model for Maltese (mt) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2021-03-16 19:30:28.553074
|
|
- Maximum depth: 4
|
|
- Max number of pages: 100
|
|
|
|
== Parsed pages ==
|
|
|
|
Unjoni Ewropea (revision 255663)
|
|
1951 (revision 229183)
|
|
1952 (revision 229184)
|
|
1957 (revision 229188)
|
|
1958 (revision 229189)
|
|
1973 (revision 252982)
|
|
1979 (revision 252967)
|
|
1981 (revision 253774)
|
|
1985 (revision 252978)
|
|
1986 (revision 252978)
|
|
1990 (revision 257440)
|
|
1992 (revision 249582)
|
|
1995 (revision 252258)
|
|
1 ta' Mejju (revision 258193)
|
|
2007 (revision 258027)
|
|
2013 (revision 248708)
|
|
Albanija (revision 261944)
|
|
Awstrija (revision 261959)
|
|
Awtonomija (revision 262074)
|
|
Ażores (revision 255663)
|
|
Bank Ċentrali Ewropew (revision 255748)
|
|
Belt kapitali (revision 255506)
|
|
Belġju (revision 255745)
|
|
Brussell (revision 243311)
|
|
Bulgarija (revision 261950)
|
|
Danimarka (revision 256058)
|
|
Dazji doganali (revision 255663)
|
|
De facto (revision 215102)
|
|
Dħul nazzjonali gross (revision 255663)
|
|
Estonja (revision 255711)
|
|
European Free Trade Association (revision 255663)
|
|
Ewropa (revision 259973)
|
|
Ex Repubblika Jugoslava tal-Maċedonja (revision 255663)
|
|
Federazzjoni (revision 228364)
|
|
Finlandja (revision 258210)
|
|
Frankfurt (revision 261246)
|
|
Franza (revision 259635)
|
|
Greċja (revision 259971)
|
|
Groenlandja (revision 250685)
|
|
Indja (revision 254565)
|
|
Islanda (revision 255630)
|
|
Isle of Man (revision 259978)
|
|
Istati Membri (revision 255663)
|
|
Istitut tal-Unjoni Ewropea għall-Istudji dwar is-Sigurtà (revision 256700)
|
|
Italja (revision 254814)
|
|
Kilometru kwadru (revision 247665)
|
|
Komunitajiet Ewropej (revision 256698)
|
|
Komunità Ekonomika Ewropea (revision 255663)
|
|
Kroazja (revision 249144)
|
|
Kummissjoni Ewropea (revision 258115)
|
|
Kunsill Ewropew (revision 255754)
|
|
Kunsill tal-Ewropa (revision 255754)
|
|
Kunsill tal-Unjoni Ewropea (revision 255754)
|
|
Latvja (revision 255712)
|
|
Lista ta' pajjiżi skont id-daqs (revision 254529)
|
|
Lista ta' pajjiżi skont il-popolazzjoni (revision 260622)
|
|
Litwanja (revision 259637)
|
|
Liġijiet tal-Unjoni Ewropea (revision 255663)
|
|
Lussemburgu (revision 253431)
|
|
Lussemburgu (belt) (revision 243587)
|
|
Madejra (revision 243625)
|
|
Malta (revision 261973)
|
|
Montenegro (revision 255647)
|
|
Norveġja (revision 261168)
|
|
Olanda (revision 261407)
|
|
Organizzazzjoni Internazzjonali (revision 258039)
|
|
Organizzazzjonijiet mhux governattivi (revision 233500)
|
|
Pajjiżi l-Baxxi (revision 261407)
|
|
Pajjiżi membri tal-Unjoni Ewropea (revision 255663)
|
|
Pajjiżi ġirien li jdawru l-Unjoni Ewropea (revision 255663)
|
|
Parlament Ewropew (revision 255748)
|
|
Politika agrikola komuni (revision 255745)
|
|
Politika reġjonali tal-Unjoni Ewropea (revision 255663)
|
|
Polonja (revision 261762)
|
|
Portugall (revision 243625)
|
|
Qorti tal-Ġustizzja tal-Unjoni Ewropea (revision 255663)
|
|
Relazzjonijiet ta' terzi pajjiżi ma l-UE (revision 255663)
|
|
Renju Unit (revision 254529)
|
|
Repubblika Federali tal-Ġermanja (revision 258687)
|
|
Repubblika tal-Irlanda (revision 250619)
|
|
Repubblika Ċeka (revision 255669)
|
|
Rumanija (revision 261954)
|
|
Segretarjat tal-Parlament Ewropew (revision 255663)
|
|
Serbja (revision 259975)
|
|
Slovakkja (revision 255727)
|
|
Slovenja (revision 261963)
|
|
Spanja (revision 258290)
|
|
Stati membri tal-Unjoni Ewropea (revision 255663)
|
|
Strasburgu (revision 243503)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2021-03-16 19:33:28.445834
|
|
|
|
49 characters appeared 643393 times.
|
|
|
|
First 31 characters:
|
|
[ 0] Char i: 12.115145797358691 %
|
|
[ 1] Char a: 12.109705887381429 %
|
|
[ 2] Char t: 8.033037350421903 %
|
|
[ 3] Char l: 7.963095650714261 %
|
|
[ 4] Char e: 6.5463876666361 %
|
|
[ 5] Char n: 5.990118014961307 %
|
|
[ 6] Char r: 5.530834186881113 %
|
|
[ 7] Char u: 4.447514971409388 %
|
|
[ 8] Char o: 3.9081867536637795 %
|
|
[ 9] Char j: 3.7945703481386963 %
|
|
[10] Char m: 3.619405246870886 %
|
|
[11] Char s: 3.4255890256810377 %
|
|
[12] Char k: 2.5824029792055554 %
|
|
[13] Char d: 2.3040350143691337 %
|
|
[14] Char p: 2.1852895508654897 %
|
|
[15] Char b: 2.0524003214209667 %
|
|
[16] Char f: 1.9347428399127748 %
|
|
[17] Char ħ: 1.6223365812186332 %
|
|
[18] Char g: 1.4863388317871036 %
|
|
[19] Char w: 1.4324060100125429 %
|
|
[20] Char z: 1.3761417982477273 %
|
|
[21] Char ż: 0.9421924080616357 %
|
|
[22] Char h: 0.9235412881395973 %
|
|
[23] Char ġ: 0.7990450626599915 %
|
|
[24] Char ċ: 0.6618039052336597 %
|
|
[25] Char v: 0.6143989754318122 %
|
|
[26] Char x: 0.610357899448704 %
|
|
[27] Char q: 0.5511405936962324 %
|
|
[28] Char c: 0.24153200299039623 %
|
|
[29] Char à: 0.08936994962643362 %
|
|
[30] Char y: 0.061082417744675495 %
|
|
|
|
The first 31 characters have an accumulated ratio of 0.9995414933019164.
|
|
|
|
888 sequences found.
|
|
|
|
First 512 (typical positive ratio): 0.9960434044151966
|
|
Next 512 (512-1024): 0.009421924080616357
|
|
Rest: 1.5612511283791264e-17
|
|
|
|
- Processing end: 2021-03-16 19:33:28.518739
|