mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-06 16:56:40 +08:00
Now making sure that we have a generic language model working with UTF-8 for all 26 supported models which had single-byte encoding support until now.
157 lines
4.8 KiB
Plaintext
157 lines
4.8 KiB
Plaintext
= Logs of language model for Slovak (sk) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2021-03-16 20:04:01.478267
|
|
- Maximum depth: 4
|
|
- Max number of pages: 100
|
|
|
|
== Parsed pages ==
|
|
|
|
Dôkaz (matematika) (revision 7170221)
|
|
1825 (revision 6937105)
|
|
1839 (revision 6804159)
|
|
1847 (revision 7167629)
|
|
1852 (revision 6923466)
|
|
1878 (revision 7159904)
|
|
1955 (revision 7061181)
|
|
1976 (revision 7100059)
|
|
1983 (revision 7174204)
|
|
1993 (revision 7122277)
|
|
1995 (revision 7133683)
|
|
2012 (revision 7135523)
|
|
Adrien-Marie Legendre (revision 6556308)
|
|
Algebraická geometria (revision 5964212)
|
|
Algebraická rovnica (revision 6586551)
|
|
Algebrické číslo (revision 6382942)
|
|
Algoritmus (revision 7100698)
|
|
Andrew Wiles (revision 6813255)
|
|
Arabi (revision 7124298)
|
|
Arabčina (revision 7148041)
|
|
Aristoteles (revision 7150270)
|
|
Arthur Cayley (revision 6332355)
|
|
Axióma (revision 7073489)
|
|
Babylonia (revision 6432954)
|
|
Bernard Bolzano (revision 6903631)
|
|
Boh (revision 7166677)
|
|
Bolzanova veta (revision 6852875)
|
|
Bytie (revision 6569833)
|
|
Byzantská ríša (revision 7168566)
|
|
Caroline Blundenová (revision 7170221)
|
|
Cauchyho postupnosť (revision 6215169)
|
|
Celé číslo (revision 7047567)
|
|
Charles Hermite (revision 6412828)
|
|
Daniel Marcus (revision 5291472)
|
|
David Hilbert (revision 5968866)
|
|
Dedukcia (revision 6338099)
|
|
Definícia (revision 6965423)
|
|
Derivácia (funkcia) (revision 7014993)
|
|
Desiatková číselná sústava (revision 7047888)
|
|
Diofantická rovnica (revision 6060359)
|
|
Dynastia Chan (revision 7025657)
|
|
Dôkaz (logika) (revision 5495754)
|
|
Dôkaz sporom (revision 7051518)
|
|
Energia (revision 6975312)
|
|
Eric Weisstein (revision 6054413)
|
|
Ernst Kummer (revision 6001344)
|
|
Európa (revision 7164742)
|
|
Experiment (revision 6354302)
|
|
Fenomén (filozofia) (revision 6558128)
|
|
Filozofia (revision 6942330)
|
|
Formula (logika) (revision 3916562)
|
|
Formálny dôkaz (revision 7170221)
|
|
Formálny jazyk (revision 6505890)
|
|
Gabriel Cramer (revision 7068001)
|
|
Galoisova teória (revision 6749172)
|
|
Gentzenovský kalkul (revision 7170221)
|
|
Geometria (revision 7010499)
|
|
Geometrický dôkaz (revision 7170221)
|
|
Georg Ferdinand Cantor (revision 6697670)
|
|
Giordano Bruno (revision 7072808)
|
|
Gottlob Frege (revision 6580699)
|
|
Gödelova veta o neúplnosti (revision 6968373)
|
|
Hardvér (revision 6946820)
|
|
Henri Poincaré (revision 6830074)
|
|
Hilbertovský kalkul (revision 7170221)
|
|
Hmotnosť (revision 7021343)
|
|
Hypotéza (revision 6850461)
|
|
Idea (revision 6113421)
|
|
India (revision 6976622)
|
|
Intuícia (revision 5837951)
|
|
Jazyk (lingvistika) (revision 6462864)
|
|
John Taylor (revision 6741201)
|
|
Kardinálne číslo (revision 7154031)
|
|
Kenneth Appel (revision 5968422)
|
|
Klasická mechanika (revision 6295646)
|
|
Konečná množina (revision 6850487)
|
|
Konfucianizmus (revision 6948500)
|
|
Kresťanstvo (revision 7150939)
|
|
Latinčina (revision 7110742)
|
|
Leonhard Euler (revision 7016638)
|
|
Lineárna algebra (revision 6564030)
|
|
Logická axióma (revision 5495754)
|
|
Logický kalkul (revision 1608550)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2021-03-16 20:13:09.022092
|
|
|
|
64 characters appeared 535286 times.
|
|
|
|
First 46 characters:
|
|
[ 0] Char o: 8.787265125559047 %
|
|
[ 1] Char a: 8.624174740232323 %
|
|
[ 2] Char e: 8.577470735270492 %
|
|
[ 3] Char n: 6.100103496074995 %
|
|
[ 4] Char i: 5.884891441210867 %
|
|
[ 5] Char t: 5.302772723366575 %
|
|
[ 6] Char r: 5.02273550961542 %
|
|
[ 7] Char s: 4.340670221152805 %
|
|
[ 8] Char k: 4.253240323864252 %
|
|
[ 9] Char v: 4.073896944810811 %
|
|
[10] Char l: 3.6208680966810265 %
|
|
[11] Char d: 3.3796886150581185 %
|
|
[12] Char m: 3.248356953105443 %
|
|
[13] Char p: 2.8470761424733695 %
|
|
[14] Char u: 2.6178528861206907 %
|
|
[15] Char c: 2.426740097816868 %
|
|
[16] Char z: 2.104856095619912 %
|
|
[17] Char h: 2.080570013039758 %
|
|
[18] Char j: 2.0389100406138025 %
|
|
[19] Char á: 1.675926514050433 %
|
|
[20] Char b: 1.6690143213160817 %
|
|
[21] Char y: 1.6607944164427988 %
|
|
[22] Char ý: 1.2490519086992748 %
|
|
[23] Char í: 1.1096871578931637 %
|
|
[24] Char č: 0.9322119390381964 %
|
|
[25] Char é: 0.8785957413420117 %
|
|
[26] Char ž: 0.7489454235679618 %
|
|
[27] Char ú: 0.702615050645823 %
|
|
[28] Char f: 0.6794498641847535 %
|
|
[29] Char š: 0.6790762321450589 %
|
|
[30] Char g: 0.6219105300717748 %
|
|
[31] Char ť: 0.4550838243481055 %
|
|
[32] Char ô: 0.38428055282596596 %
|
|
[33] Char ľ: 0.3648516867618432 %
|
|
[34] Char ó: 0.23090460053130477 %
|
|
[35] Char x: 0.22922325635267876 %
|
|
[36] Char ň: 0.09434209002290364 %
|
|
[37] Char w: 0.08855079340763629 %
|
|
[38] Char ä: 0.07005600744275023 %
|
|
[39] Char ď: 0.06706695112519288 %
|
|
[40] Char q: 0.018121153925191393 %
|
|
[41] Char ĺ: 0.010274881091603367 %
|
|
[42] Char ě: 0.010274881091603367 %
|
|
[43] Char ö: 0.010088065071756034 %
|
|
[44] Char ř: 0.007285824774046024 %
|
|
[45] Char ŕ: 0.006351744674809354 %
|
|
|
|
The first 46 characters have an accumulated ratio of 0.9998617561453131.
|
|
|
|
1198 sequences found.
|
|
|
|
First 512 (typical positive ratio): 0.9724967373205526
|
|
Next 512 (512-1024): 0.007489454235679618
|
|
Rest: 0.00042527339003644096
|
|
|
|
- Processing end: 2021-03-16 20:13:09.628753
|