uchardet/script/BuildLangModelLogs/LangLatvianModel.log
Jehan eb8308d50a src, script: regenerate all existing language models.
Now making sure that we have a generic language model working with UTF-8
for all 26 supported models which had single-byte encoding support until
now.
2022-12-14 00:23:13 +01:00

= Logs of language model for Latvian (lv) =
- Generated by BuildLangModel.py
- Started: 2021-03-16 19:26:37.227238
- Maximum depth: 4
- Max number of pages: 100
== Parsed pages ==
Zigfrīds Anna Meierovics (revision 3325285)
1. Saeima (revision 3366185)
1. Saeimas deputāti (revision 3368139)
1. Saeimas frakcijas (revision 3366184)
1. Saeimas vēlēšanas (revision 3330484)
1887. gads (revision 2773799)
1919. gada Parīzes miera konference (revision 3359347)
1920 (revision 3362733)
1921 (revision 3340387)
1922 (revision 3337740)
1923 (revision 3347028)
1924 (revision 3347028)
1925 (revision 3347028)
22. augusts (revision 3327223)
31. jūlijs (revision 3347080)
5. februāris (revision 3364814)
Agrārā reforma Latvijā (revision 3328548)
Agudas Izrael (Latvija) (revision 3285729)
Aigars Kalvītis (revision 3169702)
Alberts Kviesis (revision 3379738)
Aleksandrs Bočagovs (revision 3004343)
Aleksandrs Dauge (revision 3062538)
Aleksandrs Jaunbērzs (revision 3373734)
Aleksandrs Kerenskis (revision 2758772)
Aleksandrs Millerāns (revision 3108576)
Aleksandrs Neibergs (deputāts) (revision 3349399)
Alfrēds Birznieks (revision 3300916)
Alfrēds Jēkabs Bērziņš (revision 3351998)
Alfrēds Riekstiņš (politiķis) (revision 3034089)
Amerikas Savienotās Valstis (revision 3355214)
Andrejs Bērziņš (politiķis) (revision 3089135)
Andrejs Kurcijs (revision 3223696)
Andrejs Petrevics (revision 2460269)
Andrejs Sīmanis (revision 3210302)
Andrejs Veckalns (revision 3237365)
Andrievs Niedra (revision 3374557)
Andris Bērziņš (politiķis, 1951) (revision 3231604)
Andris Šķēle (revision 3379347)
Angļu valoda (revision 3303218)
Ansis Buševics (revision 2927384)
Ansis Rudevics (revision 2700953)
Antante (revision 3373256)
Antons Dzenis (revision 2564295)
Antons Laizāns (revision 3360427)
Antons Rubins (1885) (revision 3351508)
Antons Velkme (revision 3279136)
Ants Pīps (revision 3375003)
Apollo (portāls) (revision 3232284)
Apolonija Laurinoviča (revision 3209013)
Aprīļa pučs (revision 3010427)
Apvienotā Karaliste (revision 3382180)
Aristīds Briāns (revision 2767296)
Arons Nuroks (revision 3062127)
Arturs Alberings (revision 3325257)
Arturs Krišjānis Kariņš (revision 3381504)
Arturs Ozols (inženieris) (revision 3352707)
Artūrs Balfūrs (revision 3177309)
Artūrs Reisners (revision 3300906)
Artūrs Vīgants (revision 3296217)
Artūrs Žers (revision 3296461)
Arveds Bergs (revision 3238379)
Arveds Švābe (revision 3340584)
Arvīds Kalniņš (ķīmiķis) (revision 3382254)
Aspazija (revision 3382469)
Augusts Briedis (revision 3163311)
Augusts Kalniņš (revision 3310251)
Augusts Kirhenšteins (revision 3302758)
Austroungārija (revision 3376635)
Autoritatīvā vadība (revision 2385793)
Balfūra nota (revision 3224093)
Baltijas Antante (revision 3236261)
Baltijas pārkrievošana (revision 3311586)
Bermontiāde (revision 3156269)
Bernards Kublinskis (revision 2441386)
Berta Vesmane (revision 3299697)
Bezpartijiskais nacionālais centrs (revision 3286113)
Beļģija (revision 3308106)
Brestļitovskas miera līgums (revision 3348377)
Brizules muiža (revision 3103947)
Bruno Kalniņš (revision 3297011)
Brīvības piemineklis (revision 3343774)
Bulduru konference (revision 3122422)
Bunds (revision 3368404)
Ceire-Cion (revision 3285715)
Celmiņa 1. Ministru kabinets (revision 2925529)
Delfi (portāls) (revision 3363824)
Demokrātiskais Centrs (revision 3286115)
Demokrātu savienība (revision 3339759)
Diena (laikraksts) (revision 3343800)
Donats Bicāns (revision 3311441)
Dubulti (Jūrmala) (revision 3349180)
Durbe (revision 3380441)
Dāvids Komisārs (revision 3082713)
Džovanni Džoliti (revision 3165202)
Ebreji (revision 3340750)
Ebreju bloks (revision 3285659)
Ebreju nacionāldemokrātu partija (revision 3368172)
Eduards Grantskalns (revision 2932497)
== End of Parsed pages ==
- Wikipedia parsing ended at: 2021-03-16 19:30:28.292124
55 characters appeared 437791 times.
First 40 characters:
[ 0] Char a: 11.993622527644469 %
[ 1] Char i: 9.41179695334075 %
[ 2] Char s: 8.204599911830075 %
[ 3] Char e: 6.371761868106014 %
[ 4] Char t: 5.8011699646635035 %
[ 5] Char r: 5.772845947038655 %
[ 6] Char u: 4.945053690002764 %
[ 7] Char n: 4.437505567725239 %
[ 8] Char ā: 4.014015820334361 %
[ 9] Char l: 3.6974263975275874 %
[10] Char o: 3.597150238355745 %
[11] Char k: 3.5347917156816835 %
[12] Char m: 3.307971155185922 %
[13] Char d: 3.2337348186691823 %
[14] Char v: 2.977904982057648 %
[15] Char j: 2.8618678775945603 %
[16] Char p: 2.8296607285211435 %
[17] Char b: 2.040242946976982 %
[18] Char ī: 1.874638811670409 %
[19] Char g: 1.6240626234892905 %
[20] Char z: 1.5235580448204737 %
[21] Char ē: 1.5109949724868716 %
[22] Char c: 1.216105401892684 %
[23] Char š: 0.9225863482803439 %
[24] Char ņ: 0.45478321847639624 %
[25] Char f: 0.42691603984549703 %
[26] Char ļ: 0.3277819781585277 %
[27] Char ū: 0.29420431210326387 %
[28] Char h: 0.18616189003428577 %
[29] Char ž: 0.1815935000947941 %
[30] Char ķ: 0.126772820820894 %
[31] Char ģ: 0.11649394345703772 %
[32] Char č: 0.08382995538967224 %
[33] Char y: 0.029466115109721306 %
[34] Char w: 0.029466115109721306 %
[35] Char x: 0.012334652836627522 %
[36] Char é: 0.0050252289334408425 %
[37] Char ö: 0.0034262924546187568 %
[38] Char ü: 0.0027410339636950052 %
[39] Char q: 0.0025126144667204212 %
The first 40 characters have an accumulated ratio of 0.9998857902515126.
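The accumulated ratio above appears to be the sum of the 40 listed per-character percentages divided by 100. The following is a minimal sketch for re-checking that figure from a saved copy of this log; it is not taken from BuildLangModel.py. The names `CHAR_LINE`, `parse_char_frequencies` and `accumulated_ratio` are hypothetical helpers introduced for illustration, and the line pattern is inferred only from the entries shown in this log.

```python
import re

# Assumed line shape, inferred from this log: "[ 8] Char ā: 4.014015820334361 %".
CHAR_LINE = re.compile(r"^\[\s*(\d+)\]\s+Char\s+(\S):\s+([0-9.eE+-]+)\s*%$")


def parse_char_frequencies(log_text):
    """Return (rank, char, percent) tuples for every character line found."""
    rows = []
    for line in log_text.splitlines():
        match = CHAR_LINE.match(line.strip())
        if match:
            rank, char, percent = match.groups()
            rows.append((int(rank), char, float(percent)))
    return rows


def accumulated_ratio(rows):
    """Sum the percentages and convert the total to a 0..1 ratio."""
    return sum(percent for _, _, percent in rows) / 100.0


if __name__ == "__main__":
    # Three lines copied from the log; a full check would pass the whole file.
    sample = (
        "[ 0] Char a: 11.993622527644469 %\n"
        "[ 1] Char i: 9.41179695334075 %\n"
        "[ 2] Char s: 8.204599911830075 %\n"
    )
    rows = parse_char_frequencies(sample)
    print(len(rows), accumulated_ratio(rows))
```

Running the same two functions over the full character list should give a value close to the ratio reported in the log, up to floating-point rounding.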
982 sequences found.
First 512 (typical positive ratio): 0.9904642991017133
Next 512 (512-1024): 0.001815935000947941
Rest: -5.377642775528102e-17
- Processing end: 2021-03-16 19:30:28.395006