mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-07 01:06:40 +08:00
Now making sure that we have a generic language model working with UTF-8 for all 26 supported models which had single-byte encoding support until now.
166 lines
5.5 KiB
Plaintext
166 lines
5.5 KiB
Plaintext
= Logs of language model for Latvian (lv) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2021-03-16 19:26:37.227238
|
|
- Maximum depth: 4
|
|
- Max number of pages: 100
|
|
|
|
== Parsed pages ==
|
|
|
|
Zigfrīds Anna Meierovics (revision 3325285)
|
|
1. Saeima (revision 3366185)
|
|
1. Saeimas deputāti (revision 3368139)
|
|
1. Saeimas frakcijas (revision 3366184)
|
|
1. Saeimas vēlēšanas (revision 3330484)
|
|
1887. gads (revision 2773799)
|
|
1919. gada Parīzes miera konference (revision 3359347)
|
|
1920 (revision 3362733)
|
|
1921 (revision 3340387)
|
|
1922 (revision 3337740)
|
|
1923 (revision 3347028)
|
|
1924 (revision 3347028)
|
|
1925 (revision 3347028)
|
|
22. augusts (revision 3327223)
|
|
31. jūlijs (revision 3347080)
|
|
5. februāris (revision 3364814)
|
|
Agrārā reforma Latvijā (revision 3328548)
|
|
Agudas Izrael (Latvija) (revision 3285729)
|
|
Aigars Kalvītis (revision 3169702)
|
|
Alberts Kviesis (revision 3379738)
|
|
Aleksandrs Bočagovs (revision 3004343)
|
|
Aleksandrs Dauge (revision 3062538)
|
|
Aleksandrs Jaunbērzs (revision 3373734)
|
|
Aleksandrs Kerenskis (revision 2758772)
|
|
Aleksandrs Millerāns (revision 3108576)
|
|
Aleksandrs Neibergs (deputāts) (revision 3349399)
|
|
Alfrēds Birznieks (revision 3300916)
|
|
Alfrēds Jēkabs Bērziņš (revision 3351998)
|
|
Alfrēds Riekstiņš (politiķis) (revision 3034089)
|
|
Amerikas Savienotās Valstis (revision 3355214)
|
|
Andrejs Bērziņš (politiķis) (revision 3089135)
|
|
Andrejs Kurcijs (revision 3223696)
|
|
Andrejs Petrevics (revision 2460269)
|
|
Andrejs Sīmanis (revision 3210302)
|
|
Andrejs Veckalns (revision 3237365)
|
|
Andrievs Niedra (revision 3374557)
|
|
Andris Bērziņš (politiķis, 1951) (revision 3231604)
|
|
Andris Šķēle (revision 3379347)
|
|
Angļu valoda (revision 3303218)
|
|
Ansis Buševics (revision 2927384)
|
|
Ansis Rudevics (revision 2700953)
|
|
Antante (revision 3373256)
|
|
Antons Dzenis (revision 2564295)
|
|
Antons Laizāns (revision 3360427)
|
|
Antons Rubins (1885) (revision 3351508)
|
|
Antons Velkme (revision 3279136)
|
|
Ants Pīps (revision 3375003)
|
|
Apollo (portāls) (revision 3232284)
|
|
Apolonija Laurinoviča (revision 3209013)
|
|
Aprīļa pučs (revision 3010427)
|
|
Apvienotā Karaliste (revision 3382180)
|
|
Aristīds Briāns (revision 2767296)
|
|
Arons Nuroks (revision 3062127)
|
|
Arturs Alberings (revision 3325257)
|
|
Arturs Krišjānis Kariņš (revision 3381504)
|
|
Arturs Ozols (inženieris) (revision 3352707)
|
|
Artūrs Balfūrs (revision 3177309)
|
|
Artūrs Reisners (revision 3300906)
|
|
Artūrs Vīgants (revision 3296217)
|
|
Artūrs Žers (revision 3296461)
|
|
Arveds Bergs (revision 3238379)
|
|
Arveds Švābe (revision 3340584)
|
|
Arvīds Kalniņš (ķīmiķis) (revision 3382254)
|
|
Aspazija (revision 3382469)
|
|
Augusts Briedis (revision 3163311)
|
|
Augusts Kalniņš (revision 3310251)
|
|
Augusts Kirhenšteins (revision 3302758)
|
|
Austroungārija (revision 3376635)
|
|
Autoritatīvā vadība (revision 2385793)
|
|
Balfūra nota (revision 3224093)
|
|
Baltijas Antante (revision 3236261)
|
|
Baltijas pārkrievošana (revision 3311586)
|
|
Bermontiāde (revision 3156269)
|
|
Bernards Kublinskis (revision 2441386)
|
|
Berta Vesmane (revision 3299697)
|
|
Bezpartijiskais nacionālais centrs (revision 3286113)
|
|
Beļģija (revision 3308106)
|
|
Brestļitovskas miera līgums (revision 3348377)
|
|
Brizules muiža (revision 3103947)
|
|
Bruno Kalniņš (revision 3297011)
|
|
Brīvības piemineklis (revision 3343774)
|
|
Bulduru konference (revision 3122422)
|
|
Bunds (revision 3368404)
|
|
Ceire-Cion (revision 3285715)
|
|
Celmiņa 1. Ministru kabinets (revision 2925529)
|
|
Delfi (portāls) (revision 3363824)
|
|
Demokrātiskais Centrs (revision 3286115)
|
|
Demokrātu savienība (revision 3339759)
|
|
Diena (laikraksts) (revision 3343800)
|
|
Donats Bicāns (revision 3311441)
|
|
Dubulti (Jūrmala) (revision 3349180)
|
|
Durbe (revision 3380441)
|
|
Dāvids Komisārs (revision 3082713)
|
|
Džovanni Džoliti (revision 3165202)
|
|
Ebreji (revision 3340750)
|
|
Ebreju bloks (revision 3285659)
|
|
Ebreju nacionāldemokrātu partija (revision 3368172)
|
|
Eduards Grantskalns (revision 2932497)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2021-03-16 19:30:28.292124
|
|
|
|
55 characters appeared 437791 times.
|
|
|
|
First 40 characters:
|
|
[ 0] Char a: 11.993622527644469 %
|
|
[ 1] Char i: 9.41179695334075 %
|
|
[ 2] Char s: 8.204599911830075 %
|
|
[ 3] Char e: 6.371761868106014 %
|
|
[ 4] Char t: 5.8011699646635035 %
|
|
[ 5] Char r: 5.772845947038655 %
|
|
[ 6] Char u: 4.945053690002764 %
|
|
[ 7] Char n: 4.437505567725239 %
|
|
[ 8] Char ā: 4.014015820334361 %
|
|
[ 9] Char l: 3.6974263975275874 %
|
|
[10] Char o: 3.597150238355745 %
|
|
[11] Char k: 3.5347917156816835 %
|
|
[12] Char m: 3.307971155185922 %
|
|
[13] Char d: 3.2337348186691823 %
|
|
[14] Char v: 2.977904982057648 %
|
|
[15] Char j: 2.8618678775945603 %
|
|
[16] Char p: 2.8296607285211435 %
|
|
[17] Char b: 2.040242946976982 %
|
|
[18] Char ī: 1.874638811670409 %
|
|
[19] Char g: 1.6240626234892905 %
|
|
[20] Char z: 1.5235580448204737 %
|
|
[21] Char ē: 1.5109949724868716 %
|
|
[22] Char c: 1.216105401892684 %
|
|
[23] Char š: 0.9225863482803439 %
|
|
[24] Char ņ: 0.45478321847639624 %
|
|
[25] Char f: 0.42691603984549703 %
|
|
[26] Char ļ: 0.3277819781585277 %
|
|
[27] Char ū: 0.29420431210326387 %
|
|
[28] Char h: 0.18616189003428577 %
|
|
[29] Char ž: 0.1815935000947941 %
|
|
[30] Char ķ: 0.126772820820894 %
|
|
[31] Char ģ: 0.11649394345703772 %
|
|
[32] Char č: 0.08382995538967224 %
|
|
[33] Char y: 0.029466115109721306 %
|
|
[34] Char w: 0.029466115109721306 %
|
|
[35] Char x: 0.012334652836627522 %
|
|
[36] Char é: 0.0050252289334408425 %
|
|
[37] Char ö: 0.0034262924546187568 %
|
|
[38] Char ü: 0.0027410339636950052 %
|
|
[39] Char q: 0.0025126144667204212 %
|
|
|
|
The first 40 characters have an accumulated ratio of 0.9998857902515126.
|
|
|
|
982 sequences found.
|
|
|
|
First 512 (typical positive ratio): 0.9904642991017133
|
|
Next 512 (512-1024): 0.001815935000947941
|
|
Rest: -5.377642775528102e-17
|
|
|
|
- Processing end: 2021-03-16 19:30:28.395006
|