mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-08 09:56:41 +08:00
250 lines
7.9 KiB
Plaintext
250 lines
7.9 KiB
Plaintext
= Logs of language model for Swedish (sv) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2022-12-14 18:18:23.937740
|
|
- Maximum depth: 4
|
|
- Max number of pages: 200
|
|
|
|
== Parsed pages ==
|
|
|
|
Kakapo (revision 49828871)
|
|
Ordning (biologi) (revision 49477220)
|
|
Könsdimorfism (revision 49227758)
|
|
Understam (revision 37821817)
|
|
Näbb (revision 50932877)
|
|
Ekosystem (revision 51621713)
|
|
Mana Island (revision 50706974)
|
|
Taiaha (revision 24936148)
|
|
Kelp (revision 49338041)
|
|
Sir David Attenborough (revision 51607859)
|
|
Maoripapegojor (revision 51427181)
|
|
Fasan (revision 49697043)
|
|
Ö (landområde) (revision 51150176)
|
|
Leicestershire (revision 47632046)
|
|
Proximal (revision 49685650)
|
|
Theropoder (revision 51150214)
|
|
Ekologi (revision 51390308)
|
|
Inlandsis (revision 51265091)
|
|
Årsmedeltemperatur (revision 29488423)
|
|
Britter (revision 49461730)
|
|
Hokkaido (revision 49200550)
|
|
Undersektion (revision 44004259)
|
|
National Library of Australia (revision 48833796)
|
|
Domän (biologi) (revision 48975224)
|
|
Vingtäckare (revision 51009246)
|
|
Sexuell läggning (revision 51518165)
|
|
England (revision 51638467)
|
|
Fett (revision 50390502)
|
|
Kemisk energi (revision 51629106)
|
|
Jämställdhet (revision 51603771)
|
|
International Commission on Zoological Nomenclature (revision 50077719)
|
|
Kaka (fågel) (revision 46220460)
|
|
1926 (revision 51173302)
|
|
Transperson (revision 51622251)
|
|
Land (revision 50379893)
|
|
Underserie (revision 44004261)
|
|
Infraklass (revision 44944834)
|
|
Nanofylum (revision 48212330)
|
|
Infrafylum (revision 48212330)
|
|
Habitat (revision 51634899)
|
|
Natriumglutamat (revision 51440450)
|
|
Suva (revision 49858077)
|
|
Internationella naturvårdsunionen (revision 49705198)
|
|
Kamouflage (revision 51589424)
|
|
Fauna (revision 50265422)
|
|
Solljus (revision 50272000)
|
|
Fiji Time (revision 51628863)
|
|
Internet Archive (revision 51051535)
|
|
Form (biologi) (revision 44857646)
|
|
Meter över havet (revision 49865837)
|
|
Serie (biologi) (revision 44004261)
|
|
Dicynodonter (revision 43852828)
|
|
Öken (revision 50057233)
|
|
Tredje könet (revision 51617056)
|
|
Division (biologi) (revision 46962848)
|
|
Neognathae (revision 49351226)
|
|
Könsmaktsordning (revision 49908397)
|
|
Smakförstärkare (revision 51533831)
|
|
Infraordning (revision 49477220)
|
|
Organism (revision 51537725)
|
|
Fisködlor (revision 51101913)
|
|
Nederbörd (revision 50294650)
|
|
Skrakar (revision 49421476)
|
|
Myrpiggsvin (revision 48865885)
|
|
Mansforskning (revision 47506745)
|
|
Fiji Summer Time (revision 51628863)
|
|
Rike (biologi) (revision 50937218)
|
|
Kvadratkilometer (revision 51146141)
|
|
Kannasläktet (revision 49955866)
|
|
Purdah (revision 49269676)
|
|
Litoral (revision 47388601)
|
|
Engelska kanalen (revision 50133974)
|
|
Undersläkte (revision 51622482)
|
|
Svärdfiskar (revision 51035233)
|
|
Klass (biologi) (revision 44944834)
|
|
Blodkärl (revision 47473904)
|
|
Flora (botanik) (revision 51339211)
|
|
Miljö (omgivning) (revision 51475610)
|
|
Anatomi (revision 51609030)
|
|
Artundergrupp (revision 51246830)
|
|
Överfamilj (revision 47122498)
|
|
Kruger nationalpark (revision 50511277)
|
|
Varietet (biologi) (revision 48198194)
|
|
Moaörn (revision 50941002)
|
|
Genusvetenskap (revision 51641870)
|
|
Latin (revision 51408565)
|
|
Överrike (revision 50937218)
|
|
Underrike (revision 50937218)
|
|
Östpapegojor (revision 46190135)
|
|
Biogas (revision 51329860)
|
|
Hedersdoktor (revision 51579005)
|
|
Underordning (revision 49477220)
|
|
Hormonterapi (transsexualism) (revision 49985314)
|
|
Serengeti (revision 50598959)
|
|
Underdivision (revision 46997002)
|
|
Sågtång (revision 43678985)
|
|
Årsnederbörd (revision 50582293)
|
|
Underfylum (revision 48212330)
|
|
Fåglar (revision 51631929)
|
|
Queer (revision 50491618)
|
|
Överhud (revision 49716509)
|
|
Trainee (revision 49688895)
|
|
Gasell (revision 46605384)
|
|
Bladtång (revision 50277892)
|
|
Area (revision 50460691)
|
|
Ryggradsdjur (revision 51433096)
|
|
Tidszon (revision 51455267)
|
|
Pleistocen (revision 49211710)
|
|
Stipendium (revision 49644010)
|
|
Undertribus (revision 46997009)
|
|
Binomial nomenklatur (revision 51484783)
|
|
Läderhud (revision 45323117)
|
|
Solförmörkelse (revision 51217582)
|
|
Vatten (revision 51556576)
|
|
Halvö (revision 51419101)
|
|
Intersektionalitet (revision 51228488)
|
|
Danmark (revision 51615196)
|
|
Vildmark (revision 49350253)
|
|
Växtriket (revision 51581458)
|
|
Plattektonik (revision 51390439)
|
|
Ocean (revision 51432545)
|
|
Brewster Kahle (revision 47526442)
|
|
Gemeinsame Normdatei (revision 46103091)
|
|
Biologi (revision 49616572)
|
|
Överklass (biologi) (revision 47122504)
|
|
Chengjiang (lagerstätte) (revision 51413388)
|
|
Biologism (revision 49913258)
|
|
RFSL (revision 51638725)
|
|
Erasistratos (revision 47910581)
|
|
Dimensionsanalys (revision 49247252)
|
|
Arkeologisk lokal (revision 50388755)
|
|
Borrflugor (revision 49571840)
|
|
Papua Nya Guinea (revision 51608607)
|
|
Nagel (revision 51401820)
|
|
Växter (revision 51581458)
|
|
Referensbibliotek (revision 43544193)
|
|
Femme (revision 48773869)
|
|
Civilekonomerna (revision 48828707)
|
|
Allians (biologi) (revision 51622482)
|
|
Aves (revision 51631929)
|
|
Farmakologi (revision 51164270)
|
|
Hen (revision 51606248)
|
|
Molekylär klocka (revision 47887818)
|
|
Kikunae Ikeda (revision 49340311)
|
|
Vatikanstatens bibliotek (revision 43158770)
|
|
Kina (revision 51635520)
|
|
Familj (biologi) (revision 50548234)
|
|
National- och universitetsbiblioteket i Zagreb (revision 43219495)
|
|
Kräftdjur (revision 49977078)
|
|
Begränsningsarea (revision 40757907)
|
|
Transgender Day of Remembrance (revision 51636616)
|
|
Tjeckiska nationalbiblioteket (revision 46514905)
|
|
Överfylum (revision 48212330)
|
|
Svavelväte (revision 51344105)
|
|
Djur (revision 51469052)
|
|
Rum (fysik) (revision 49290047)
|
|
Systematik (biologi) (revision 51506994)
|
|
Underfamilj (revision 50548234)
|
|
Gränsvärde (revision 47179480)
|
|
Plantae (revision 51581458)
|
|
Linjär algebra (revision 50044309)
|
|
Integrated Taxonomic Information System (revision 48591706)
|
|
Neuroanatomi (revision 49426339)
|
|
Tyrannosaurus (revision 51502373)
|
|
Zebror (revision 51635419)
|
|
Metangas (revision 51580655)
|
|
Pretegelen (revision 50032174)
|
|
Feminisering (revision 50209006)
|
|
Underklass (biologi) (revision 44944834)
|
|
Edmigasell (revision 48106386)
|
|
Förenta nationernas medlemsstater (revision 51630915)
|
|
Allosaurider (revision 40888601)
|
|
Elefant (revision 51244638)
|
|
Pelagial (revision 43975416)
|
|
Vetenskapligt namn (revision 46637057)
|
|
Adjunkt (lärare) (revision 47023760)
|
|
Kön (revision 51124326)
|
|
Arkiv (revision 51182072)
|
|
Charles III (revision 51633914)
|
|
Canna pedunculata (revision 46703358)
|
|
Jordens atmosfär (revision 50939215)
|
|
Kolesterol (revision 51581405)
|
|
Gödsel (revision 49711703)
|
|
Bro (revision 51285531)
|
|
Campechebukten (revision 49690649)
|
|
Auktorsnamn (revision 51253351)
|
|
Doktorsgrad (revision 51581730)
|
|
Shackletons shelfis (revision 47822557)
|
|
Leddjur (revision 50562856)
|
|
Ainu (revision 50015241)
|
|
Mittoceanisk rygg (revision 49691134)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2022-12-14 18:21:28.823200
|
|
|
|
52 characters appeared 872342 times.
|
|
|
|
Most Frequent characters:
|
|
[ 0] Char e: 10.07827205385044 %
|
|
[ 1] Char a: 9.501663338461292 %
|
|
[ 2] Char r: 8.93342290065135 %
|
|
[ 3] Char n: 8.460672534395913 %
|
|
[ 4] Char t: 7.666717869826284 %
|
|
[ 5] Char s: 6.2858374353177995 %
|
|
[ 6] Char i: 5.936318553961635 %
|
|
[ 7] Char l: 5.365326901605105 %
|
|
[ 8] Char o: 4.766020666206602 %
|
|
[ 9] Char d: 4.2648410829697525 %
|
|
[10] Char m: 3.3875475444263827 %
|
|
[11] Char k: 3.270620926196377 %
|
|
[12] Char g: 2.9386410375747127 %
|
|
[13] Char v: 2.3831249670427423 %
|
|
[14] Char f: 2.099176699046933 %
|
|
[15] Char ä: 1.9664305971740441 %
|
|
[16] Char u: 1.9017770553292173 %
|
|
[17] Char p: 1.8738063741055688 %
|
|
[18] Char h: 1.8481283716707437 %
|
|
[19] Char c: 1.475682702426342 %
|
|
[20] Char å: 1.3009805787179798 %
|
|
[21] Char b: 1.238161179904212 %
|
|
[22] Char ö: 1.233346554447682 %
|
|
[23] Char y: 0.7070621384732135 %
|
|
[24] Char j: 0.5712209202354123 %
|
|
[25] Char x: 0.32452868255798756 %
|
|
[26] Char w: 0.08425594548926912 %
|
|
[27] Char z: 0.07164621215073905 %
|
|
[28] Char q: 0.02372922546432477 %
|
|
|
|
The first 29 characters have an accumulated ratio of 0.9995896104968004.
|
|
The first 4 characters have an accumulated ratio of 0.36974030827358995.
|
|
All characters whose order is over 21 have an accumulated ratio of 0.030157896788186284.
|
|
|
|
886 sequences found.
|
|
|
|
First 482 (typical positive ratio): 0.9950244403710493
|
|
Next 121 (603-482): 0.003978503582736215
|
|
Rest: 0.0009970560462144729
|
|
|
|
- Processing end: 2022-12-14 18:21:28.869918
|