mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-06 16:56:40 +08:00
Now making sure that we have a generic language model working with UTF-8 for all 26 supported models which had single-byte encoding support until now.
151 lines
4.6 KiB
Plaintext
151 lines
4.6 KiB
Plaintext
= Logs of language model for Swedish (sv) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2021-03-16 20:20:06.144954
|
|
- Maximum depth: 4
|
|
- Max number of pages: 100
|
|
|
|
== Parsed pages ==
|
|
|
|
Kakapo (revision 48946696)
|
|
Akut hotad (revision 45694757)
|
|
Aotearoa (revision 48764847)
|
|
Arkive (revision 45404194)
|
|
Art (revision 48819963)
|
|
Artepitet (revision 48819963)
|
|
Auckland (revision 48740415)
|
|
Auktorsnamn (revision 46648298)
|
|
BBC (revision 48945370)
|
|
Basalomsättning (revision 48638233)
|
|
Beilschmiedia tawa (revision 47662851)
|
|
Berguv (revision 47572081)
|
|
Betesmark (revision 47837257)
|
|
Biodiversity Heritage Library (revision 48152021)
|
|
Biotop (revision 48969696)
|
|
BirdLife International (revision 47616784)
|
|
British Museum (revision 48501908)
|
|
Bröstben (revision 48379566)
|
|
CITES (revision 47938046)
|
|
Dacrydium cupressinum (revision 47442085)
|
|
Digital object identifier (revision 47511062)
|
|
Djur (revision 48964290)
|
|
Djurpark (revision 48242363)
|
|
Domän (biologi) (revision 48975224)
|
|
Don Merton (revision 48407169)
|
|
Douglas Adams (revision 47251802)
|
|
Däggdjur (revision 48794669)
|
|
Ekologisk nisch (revision 48844778)
|
|
Ekosystem (revision 48570659)
|
|
Endemisk (revision 48546826)
|
|
Eukaryoter (revision 48898436)
|
|
Evolution (revision 49003401)
|
|
Familj (biologi) (revision 48771961)
|
|
Femininum (revision 46628147)
|
|
Fjäder (biologi) (revision 48641138)
|
|
Fjäderdräkt (revision 48641138)
|
|
Fladdermöss (revision 48746998)
|
|
Flygg (revision 48763776)
|
|
Fossilworks (revision 43519389)
|
|
Frukter (revision 48807025)
|
|
Frö (revision 46332448)
|
|
Fylum (revision 48212330)
|
|
Fågelläte (revision 48681377)
|
|
Fåglar (revision 48837894)
|
|
Fåglarnas liv (revision 48837894)
|
|
Genitiv (revision 48658908)
|
|
George Edward Grey (revision 46365447)
|
|
George Robert Gray (revision 43056128)
|
|
Global Biodiversity Information Facility (revision 40116158)
|
|
Haasts örn (revision 48440980)
|
|
Hauturu/Little Barrier Island (revision 20537378)
|
|
Hermelin (revision 48863152)
|
|
Hertz (revision 48548540)
|
|
Hjortdjur (revision 48740321)
|
|
Hund (revision 48989960)
|
|
Husdjur (revision 48155297)
|
|
Huskatt (revision 47647609)
|
|
Hāngi (revision 46574175)
|
|
IUCN (revision 49006187)
|
|
Iller (revision 48765500)
|
|
Inaturalist (revision 48552803)
|
|
Infraröd (revision 48615998)
|
|
Integrated Taxonomic Information System (revision 48591706)
|
|
Internationella naturvårdsunionen (revision 49006187)
|
|
Internet Archive (revision 48979443)
|
|
Jordbruk (revision 48448896)
|
|
Kahurangi National Park (revision 47659423)
|
|
Kamouflage (revision 47671382)
|
|
Kaniner (revision 48911042)
|
|
Kapiti Island (revision 48553791)
|
|
Katt (revision 48986224)
|
|
Kelp (revision 46077553)
|
|
Kivier (revision 48467049)
|
|
Klass (biologi) (revision 44944834)
|
|
Kroppsfett (revision 39272827)
|
|
Könsdimorfism (revision 48346350)
|
|
Könsfördelning (revision 45646592)
|
|
Lamm- och fårkött (revision 48351109)
|
|
Lek (fortplantningsbeteende) (revision 30508235)
|
|
Mandel (revision 48952857)
|
|
Maori (revision 48297968)
|
|
Maorier (revision 48066510)
|
|
Maoripapegojor (revision 46078328)
|
|
Mark Carwardine (revision 48869810)
|
|
Markpapegoja (revision 47342275)
|
|
Maskulinum (revision 46628162)
|
|
Masterton (revision 48262093)
|
|
Metrosideros umbellata (revision 46936435)
|
|
Milford Sound (revision 45323524)
|
|
Morrhår (revision 48980591)
|
|
Muskelmage (revision 41849238)
|
|
Mustela (revision 48294935)
|
|
Mårddjur (revision 48435918)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2021-03-16 20:24:13.933499
|
|
|
|
49 characters appeared 513356 times.
|
|
|
|
First 30 characters:
|
|
[ 0] Char a: 9.801969783152433 %
|
|
[ 1] Char e: 9.753075838209742 %
|
|
[ 2] Char r: 9.263357202409244 %
|
|
[ 3] Char n: 8.249635730370347 %
|
|
[ 4] Char t: 7.409088429861539 %
|
|
[ 5] Char s: 6.03207131113691 %
|
|
[ 6] Char i: 5.692346052252238 %
|
|
[ 7] Char l: 5.428981057979258 %
|
|
[ 8] Char o: 4.548890049010823 %
|
|
[ 9] Char d: 4.4466218374773065 %
|
|
[10] Char m: 3.3119316809387636 %
|
|
[11] Char k: 3.0742798369942106 %
|
|
[12] Char g: 3.073890243807416 %
|
|
[13] Char f: 2.2676271437365103 %
|
|
[14] Char v: 2.2645103982421557 %
|
|
[15] Char u: 2.116464987260303 %
|
|
[16] Char ä: 2.0311440793523405 %
|
|
[17] Char h: 1.9354989519943275 %
|
|
[18] Char p: 1.8753068046346004 %
|
|
[19] Char å: 1.4903887360817833 %
|
|
[20] Char c: 1.4510398242155542 %
|
|
[21] Char b: 1.3084487178488222 %
|
|
[22] Char ö: 1.2946181597176227 %
|
|
[23] Char j: 0.7221109717233265 %
|
|
[24] Char y: 0.6866579917250407 %
|
|
[25] Char x: 0.22323689603316216 %
|
|
[26] Char w: 0.12096868449964547 %
|
|
[27] Char z: 0.07947701010604727 %
|
|
[28] Char é: 0.01577852406517115 %
|
|
[29] Char q: 0.013635761537802226 %
|
|
|
|
The first 30 characters have an accumulated ratio of 0.9998305269637442.
|
|
|
|
752 sequences found.
|
|
|
|
First 512 (typical positive ratio): 0.996987580875875
|
|
Next 512 (512-1024): 0.012946181597176228
|
|
Rest: 4.640385298237959e-17
|
|
|
|
- Processing end: 2021-03-16 20:24:14.019931
|