mirror of
https://gitlab.freedesktop.org/uchardet/uchardet.git
synced 2025-12-06 16:56:40 +08:00
228 lines
7.5 KiB
Plaintext
228 lines
7.5 KiB
Plaintext
= Logs of language model for Romanian (ro) =
|
|
|
|
- Generated by BuildLangModel.py
|
|
- Started: 2022-12-15 00:01:17.765077
|
|
- Maximum depth: 4
|
|
- Max number of pages: 200
|
|
|
|
== Parsed pages ==
|
|
|
|
The Loving Kind (revision 15340411)
|
|
Limba engleză (revision 15174203)
|
|
Whole Lotta History (revision 15221846)
|
|
The Promise (revision 15302845)
|
|
Chemistry (revision 13003795)
|
|
Untouchable (revision 12020867)
|
|
31 ianuarie (revision 14777533)
|
|
Neil Tennant (revision 13355922)
|
|
Dance (revision 14700085)
|
|
The Guardian (revision 15212051)
|
|
Billboard (revision 13092896)
|
|
Sound of the Underground (cântec) (revision 15206321)
|
|
Compozitor (revision 15313365)
|
|
The Show (revision 10112441)
|
|
Compact Disc (revision 13258410)
|
|
Gen muzical (revision 15348917)
|
|
Disc single (revision 13271042)
|
|
29 noiembrie (revision 15270237)
|
|
Zimbabwe (revision 15223871)
|
|
Republica Irlanda (revision 15335833)
|
|
Limba pali (revision 14710607)
|
|
1954 (revision 15272524)
|
|
Call the Shots (revision 15311533)
|
|
Limbi indo-iraniene (revision 13016907)
|
|
Casă de discuri (revision 15244458)
|
|
Mary Higgins Clark (revision 14158157)
|
|
See the Day (revision 10112431)
|
|
Mai (revision 15170552)
|
|
Normanzi (revision 15181050)
|
|
Listă de limbi (revision 15276205)
|
|
5 decembrie (revision 15333253)
|
|
The Sound of Girls Aloud (revision 10112480)
|
|
22 mai (revision 14998993)
|
|
2009 (revision 15348935)
|
|
Biblioteca Nacional de España (revision 15237290)
|
|
Can't Speak French (revision 15243027)
|
|
Bibliothèque nationale de France (revision 15237314)
|
|
MSN Search (revision 15237622)
|
|
5 septembrie (revision 15347684)
|
|
27 aprilie (revision 14912864)
|
|
Limba franceză (revision 15326202)
|
|
Uniunea Europeană (revision 15216020)
|
|
2005 (revision 15348977)
|
|
Irlanda (revision 15335833)
|
|
Statele Unite ale Americii (revision 15339104)
|
|
Consoană oclusivă (revision 13880727)
|
|
Contratenor (revision 14250562)
|
|
3 septembrie (revision 15102675)
|
|
Mixed Up (revision 10112443)
|
|
Sri Lanka (revision 15339014)
|
|
Anglia (revision 15109546)
|
|
Girls Aloud (revision 15319932)
|
|
Pet Shop Boys (revision 13165657)
|
|
Regatul Unit al Marii Britanii și al Irlandei de Nord (revision 15335741)
|
|
Noua Zeelandă (revision 15181159)
|
|
1921 (revision 15196999)
|
|
Parlophone (revision 15295705)
|
|
1834 (revision 15086768)
|
|
Something Kinda Ooooh (revision 15206082)
|
|
1987 (revision 15272755)
|
|
No Good Advice (revision 10112436)
|
|
Limba maghiară (revision 15329180)
|
|
1935 (revision 14962293)
|
|
Biology (revision 10112430)
|
|
Muzică pop (revision 15177633)
|
|
Tangled Up (revision 13010794)
|
|
26 aprilie (revision 14916666)
|
|
British Broadcasting Corporation (revision 14882345)
|
|
Girls A Live (revision 10112444)
|
|
13 aprilie (revision 15215645)
|
|
Wake Me Up (revision 10112439)
|
|
Sexy! No No No... (revision 12017812)
|
|
I Think We're Alone Now (revision 15152417)
|
|
1725 (revision 14748670)
|
|
1903 (revision 14907631)
|
|
ASP.NET (revision 13678267)
|
|
Al Doilea Război Mondial (revision 15346198)
|
|
Dick Durock (revision 14802579)
|
|
Life Got Cold (revision 10112437)
|
|
MusicBrainz (revision 15177442)
|
|
Nicolae Popovici (jurist) (revision 15200517)
|
|
National Diet Library (revision 12675764)
|
|
BBC Three (revision 15290069)
|
|
Friedrich Schleiermacher (revision 14711103)
|
|
The Beatles (revision 15302748)
|
|
20 ianuarie (revision 14947182)
|
|
2 iunie (revision 15000116)
|
|
Universal Media Disc (revision 13269523)
|
|
Castelul Bunratty (revision 8799348)
|
|
Londra (revision 15290324)
|
|
23 noiembrie (revision 15307048)
|
|
20 iulie (revision 15036777)
|
|
2001 (revision 15111207)
|
|
Florida (revision 15142921)
|
|
Uzbekistan (revision 15298947)
|
|
1938 (revision 15163477)
|
|
23 iunie (revision 14994443)
|
|
28 aprilie (revision 15140389)
|
|
1811 (revision 14233359)
|
|
Crișana (revision 15314665)
|
|
Pronunție (revision 14476477)
|
|
I'll Stand By You (cântec de Girls Aloud) (revision 10112432)
|
|
2019 (revision 15344837)
|
|
Calendarul armean (revision 14268830)
|
|
2002 (revision 15294674)
|
|
Australia (revision 15309171)
|
|
Serghei Prokofiev (revision 15269322)
|
|
Limbi ugrice (revision 15165135)
|
|
WorldCat Identities (revision 13000969)
|
|
Rwanda (revision 14914537)
|
|
Brașov (revision 15335383)
|
|
Rudolf Hess (revision 15198812)
|
|
Limba daneză (revision 14842105)
|
|
Lista țărilor după indicele dezvoltării umane (revision 15314050)
|
|
Postpoziție (revision 15346785)
|
|
16 iunie (revision 14987301)
|
|
9 mai (revision 14936959)
|
|
Erasmus din Rotterdam (revision 15139499)
|
|
1939 (revision 15344797)
|
|
Lista orașelor din Statele Unite ale Americii după populație (revision 14835883)
|
|
Benjamin Henry Latrobe (revision 15309615)
|
|
Wayback Machine (revision 15154168)
|
|
Love Machine (revision 10112433)
|
|
Discografia formației Girls Aloud (revision 15316070)
|
|
Anii 1950 (revision 15053828)
|
|
Ross Brawn (revision 14956382)
|
|
Djibouti (revision 15324881)
|
|
Cuvânt (revision 12985155)
|
|
1 martie (revision 15348743)
|
|
Domenico Scarlatti (revision 15271887)
|
|
Hawaii (revision 15282894)
|
|
Listă de compozitori de muzică cultă (revision 14649633)
|
|
Premiul Nobel pentru Fizică (revision 15191205)
|
|
4 mai (revision 15222001)
|
|
27 octombrie (revision 15314197)
|
|
8 noiembrie (revision 15277041)
|
|
Accidentul nuclear de la Cernobîl (revision 15345489)
|
|
Consoană africată dentală surdă (revision 14997698)
|
|
Extended play (revision 14728849)
|
|
Divertisment (revision 12383285)
|
|
25 decembrie (revision 15332780)
|
|
15 ianuarie (revision 14749726)
|
|
Iulian Cristache (revision 11040565)
|
|
Stat unitar (revision 15207224)
|
|
Limba marathi (revision 15165081)
|
|
Consoană fricativă laterală alveolară sonoră (revision 13946216)
|
|
James Abbott McNeill Whistler (revision 15285621)
|
|
Regatul Unit (revision 15335741)
|
|
Albania (revision 15331282)
|
|
Henry Cowell (revision 15119343)
|
|
Limba valenciană (revision 15165114)
|
|
Bronx (revision 15211973)
|
|
Integrated Authority File (revision 15145168)
|
|
Menuet (revision 14224105)
|
|
Jocurile Olimpice de vară (revision 15157901)
|
|
Acuzativ (revision 15315694)
|
|
National and University Library in Zagreb (revision 14932231)
|
|
20 septembrie (revision 15109959)
|
|
1 ianuarie (revision 14833650)
|
|
12 septembrie (revision 15058394)
|
|
ITunes (revision 14303931)
|
|
Tokyo (revision 15215196)
|
|
Paris (revision 15295690)
|
|
Universal Music Group (revision 15070153)
|
|
Premiul Nobel pentru Literatură (revision 15129756)
|
|
Flandra (revision 15318704)
|
|
|
|
== End of Parsed pages ==
|
|
|
|
- Wikipedia parsing ended at: 2022-12-15 00:04:31.827603
|
|
|
|
68 characters appeared 1537323 times.
|
|
|
|
Most Frequent characters:
|
|
[ 0] Char i: 11.164992652812714 %
|
|
[ 1] Char e: 11.007836349290292 %
|
|
[ 2] Char a: 10.768654342646276 %
|
|
[ 3] Char r: 7.448857527012866 %
|
|
[ 4] Char n: 7.210586194313101 %
|
|
[ 5] Char t: 6.14821999020375 %
|
|
[ 6] Char l: 5.709080004657447 %
|
|
[ 7] Char u: 5.164171745300109 %
|
|
[ 8] Char o: 5.019569732580596 %
|
|
[ 9] Char c: 4.31893622875609 %
|
|
[10] Char s: 3.679578071752 %
|
|
[11] Char d: 3.4889219767088635 %
|
|
[12] Char m: 3.302168769998237 %
|
|
[13] Char p: 2.6401088125267105 %
|
|
[14] Char ă: 2.0153864867695335 %
|
|
[15] Char b: 1.5493165717289081 %
|
|
[16] Char g: 1.3016783070311184 %
|
|
[17] Char f: 1.1247473692906436 %
|
|
[18] Char v: 0.9899025774024066 %
|
|
[19] Char ș: 0.92596025688811 %
|
|
[20] Char ț: 0.8636441398456929 %
|
|
[21] Char î: 0.8400967135728796 %
|
|
[22] Char z: 0.793652342416005 %
|
|
[23] Char h: 0.719497464098306 %
|
|
[24] Char â: 0.4213818436333809 %
|
|
[25] Char k: 0.327907668069755 %
|
|
[26] Char j: 0.2703400651652255 %
|
|
[27] Char x: 0.23144127811787113 %
|
|
[28] Char y: 0.2307907967291194 %
|
|
[29] Char w: 0.18811921762700487 %
|
|
[30] Char é: 0.02998719202145548 %
|
|
[31] Char q: 0.02205131907868418 %
|
|
|
|
The first 32 characters have an accumulated ratio of 0.9991758400804515.
|
|
The first 4 characters have an accumulated ratio of 0.4039034087176214.
|
|
All characters whose order is over 21 have an accumulated ratio of 0.03235169186956807.
|
|
|
|
1295 sequences found.
|
|
|
|
First 487 (typical positive ratio): 0.9950167482401342
|
|
Next 267 (754-487): 0.003984360305270163
|
|
Rest: 0.0009988914545956407
|
|
|
|
- Processing end: 2022-12-15 00:04:31.911782
|