uchardet/test/he/ibm862.visual.txt
Jehan 6d31689632 test: adding 2 tests for Hebrew/IBM862 recognition.
This is the same text, taken from this Wikipedia page, which was today's
page of honor on Wikipedia in Hebrew:
https://he.wikipedia.org/wiki/שתי מסכתות על ממשל מדיני

I put it in 2 variants, since IBM862 can be used in logical and visual
variants. The visual variant is just about inverting orders of letters
(per lines, while lines stay in proper order), so that's what I did.
Though note that the English title quoted in the text should likely not
have been reverted, but it doesn't matter too much since anyway these
are off-Hebrew alphabet and would trigger bad sequence score, whichever
their order. So I didn't bother fixing these.
2022-12-16 23:35:17 +01:00

2 lines
539 B
Plaintext
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

.…š…‹Ž‘ š…Œ…<C592>… ‰Ž‰ˆ‰‚Œ <20>ˆŒ™ Œ™ …š‰Œ‹š ,‰<>‰ƒŽ„ Œ™ŽŽ„ ˜…—Ž ‰<>Œ …𙉂 š€ —…Œ ‚‰–Ž „‰‰<E280B0>™„ šŽ<E28098> .„™…˜<CB9C> ˜<>„ ‰<>‰ƒŽ <20>ˆŒ™ <20><EFBFBD>Œ „‡”™Ž<E284A2> <20>€„ <20>ˆŒ™ <20><EFBFBD> š…‰‹™Ž„ šŽ‰‰— „‰”Œ „™‰‚„ š€… <20>ŒŽ Œ™ š‰„…Œ€„ <20>š…<20>…‰˜ š€ ,š‰ˆ…Œ…<E280A6>€ „‹…ŒŽ Œ™ š…‰Ž…‰ˆ‰‚Œ„ ‰<>Œ ˜ŽŒ‰” Œ™ …‰<E280A6>ˆ š€ …† šŽ<E28098> ˜<CB9C>Ž —…Œ ."„‹˜€‰˜ˆ”" …˜<E2809D> ˜ŽŒ‰” ˆ˜<CB86>˜ ‚‰–„™ <20>ˆ„ š€ Š‰˜”„Œ ™—<E284A2>Ž —…Œ „<>…™€˜„ šŽ<E28098> .)š…š‹‘Ž( <20>‰—Œ‡ ‰<>™Ž <20>˜…Ž ˜<CB9C>‰‡„ ]1[.9861-<2D> <20><20>…Œ‰<E280B0> <20>˜…”™ ,—…Œ <20>…\' ‰Œ<C592>€„ „‚…„„ š€Ž ‰ˆ‰Œ…”-‰”…‘…Œ‰” ˜<CB9C>‰‡ €…„ )tnemnrevoG fo sesitaerT owT :š‰Œ<C592><EFBFBD>( ‰<>‰ƒŽ Œ™ŽŽ Œ’ š…š‹‘Ž ‰š™