web-dev-qa-db-ja.com

一部のUnicode文字がどの言語に属しているかを確認するにはどうすればよいですか?

Facebookには、現在、次の奇妙な文字が浮かんでいるメッセージがあります。

ก็็็็็็็็็็็็็็็็็็็็กิิิิิิิิิิิิิิิิิิิิก้้้้้้้้้้้้้้้้้้้

これらは、指定されたテキスト領域から外れるため、読者を混乱させるために使用されます。

彼らは本当に言語に属していますか?もしそうなら、どれですか?

5
magnattic

これらは、基本文字の後に発音区別符号を組み合わせた長い文字列を持つタイ文字です。ラテン文字でも非常によく似たことができます。 â̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂̂または、ホーンを組み合わせるシーケンスを使用することもできます:ư̛̛̛̛̛̛̛̛̛̛̛̛̛̛̛̛̛̛またはセディーユ:当然、不自然なシーケンスがどのように表示されるかは、レンダリングソフトウェアに依存します。

9

Linuxを使用している場合は、Perlスクリプトを試すことができます tfinfo.plコマンドラインで文字列内のUTF-8/Unicode文字をチェック/検索するプログラム? も参照してください。 =);私が得る出力は次のとおりです。

$ echo ก็็็็็็็็็็ กิิิิิิิิิิ ก้้้้้้้้้้  | Perl utfinfo.pl
Got 64 uchars
Char: 'ก' u: 3585 [0x0E01] b: 224,184,129 [0xE0,0xB8,0x81] n: THAI CHARACTER KO KAI [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: '็' u: 3655 [0x0E47] b: 224,185,135 [0xE0,0xB9,0x87] n: THAI CHARACTER MAITAIKHU [Thai]
Char: ' ' u: 32 [0x0020] b: 32 [0x20] n: SPACE [Basic Latin]
Char: 'ก' u: 3585 [0x0E01] b: 224,184,129 [0xE0,0xB8,0x81] n: THAI CHARACTER KO KAI [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: 'ิ' u: 3636 [0x0E34] b: 224,184,180 [0xE0,0xB8,0xB4] n: THAI CHARACTER SARA I [Thai]
Char: ' ' u: 32 [0x0020] b: 32 [0x20] n: SPACE [Basic Latin]
Char: 'ก' u: 3585 [0x0E01] b: 224,184,129 [0xE0,0xB8,0x81] n: THAI CHARACTER KO KAI [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
Char: '้' u: 3657 [0x0E49] b: 224,185,137 [0xE0,0xB9,0x89] n: THAI CHARACTER MAI THO [Thai]
1
sdaau
0
Mathias Bynens

Googleで検索してください。例えば。 กはタイ語です。

0
MSalters

いくつかの非常に簡単なグーグルの仕事によると、グーグル翻訳はそれらがタイ文字であると信じています。スタイルからはほぼ正しく見えます。

0
Turix