List of non-unified Han characters

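Some characters are merely regional ways of writing the same character, yet had to be encoded separately because of the source separation rule. Several national and vendor standards contain characters that are nearly identical in shape but separately encoded: South Korea's KS X 1001:1998 (U+F900-U+FA0B, 268 characters), Taiwan's Big5 (U+FA0C-U+FA0D, 2 characters), Japan's IBM 32 (a CP932 variant; U+FA0E-U+FA2D, 32 characters), South Korea's KS X 1001:2004 (U+FA2E-U+FA2F, 2 characters), Japan's JIS X 0213 (U+FA30-U+FA6A, 59 characters), ARIB STD-B24 (U+FA6B-U+FA6D, 3 characters), and North Korea's KPS 10721-2000 (U+FA70-U+FAD9, 106 characters). The Compatibility Ideographs block was created so that text could be interchanged with these standards. Note that the source separation rule was abandoned from the point at which the Unicode Consortium decided to place non-canonical forms in the Compatibility Ideographs block of the Basic Multilingual Plane, because the Taiwanese source (T-source, i.e. CNS 11643) contains too many characters that are nearly identical in shape and that should be unified under Unicode's rules. For such characters, only the canonical form is encoded in the regular character set (including the extension blocks); the non-canonical forms are placed in the Compatibility Ideographs Supplement in Plane 2 (the Supplementary Ideographic Plane).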
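The character counts quoted above follow directly from the inclusive code point ranges. The sketch below recomputes them as a sanity check. The names `COMPAT_RANGES`, `range_size`, and `check_counts` are illustrative only, and the count of 2 for U+FA2E-U+FA2F is taken from the size of the range itself.

```python
# Ranges and counts as quoted in the paragraph above.
# The tuple layout and all names here are illustrative assumptions.
COMPAT_RANGES = [
    ("KS X 1001:1998", 0xF900, 0xFA0B, 268),
    ("Big5", 0xFA0C, 0xFA0D, 2),
    ("IBM 32 (CP932 variant)", 0xFA0E, 0xFA2D, 32),
    ("KS X 1001:2004", 0xFA2E, 0xFA2F, 2),  # count implied by the range
    ("JIS X 0213", 0xFA30, 0xFA6A, 59),
    ("ARIB STD-B24", 0xFA6B, 0xFA6D, 3),
    ("KPS 10721-2000", 0xFA70, 0xFAD9, 106),
]


def range_size(first, last):
    """Number of code points in the inclusive range first..last."""
    return last - first + 1


def check_counts(ranges=COMPAT_RANGES):
    """Return (source, stated, computed) for every entry whose stated
    count differs from the size of its range."""
    return [
        (name, stated, range_size(first, last))
        for name, first, last, stated in ranges
        if stated != range_size(first, last)
    ]


if __name__ == "__main__":
    for name, first, last, stated in COMPAT_RANGES:
        size = range_size(first, last)
        print(f"{name}: U+{first:04X}-U+{last:04X} -> {size} code points")
    print("mismatches:", check_counts())
```

An empty mismatch list means every stated count agrees with its range.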

The following lists every character that appears in the ISO/IEC JTC1/SC2/WG2 source separation rule document. -{R|

Unicode  Unicode  Unicode
U+4E1F  U+4E22
U+4E48  U+5E7A
U+4E89  U+722D
U+4EDE  U+4EED
U+4F75  U+5002
U+4FA3  U+4FB6
U+4FC1  U+4FE3
U+4FDE  U+516A
U+4FF1  U+5036
U+5024  U+503C
U+5077  U+5078
U+507D  U+50DE
U+514C  U+5151
U+514E  U+5154
U+5156  U+5157
U+518A  U+518C
U+51C0  U+51C8
U+51E2  U+51E3
U+5203  U+5204
U+520A  U+520B
U+5220  U+522A
U+5225  U+522B
U+5238  U+52B5
U+5239  U+524E
U+524F  U+5259
U+525D  U+5265
U+5292  U+5294
U+52FB  U+5300
U+5355  U+5358
U+5373  U+537D
U+5377  U+5DFB
U+53C1  U+53C2
U+53C3  U+53C4
U+5415  U+5442
U+541E  U+5451
U+5433  U+5434  U+5449
U+5436  U+5450
U+543F  U+544A
U+5527  U+559E
U+55A9  U+55BB
U+5618  U+5653
U+568F  U+5694
U+56EF  U+56FD
U+5708  U+570F
U+570E  U+5713
U+5716  U+5717
U+5759  U+5DE0
U+57D2  U+57D3
U+5848  U+588D
U+5861  U+586B
U+5897  U+589E
U+58EE  U+58EF
U+58FD  U+5900
U+5910  U+657B
U+5965  U+5967
U+5968  U+596C  U+734E
U+5986  U+599D
U+598D  U+59F8
U+59CD  U+59D7
U+5A1B  U+5A2F  U+5A31
U+5A55  U+5AAB
U+5A7E  U+5AAE
U+5AAA  U+5ABC
U+5AAF  U+5B00
U+5B0E  U+5B14
U+5B24  U+5B37
U+5B73  U+5B76
U+5BAB  U+5BAE
U+5BDB  U+5BEC
U+5BDC  U+5BE7
U+5BDD  U+5BE2
U+5C02  U+5C08
U+5C06  U+5C07
U+5C13  U+5C14
U+5C19  U+5C1A
U+5C2A  U+5C2B
U+5C36  U+5C37
U+5C4F  U+5C5B
U+5CE5  U+5D22
U+5DD3  U+5DD4
U+5E21  U+5E32
U+5E2F  U+5E36
U+5E76  U+5E77
U+5EC4  U+5ECF
U+5F11  U+5F12
U+5F37  U+5F3A
U+5F39  U+5F3E
U+5F50  U+5F51
U+5F54  U+5F55
U+5F59  U+5F5A
U+5F5B  U+5F5C
U+5F5D  U+5F5E
U+5F65  U+5F66
U+5FB3  U+5FB7
U+5FB4  U+5FB5
U+6075  U+60E0
U+6085  U+60A6
U+609E  U+60AE
U+60B3  U+60EA
U+6120  U+614D
U+613C  U+614E
U+6229  U+622C
U+622F  U+6231
U+6236  U+6237  U+6238
U+623B  U+623E
U+629B  U+62CB
U+629C  U+62D4
U+6329  U+635D
U+633F  U+63D2  U+63F7
U+634F  U+63D1
U+635C  U+641C
U+63B2  U+63ED
U+63FA  U+6416  U+6447
U+63FE  U+6435
U+6483  U+64CA
U+654E  U+6559
U+6553  U+655A
U+65E2  U+65E3
U+6602  U+663B
U+665A  U+6669
U+66A8  U+66C1
U+66FD  U+66FE
U+67B4  U+67FA
U+67E5  U+67FB
U+67F5  U+6805
U+68B2  U+68C1
U+6961  U+6986
U+6982  U+69EA
U+6985  U+69B2
U+699D  U+6A27
U+69C7  U+69D9
U+69D8  U+6A23
U+6A2A  U+6A6B
U+6B65  U+6B69
U+6B72  U+6B73
U+6B7F (歿)  U+6B81
U+6BBB  U+6BBC
U+6BC0  U+6BC1
U+6BCE  U+6BCF
U+6C32  U+6C33
U+6C5A  U+6C61
U+6C92  U+6CA1
U+6D44  U+6DE8
U+6D89  U+6E09
U+6D97  U+6D9A
U+6D99  U+6DDA
U+6DE5  U+6E0C
U+6DF8  U+6E05
U+6E07  U+6E34
U+6E29  U+6EAB
U+6E88  U+6F59
U+6E89  U+6F11
U+6EDA  U+6EFE
U+6F5B  U+6FF3
U+7028  U+702C
U+70BA  U+7232
U+712D  U+7162
U+7155  U+7199
U+7174  U+7185
U+72B6  U+72C0
U+7464  U+7476
U+74F6  U+7501
U+7522  U+7523
U+75E9  U+7626
U+76A1  U+76A5
U+771E  U+771F
U+773E  U+8846
U+7814  U+784F
U+797F (祿)  U+7984
U+79BF (禿)  U+79C3
U+7A05  U+7A0E
U+7A42  U+7A57
U+7B5D  U+7B8F
U+7BB3  U+7C08
U+7BE1  U+7C12
U+7CA4  U+7CB5
U+7D55  U+7D76
U+7DA0  U+7DD1
U+7DD2  U+7DD6
U+7DE3  U+7E01
U+7DFC  U+7E15
U+7E48  U+7E66
U+7FAE  U+7FB9
U+7FF6  U+7FFA
U+80FC  U+8141
U+812B  U+8131
U+817D  U+8183
U+8203  U+8204
U+820D  U+820E
U+8216  U+8217
U+8358  U+838A
U+83D1  U+8458
U+8480  U+8495
U+848B  U+8523
U+848D  U+853F
U+8570  U+8580
U+85AB  U+85B0
U+85F4  U+860A
U+865A  U+865B
U+86FB  U+8715
U+885B  U+885E
U+886E  U+889E
U+88C5  U+88DD
U+8A2E  U+8A7D
U+8AAA  U+8AAC
U+8ACC  U+8AEB
U+8B20  U+8B21
U+8C5C  U+8C63
U+8D70  U+8D71
U+8EFF (軿)  U+8F27
U+8F1C  U+8F3A
U+8F3C  U+8F40
U+8FBE  U+8FD6
U+8FF8  U+902C
U+9059  U+9065
U+90A2  U+90C9
U+90CE  U+90DE
U+90F7  U+9109  U+9115
U+9196  U+919E
U+91A4  U+91AC
U+9203  U+9292
U+92B3  U+92ED
U+9304  U+9332
U+932C  U+934A
U+93AD  U+93AE
U+95B1  U+95B2
U+9667  U+9689
U+9751  U+9752
U+9759  U+975C
U+976D  U+9771
U+9839  U+983D
U+984F  U+9854
U+985A  U+985B
U+98EE  U+98F2
U+9905  U+9920
U+99B1  U+99C4
U+99E2  U+9A08
U+9AA9  U+9AAB
U+9AD8  U+9AD9
U+9AEA  U+9AEE
U+9B2C  U+9B2D
U+9C1B  U+9C2E
U+9CEF  U+9CF3
U+9D87  U+9DAB
U+9DC6  U+9DCF
U+9EAA  U+9EAB
U+9EBC  U+9EBD
U+9EC3  U+9EC4
U+9ED1  U+9ED2

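}- After the table above was published, WG2 also examined other Han characters[1] and concluded that the following characters in the Basic Multilingual Plane could likewise be considered for inclusion in ISO 10646 Annex S3: -{R|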

Unicode  Unicode
U+5022  U+507C
U+52C0  U+52CA
U+5637  U+5651
U+5EFB  U+5EFD
U+6323  U+6399
U+66AD  U+66CD
U+6808  U+685F
U+6D85  U+6E7C
U+6F40  U+6F68
U+6FF2  U+7014
U+734B  U+7354
U+84D8  U+8509
U+86D4  U+8716
U+8B86  U+8B8F
U+8FF4  U+9025
U+91F0  U+91FC

}-
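Because the tables above give only code points, it can help to render each row as actual glyphs and to confirm that Unicode normalization does not merge the members of a row. The following is a minimal sketch using Python's standard `re` and `unicodedata` modules. The helper names `parse_row`, `render_row`, and `stays_distinct` are illustrative, and the last demo row (U+F900 with U+8C48) is an added contrast case from the Compatibility Ideographs block, not a row from the tables.

```python
import re
import unicodedata

# Matches "U+XXXX" notation; works whether or not the columns are
# separated by whitespace.
CODE_POINT = re.compile(r"U\+([0-9A-F]{4,6})")


def parse_row(row):
    """Extract the code points of one table row as integers."""
    return [int(hex_digits, 16) for hex_digits in CODE_POINT.findall(row)]


def render_row(row):
    """Turn a row of code points into the characters themselves."""
    return " / ".join(chr(cp) for cp in parse_row(row))


def stays_distinct(row, form="NFKC"):
    """True if every character in the row is still different from the
    others after normalization to the given form."""
    chars = [chr(cp) for cp in parse_row(row)]
    normalized = {unicodedata.normalize(form, c) for c in chars}
    return len(normalized) == len(chars)


if __name__ == "__main__":
    demo_rows = (
        "U+4E1F U+4E22",
        "U+5433 U+5434 U+5449",
        "U+5022 U+507C",
        "U+F900 U+8C48",  # contrast: compatibility ideograph and its target
    )
    for row in demo_rows:
        print(row, "->", render_row(row),
              "| distinct after NFKC:", stays_distinct(row))
```

The source-separated pairs are ordinary unified ideographs with no decomposition mappings, so normalization leaves them apart, whereas a compatibility ideograph such as U+F900 folds into its unified counterpart.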

Notes

  1. Taichi Kawabata (川幡太一): IRGN 1155, Possible multiple-encoded Ideographs in the UCS, 2005-11-21


This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.