HKIUG Code Table for CJK Characters : Mapping to Unicode

  • Ki Tat Lam (Owner)

Dataset

Description

The EACC/Unicode Mapping Table supplements the Library of Congress's East Asian Code Tables in two main ways:

Identified multi-mapping cases and marked HKIUG's preferred mapping: System implementers can use this multi-mapping information to develop logic for handling the many-to-one mapping problem from EACC to Unicode.

Included Pure CCCII characters: In addition to the EACC characters found in LC's code tables, 7,044 Pure CCCII characters are included in the HKIUG Code Table to reduce the occurrence of missing characters. These are CCCII (and non-EACC) characters that have been in use in HKIUG member libraries. They are called Pure CCCII because their inclusion in the table does not introduce more multi-mapping cases.
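As an illustration of how an implementer might use a preferred-mapping flag, the following is a minimal Python sketch, not the table's actual format or HKIUG's own logic. The row layout `(eacc_hex, unicode_hex, is_preferred)`, the function name `build_maps`, and every code value shown are assumptions made up for this example; the Unicode values are deliberately taken from the Private Use Area so they cannot be mistaken for real table entries. The sketch assumes that each EACC code maps to exactly one Unicode code point, and that when several EACC codes share a code point, exactly one of them is marked as preferred.

```python
from collections import defaultdict

# Hypothetical rows: (eacc_hex, unicode_hex, is_preferred).
# All values are invented placeholders, NOT entries from the HKIUG table.
ROWS = [
    ("210001", "E000", True),   # two EACC codes share U+E000 ...
    ("210002", "E000", False),  # ... and this one is not the preferred one
    ("210003", "E001", True),   # a plain one-to-one case
]


def build_maps(rows):
    """Return (eacc -> unicode, unicode -> preferred eacc) lookup dicts."""
    to_unicode = {}
    candidates = defaultdict(list)
    for eacc, uni, preferred in rows:
        to_unicode[eacc] = uni              # forward direction is unambiguous
        candidates[uni].append((eacc, preferred))

    to_eacc = {}
    for uni, cands in candidates.items():
        if len(cands) == 1:
            # One-to-one case: no choice to make.
            to_eacc[uni] = cands[0][0]
            continue
        # Many-to-one case: the reverse lookup needs the preferred flag.
        preferred = [eacc for eacc, flag in cands if flag]
        if len(preferred) != 1:
            raise ValueError(f"U+{uni}: expected exactly one preferred mapping")
        to_eacc[uni] = preferred[0]
    return to_unicode, to_eacc


TO_UNICODE, TO_EACC = build_maps(ROWS)
```

In this sketch both `210001` and `210002` convert forward to the same code point, while the reverse lookup for that code point returns only the code flagged as preferred. Raising an error when the flag is missing or duplicated is a design choice of the example, made so that ambiguous rows surface early rather than being resolved silently.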

The HKIUG EACC/Unicode Mapping Table is available in HTML and XML formats. The version deposited here is version 1.0.5, released on 14 November 2013.

The HKIUG EACC/Unicode Mapping Table was developed by the HKIUG (Hong Kong Innovative Users Group) Unicode Task Force, chaired by Ki Tat Lam of the Hong Kong University of Science and Technology Library. HKIUG and the Unicode Task Force were dissolved on 30 June 2017.
Date made available: 17 Dec 2019
Publisher: DataSpace@HKUST

Keywords

  • EACC/Unicode mapping table
  • East Asia Coded Character
  • Unicode
  • Chinese Japanese Korean Code Mapping Table
  • Character Sets
  • Data Processing
