Skip to content

gh-84508: tool to generate cjk traditional chinese mappings#93272

Merged
corona10 merged 7 commits intopython:mainfrom
sorcio:genmap-tchinese
Jun 11, 2022
Merged

gh-84508: tool to generate cjk traditional chinese mappings#93272
corona10 merged 7 commits intopython:mainfrom
sorcio:genmap-tchinese

Conversation

@sorcio
Copy link
Copy Markdown
Contributor

@sorcio sorcio commented May 26, 2022

This adds a script to generate the mapping files for Traditional Chinese Big-5-based codecs, as discussed in the issue.

I initially planned to add support for later versions of HKSCS, but I decided to keep this minimal so to close gh-84508. The topic of refreshing the mappings is split to its own issue in gh-93271.

So this generates mappings_tw.h and mappings_hk.h files identical to the existing versions (with only one new line difference).

Notes about the mapping files:

  • I did not include the BIG5.TXT and CP950.TXT files that are available on the Unicode website. It looks like these are available for redistribution, so I can add them to the PR if needed.
  • I also did not include the hkscs-2004-big5-iso.txt file, but that's a different story. The terms of use from the source website include a clause that unilaterally binds to any update to the terms (clause 2), which I believe is incompatible with redistribution as part of CPython.

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add tools for generating mappings_XX.h

6 participants