xref: /openbmc/linux/fs/unicode/README.utf8data (revision 28ba53c0)
1The utf8data.h file in this directory is generated from the Unicode
2Character Database for version 12.1.0 of the Unicode standard.
3
4The full set of files can be found here:
5
6  http://www.unicode.org/Public/12.1.0/ucd/
7
8Note!
9
10The URL's listed below are not stable.  That's because Unicode 12.1.0
11has not been officially released yet; it is scheduled to be released
12on May 8, 2019.  We taking Unicode 12.1.0 a few weeks early because it
13contains a new Japanese character which is required in order to
14specify Japenese dates after May 1, 2019, when Crown Prince Naruhito
15ascends to the Chrysanthemum Throne.  (Isn't internationalization fun?
16The abdication of Emperor Akihito of Japan is requiring dozens of
17software packages to be updated with only a month's notice.  :-)
18
19We will update the URL's (and any needed changes to the checksums)
20after the final Unicode 12.1.0 is released.
21
22Individual source links:
23
24  https://www.unicode.org/Public/12.1.0/ucd/CaseFolding-12.1.0d2.txt
25  https://www.unicode.org/Public/12.1.0/ucd/DerivedAge-12.1.0d3.txt
26  https://www.unicode.org/Public/12.1.0/ucd/extracted/DerivedCombiningClass-12.1.0d2.txt
27  https://www.unicode.org/Public/12.1.0/ucd/DerivedCoreProperties-12.1.0d2.txt
28  https://www.unicode.org/Public/12.1.0/ucd/NormalizationCorrections-12.1.0d1.txt
29  https://www.unicode.org/Public/12.1.0/ucd/NormalizationTest-12.1.0d3.txt
30  https://www.unicode.org/Public/12.1.0/ucd/UnicodeData-12.1.0d2.txt
31
32md5sums (verify by running "md5sum -c README.utf8data"):
33
34  900e76da1d822a160fd6b8c0b1d70094  CaseFolding.txt
35  131256380bff4fea8ad4a851616f2f10  DerivedAge.txt
36  e731a4089b30002144e107e3d6f8d1fa  DerivedCombiningClass.txt
37  a47c9fbd7ff92a9b261ba9831e68778a  DerivedCoreProperties.txt
38  fcab6dad15e440879d92f315978f93d3  NormalizationCorrections.txt
39  f9ff1c55a60decf436100f791b44aa98  NormalizationTest.txt
40  755f6af699f8c8d2d958da411f78f6c6  UnicodeData.txt
41
42sha1sums (verify by running "sha1sum -c README.utf8data"):
43
44  dc9245f6803c4ac99555c361f5052e0b13eb779b  CaseFolding.txt
45  3281104f237184cdb5d869e86eb8573678ada7da  DerivedAge.txt
46  2f5f995ccb96e0fa84b15151b35d5e2681535175  DerivedCombiningClass.txt
47  5b8698a3fcd5018e1987f296b02e2c17e696415e  DerivedCoreProperties.txt
48  cd83935fbc012345d8792d2c704f69497e753835  NormalizationCorrections.txt
49  ea419aae505b337b0d99a83fa83fe58ddff7c19f  NormalizationTest.txt
50  dc973c0fc93d6f09d9ab9f70d1c9f89c447f0526  UnicodeData.txt
51
52
53To update to the newer version of the Unicode standard, the latest
54released version of the UCD can be found here:
55
56  http://www.unicode.org/Public/UCD/latest/
57
58Then, build under fs/unicode/ with REGENERATE_UTF8DATA=1:
59
60	make REGENERATE_UTF8DATA=1 fs/unicode/
61
62After sanity checking the newly generated utf8data.h file (the
63version generated from the 12.1.0 UCD should be 4,109 lines long, and
64have a total size of 324k) and/or comparing it with the older version
65of utf8data.h_shipped, rename it to utf8data.h_shipped.
66
67If you are a kernel developer updating to a newer version of the
68Unicode Character Database, please update this README.utf8data file
69with the version of the UCD that was used, the md5sum and sha1sums of
70the *.txt files, before checking in the new versions of the utf8data.h
71and README.utf8data files.
72