Skip to content

Commit 1395545

Browse files
author
Sebastian Köln
committed
initial
0 parents  commit 1395545

File tree

10 files changed

+1057
-0
lines changed

10 files changed

+1057
-0
lines changed

Cargo.toml

+10
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
[package]
2+
name = "encoding"
3+
version = "0.1.0"
4+
authors = ["Sebastian Köln <[email protected]>"]
5+
edition = "2018"
6+
7+
# See more keys and their definitions at https://doc.rust-lang.org/cargo/reference/manifest.html
8+
9+
[dependencies]
10+
lazy_static = "*"

encodings/ReadMe.txt

+42
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
2011 July 12 Note:
2+
3+
The information in this ReadMe.txt is obsolete, but is retained for
4+
historical purposes, as documentation of the original mapping files
5+
included in this directory.
6+
7+
For a more current version of the Adobe glyph naming conventions, see
8+
the document "Glyph", located at:
9+
10+
<http://www.adobe.com/devnet/opentype/archives/glyph.html>
11+
12+
-------------------------------------------------------------------------------
13+
File name: Adobe ReadMe for Unicode 3.0
14+
Date: 30 March 1999
15+
-------------------------------------------------------------------------------
16+
17+
The document "Unicode and Glyph Names," at:
18+
19+
http://partners.adobe.com/asn/developer/typeforum/unicodegn.html
20+
21+
describes Adobe's PostScript glyph naming conventions in the context of
22+
Unicode and also contains links to the following 3 database files:
23+
24+
"The Adobe Glyph List" (AGL), which maps approximately 1000 glyph names to
25+
standard or Corporate Use subarea Unicode values.
26+
"Unicode's Corporate Use Subarea as used by Adobe."
27+
"Zapf Dingbats Glyph Names and Unicode Values."
28+
29+
-------------------------------------------------------------------------------
30+
31+
The 3 files in the current directory:
32+
33+
stdenc.txt
34+
symbol.txt
35+
zdingbat.txt
36+
37+
were originally provided by the Unicode Consortium for use by NeXT
38+
implementations with DPS, and continue to be provided for compatibility with
39+
those implementations. All others should refer to the files on Adobe.com
40+
mentioned above.
41+
42+
-------------------------------------------------------------------------------

encodings/stdenc.txt

+213
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,213 @@
1+
#
2+
# Name: Adobe Standard Encoding to Unicode
3+
# Unicode version: 2.0
4+
# Table version: 1.0
5+
# Date: 2011 July 12
6+
#
7+
# Copyright (c) 1991-2011 Unicode, Inc. All Rights reserved.
8+
#
9+
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium). No
10+
# claims are made as to fitness for any particular purpose. No warranties of
11+
# any kind are expressed or implied. The recipient agrees to determine
12+
# applicability of information provided. If this file has been provided on
13+
# magnetic media by Unicode, Inc., the sole remedy for any claim will be
14+
# exchange of defective media within 90 days of receipt.
15+
#
16+
# Unicode, Inc. hereby grants the right to freely use the information
17+
# supplied in this file in the creation of products supporting the
18+
# Unicode Standard, and to make copies of this file in any form for
19+
# internal or external distribution as long as this notice remains
20+
# attached.
21+
#
22+
# Format: 4 tab-delimited fields:
23+
#
24+
# (1) The Unicode value (in hexadecimal)
25+
# (2) The Adobe Standard Encoding code point (in hexadecimal)
26+
# (3) # Unicode name
27+
# (4) # PostScript character name
28+
#
29+
# General Notes:
30+
#
31+
# The Unicode values in this table were produced as the result of applying
32+
# the algorithm described in the section "Populating a Unicode space" in the
33+
# document "Unicode and Glyph Names," at
34+
# http://partners.adobe.com/asn/developer/typeforum/unicodegn.html
35+
# to the characters encoded in Adobe Standard Encoding. Note that some
36+
# Standard Encoding characters, such as "space", are mapped to 2 Unicode
37+
# values. Refer to the above document for more details.
38+
#
39+
# 2011 July 12: The above link is no longer valid. For comparable,
40+
# more current information, see the document, "Glyph", at:
41+
# <http://www.adobe.com/devnet/opentype/archives/glyph.html>
42+
#
43+
# Revision History:
44+
#
45+
# [v1.0, 2011 July 12]
46+
# Updated terms of use to current wording.
47+
# Updated contact information and document link.
48+
# No changes to the mapping data.
49+
#
50+
# [v0.2, 30 March 1999]
51+
# Different algorithm to produce Unicode values (see notes above) results in
52+
# some character codes being mapped to 2 Unicode values. Updated Unicode
53+
# names to Unicode 2.0 names.
54+
#
55+
# [v0.1, 5 May 1995] First release.
56+
#
57+
# Use the Unicode reporting form <http://www.unicode.org/reporting.html>
58+
# for any questions or comments or to report errors in the data.
59+
#
60+
0020 20 # SPACE # space
61+
00A0 20 # NO-BREAK SPACE # space
62+
0021 21 # EXCLAMATION MARK # exclam
63+
0022 22 # QUOTATION MARK # quotedbl
64+
0023 23 # NUMBER SIGN # numbersign
65+
0024 24 # DOLLAR SIGN # dollar
66+
0025 25 # PERCENT SIGN # percent
67+
0026 26 # AMPERSAND # ampersand
68+
2019 27 # RIGHT SINGLE QUOTATION MARK # quoteright
69+
0028 28 # LEFT PARENTHESIS # parenleft
70+
0029 29 # RIGHT PARENTHESIS # parenright
71+
002A 2A # ASTERISK # asterisk
72+
002B 2B # PLUS SIGN # plus
73+
002C 2C # COMMA # comma
74+
002D 2D # HYPHEN-MINUS # hyphen
75+
00AD 2D # SOFT HYPHEN # hyphen
76+
002E 2E # FULL STOP # period
77+
002F 2F # SOLIDUS # slash
78+
0030 30 # DIGIT ZERO # zero
79+
0031 31 # DIGIT ONE # one
80+
0032 32 # DIGIT TWO # two
81+
0033 33 # DIGIT THREE # three
82+
0034 34 # DIGIT FOUR # four
83+
0035 35 # DIGIT FIVE # five
84+
0036 36 # DIGIT SIX # six
85+
0037 37 # DIGIT SEVEN # seven
86+
0038 38 # DIGIT EIGHT # eight
87+
0039 39 # DIGIT NINE # nine
88+
003A 3A # COLON # colon
89+
003B 3B # SEMICOLON # semicolon
90+
003C 3C # LESS-THAN SIGN # less
91+
003D 3D # EQUALS SIGN # equal
92+
003E 3E # GREATER-THAN SIGN # greater
93+
003F 3F # QUESTION MARK # question
94+
0040 40 # COMMERCIAL AT # at
95+
0041 41 # LATIN CAPITAL LETTER A # A
96+
0042 42 # LATIN CAPITAL LETTER B # B
97+
0043 43 # LATIN CAPITAL LETTER C # C
98+
0044 44 # LATIN CAPITAL LETTER D # D
99+
0045 45 # LATIN CAPITAL LETTER E # E
100+
0046 46 # LATIN CAPITAL LETTER F # F
101+
0047 47 # LATIN CAPITAL LETTER G # G
102+
0048 48 # LATIN CAPITAL LETTER H # H
103+
0049 49 # LATIN CAPITAL LETTER I # I
104+
004A 4A # LATIN CAPITAL LETTER J # J
105+
004B 4B # LATIN CAPITAL LETTER K # K
106+
004C 4C # LATIN CAPITAL LETTER L # L
107+
004D 4D # LATIN CAPITAL LETTER M # M
108+
004E 4E # LATIN CAPITAL LETTER N # N
109+
004F 4F # LATIN CAPITAL LETTER O # O
110+
0050 50 # LATIN CAPITAL LETTER P # P
111+
0051 51 # LATIN CAPITAL LETTER Q # Q
112+
0052 52 # LATIN CAPITAL LETTER R # R
113+
0053 53 # LATIN CAPITAL LETTER S # S
114+
0054 54 # LATIN CAPITAL LETTER T # T
115+
0055 55 # LATIN CAPITAL LETTER U # U
116+
0056 56 # LATIN CAPITAL LETTER V # V
117+
0057 57 # LATIN CAPITAL LETTER W # W
118+
0058 58 # LATIN CAPITAL LETTER X # X
119+
0059 59 # LATIN CAPITAL LETTER Y # Y
120+
005A 5A # LATIN CAPITAL LETTER Z # Z
121+
005B 5B # LEFT SQUARE BRACKET # bracketleft
122+
005C 5C # REVERSE SOLIDUS # backslash
123+
005D 5D # RIGHT SQUARE BRACKET # bracketright
124+
005E 5E # CIRCUMFLEX ACCENT # asciicircum
125+
005F 5F # LOW LINE # underscore
126+
2018 60 # LEFT SINGLE QUOTATION MARK # quoteleft
127+
0061 61 # LATIN SMALL LETTER A # a
128+
0062 62 # LATIN SMALL LETTER B # b
129+
0063 63 # LATIN SMALL LETTER C # c
130+
0064 64 # LATIN SMALL LETTER D # d
131+
0065 65 # LATIN SMALL LETTER E # e
132+
0066 66 # LATIN SMALL LETTER F # f
133+
0067 67 # LATIN SMALL LETTER G # g
134+
0068 68 # LATIN SMALL LETTER H # h
135+
0069 69 # LATIN SMALL LETTER I # i
136+
006A 6A # LATIN SMALL LETTER J # j
137+
006B 6B # LATIN SMALL LETTER K # k
138+
006C 6C # LATIN SMALL LETTER L # l
139+
006D 6D # LATIN SMALL LETTER M # m
140+
006E 6E # LATIN SMALL LETTER N # n
141+
006F 6F # LATIN SMALL LETTER O # o
142+
0070 70 # LATIN SMALL LETTER P # p
143+
0071 71 # LATIN SMALL LETTER Q # q
144+
0072 72 # LATIN SMALL LETTER R # r
145+
0073 73 # LATIN SMALL LETTER S # s
146+
0074 74 # LATIN SMALL LETTER T # t
147+
0075 75 # LATIN SMALL LETTER U # u
148+
0076 76 # LATIN SMALL LETTER V # v
149+
0077 77 # LATIN SMALL LETTER W # w
150+
0078 78 # LATIN SMALL LETTER X # x
151+
0079 79 # LATIN SMALL LETTER Y # y
152+
007A 7A # LATIN SMALL LETTER Z # z
153+
007B 7B # LEFT CURLY BRACKET # braceleft
154+
007C 7C # VERTICAL LINE # bar
155+
007D 7D # RIGHT CURLY BRACKET # braceright
156+
007E 7E # TILDE # asciitilde
157+
00A1 A1 # INVERTED EXCLAMATION MARK # exclamdown
158+
00A2 A2 # CENT SIGN # cent
159+
00A3 A3 # POUND SIGN # sterling
160+
2044 A4 # FRACTION SLASH # fraction
161+
2215 A4 # DIVISION SLASH # fraction
162+
00A5 A5 # YEN SIGN # yen
163+
0192 A6 # LATIN SMALL LETTER F WITH HOOK # florin
164+
00A7 A7 # SECTION SIGN # section
165+
00A4 A8 # CURRENCY SIGN # currency
166+
0027 A9 # APOSTROPHE # quotesingle
167+
201C AA # LEFT DOUBLE QUOTATION MARK # quotedblleft
168+
00AB AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK # guillemotleft
169+
2039 AC # SINGLE LEFT-POINTING ANGLE QUOTATION MARK # guilsinglleft
170+
203A AD # SINGLE RIGHT-POINTING ANGLE QUOTATION MARK # guilsinglright
171+
FB01 AE # LATIN SMALL LIGATURE FI # fi
172+
FB02 AF # LATIN SMALL LIGATURE FL # fl
173+
2013 B1 # EN DASH # endash
174+
2020 B2 # DAGGER # dagger
175+
2021 B3 # DOUBLE DAGGER # daggerdbl
176+
00B7 B4 # MIDDLE DOT # periodcentered
177+
2219 B4 # BULLET OPERATOR # periodcentered
178+
00B6 B6 # PILCROW SIGN # paragraph
179+
2022 B7 # BULLET # bullet
180+
201A B8 # SINGLE LOW-9 QUOTATION MARK # quotesinglbase
181+
201E B9 # DOUBLE LOW-9 QUOTATION MARK # quotedblbase
182+
201D BA # RIGHT DOUBLE QUOTATION MARK # quotedblright
183+
00BB BB # RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK # guillemotright
184+
2026 BC # HORIZONTAL ELLIPSIS # ellipsis
185+
2030 BD # PER MILLE SIGN # perthousand
186+
00BF BF # INVERTED QUESTION MARK # questiondown
187+
0060 C1 # GRAVE ACCENT # grave
188+
00B4 C2 # ACUTE ACCENT # acute
189+
02C6 C3 # MODIFIER LETTER CIRCUMFLEX ACCENT # circumflex
190+
02DC C4 # SMALL TILDE # tilde
191+
00AF C5 # MACRON # macron
192+
02C9 C5 # MODIFIER LETTER MACRON # macron
193+
02D8 C6 # BREVE # breve
194+
02D9 C7 # DOT ABOVE # dotaccent
195+
00A8 C8 # DIAERESIS # dieresis
196+
02DA CA # RING ABOVE # ring
197+
00B8 CB # CEDILLA # cedilla
198+
02DD CD # DOUBLE ACUTE ACCENT # hungarumlaut
199+
02DB CE # OGONEK # ogonek
200+
02C7 CF # CARON # caron
201+
2014 D0 # EM DASH # emdash
202+
00C6 E1 # LATIN CAPITAL LETTER AE # AE
203+
00AA E3 # FEMININE ORDINAL INDICATOR # ordfeminine
204+
0141 E8 # LATIN CAPITAL LETTER L WITH STROKE # Lslash
205+
00D8 E9 # LATIN CAPITAL LETTER O WITH STROKE # Oslash
206+
0152 EA # LATIN CAPITAL LIGATURE OE # OE
207+
00BA EB # MASCULINE ORDINAL INDICATOR # ordmasculine
208+
00E6 F1 # LATIN SMALL LETTER AE # ae
209+
0131 F5 # LATIN SMALL LETTER DOTLESS I # dotlessi
210+
0142 F8 # LATIN SMALL LETTER L WITH STROKE # lslash
211+
00F8 F9 # LATIN SMALL LETTER O WITH STROKE # oslash
212+
0153 FA # LATIN SMALL LIGATURE OE # oe
213+
00DF FB # LATIN SMALL LETTER SHARP S # germandbls

0 commit comments

Comments
 (0)