Swánga̱lyiatwuki
kcgwiktionary
https://kcg.wiktionary.org/wiki/A%CC%B1tsak_wat_wu
MediaWiki 1.47.0-wmf.2
case-sensitive
Media
Wat khwo
A̱lyiat
A̱tyunta̱m
A̱lyiat a̱tyunta̱m
Swánga̱lyiatwuki
A̱lyiat Swánga̱lyiatwuki
Fail
A̱lyiat fail
MediaWiki
A̱lyiat MediaWiki
Ta̱mpi̱let
A̱lyiat ta̱mpi̱let
Beang
A̱lyiat beang
Sa
A̱lyiat nsa
TimedText
TimedText talk
Module
Module talk
Event
Event talk
mbwak
0
377
43515
18877
2026-05-14T11:11:24Z
Danjuma Anthony
44
/* Tyap */
43515
wikitext
text/x-wiki
=={{-kcg-}}==
[[Fail:Big and Small Age Grandpa and Grandson 01.jpg|thumb|'''Mbwak''' a̱nietkhwop-a̱niet ma̱ng nggwon.]]
===Vak yei/lyuut ghyang===
* [[mbvwak]]
===Bwoi a̱lyoot===
Neet mi̱ ''[[m-]]'' + ''[[bwak]]''.
===Yei===
* [[Za̱FBS]]: /m̀bvʷɑ̀g/
* A̱nwuat:
===A̱lyoot===
'''mbwak''' (''a̱nyiung'' '''[[bwak]]''')
# '''Mbwak''' yet bwak ku lan nyiung hwa.
#* ''Ka̱ '''mbwak''' na sweang, tyia̱ a̱junjung a̱toot la.''
#* ''Tyia̱ '''mbwak''' ang na mami.''
#* '''''Mbwak''' lat a̱ma.''
====Tai====
* [[si̱sa]]
====Bwan di̱ lilyem ghyáng====
:* A̱do: abọ
:* [[A̱ghwangkpang]]: [[Wikt:yo:ọwọ|ọwọ]]
:* [[A̱kum-a̱cyi]]: [[Wikt:ig:aka|aka]]
:* Ba̱mba̱ra: [[Wikt:bm:bolow|bolow]]
:* Eve: [[Wikt:ee:asiwo|asiwo]]
:* [[Jhyuo]]: [[Wikt:kaj:mbvwak|mbvwak]]
:* [[Kpat]]: [[Wikt:ha:hanaye|hanaye]]
:* Lati̱n: [[Wikt:la:manus|manus]]
:* [[Shong]]: [[Wikt:en:hands|hands]]
:* Swahili: [[Wikt:sw:mikono|mikono]]
:* Tswi: [[Wikt:tw:nsa|nsa]]
:* Zulu: [[Wikt:zu:izandla|izandla]]
[[Sa:a̱lyoot]]
1yev7sg798ncc6s701ospsqz9ruj4sc
ta̱m
0
469
43503
37935
2026-05-14T09:46:32Z
Danjuma Anthony
44
/* Yei */
43503
wikitext
text/x-wiki
==Tyap==
===Yei===
* [[Za̱FBS]]: /tǝ́m/
* A̱nwuat: [[|thumb|Si̱sak nang á̱ ngyei "ta̱m" a̱ni.]]
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''wu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
aa8vl4jepm0s3mac5yb16wx1ud5fxrn
43504
43503
2026-05-14T09:49:30Z
Danjuma Anthony
44
/* Yei */
43504
wikitext
text/x-wiki
==Tyap==
===Yei===
* [[Za̱FBS]]: /tǝ́m/
** {{audio|kcg|Kcg-nywán.ogg|a=Si̱sak nang á̱ ngyei "ta̱m" a̱ni}}
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''wu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
5oc486uuxjydichrj5vyw0jpuydqqbd
43505
43504
2026-05-14T09:50:30Z
Danjuma Anthony
44
/* Yei */
43505
wikitext
text/x-wiki
==Tyap==
===Yei===
* [[Za̱FBS]]: /tǝ́m/
** {{audio|kcg|kcg-ta̱m.ogg|a=Si̱sak nang á̱ ngyei "ta̱m" a̱ni}}
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''wu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
ctdh1ixo1hoauybhveg264fa5gm0a0f
43506
43505
2026-05-14T09:53:11Z
Danjuma Anthony
44
/* Yei */
43506
wikitext
text/x-wiki
==Tyap==
===Yei===
* [[Za̱FBS]]: /tǝ́m/
** {{audio|kcg|
LL-Q3912765 (kcg)-Kambai Akau-ta̱m.wav|thumb|a=Si̱sak nang á̱ ngyei "ta̱m" a̱ni}}
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''wu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
11d7y2064q0wd8j5lz6emd0tl0w4jog
43507
43506
2026-05-14T10:02:39Z
Danjuma Anthony
44
/* Tyap */
43507
wikitext
text/x-wiki
==Tyap==
[[File:Irrigation farming on sugarcane 2.jpg|thumb|ta̱m a̱yit]]
===Yei===
* [[Za̱FBS]]: /tǝ́m/
** {{audio|kcg|
LL-Q3912765 (kcg)-Kambai Akau-ta̱m.wav|thumb|a=Si̱sak nang á̱ ngyei "ta̱m" a̱ni}}
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''wu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
jplwvrv9rzvl5ozxpi0lrrwawei4v3o
43508
43507
2026-05-14T10:04:28Z
Danjuma Anthony
44
/* Tyap */
43508
wikitext
text/x-wiki
==[[Tyap]]==
[[File:Irrigation farming on sugarcane 2.jpg|thumb|ta̱m a̱yit]]
===Yei===
* [[Za̱FBS]]: /tǝ́m/
** {{audio|kcg|
LL-Q3912765 (kcg)-Kambai Akau-ta̱m.wav|thumb|a=Si̱sak nang á̱ ngyei "ta̱m" a̱ni}}
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''wu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
101v30bfhp7e8ng0nyg1596bzmda5vb
43509
43508
2026-05-14T10:07:14Z
Danjuma Anthony
44
/* Tyap */
43509
wikitext
text/x-wiki
=={{-kcg-}}==
[[File:Irrigation farming on sugarcane 2.jpg|thumb|ta̱m a̱yit]]
===Yei===
* [[Za̱FBS]]: /tǝ́m/
** {{audio|kcg|
LL-Q3912765 (kcg)-Kambai Akau-ta̱m.wav|thumb|a=Si̱sak nang á̱ ngyei "ta̱m" a̱ni}}
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''wu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
oamcdksxvkx32jaep5h0mdnfp8uijgw
43514
43509
2026-05-14T11:00:12Z
Danjuma Anthony
44
/* A̱lyoot */
43514
wikitext
text/x-wiki
=={{-kcg-}}==
[[File:Irrigation farming on sugarcane 2.jpg|thumb|ta̱m a̱yit]]
===Yei===
* [[Za̱FBS]]: /tǝ́m/
** {{audio|kcg|
LL-Q3912765 (kcg)-Kambai Akau-ta̱m.wav|thumb|a=Si̱sak nang á̱ ngyei "ta̱m" a̱ni}}
===A̱lyoot===
'''ta̱m''' (''á̱kpa'' '''[[nta̱m]]''')
:Sot swang a̱liyat:
::'''hu''' (''á̱kpa'' '''na''')
'''Ta̱m''' yet kyang hwa nang a̱yin nyia̱ a̱ni ma̱ng mumwang ku na bai ma̱ng ya a̱son a̱ni.
#* ''A̱tyoli Kumaai nat '''ta̱m''' di̱ fam a̱ta̱k Ka̱duna.''
#* '''''Ta̱m''' huni byia̱ a̱khwu za̱m.''
#* ''A̱fai a̱wa lyen nyia̱ '''ta̱m''' hu ansham.''
====Bwoi mi̱ bwoi====
* [[a̱banta̱m]]
* [[a̱tyunta̱m]]
====Tai====
* [[kyangnyian]]
====Bwan di̱ lilyem ghyáng====
* A̱do: {{t+|bin|iwinna}}
* [[A̱ghwangkpang]]: [[Wikt:yo:ṣiṣẹ|ṣiṣẹ]]
* Eve: [[Wikt:ee:dɔ|dɔ]]
* Ji̱kun: butso
* [[Kpat]]: [[Wikt:ha:aiki|aiki]]
* Kutyep: wande
* [[Shong]]: [[Wikt:en:work|work]]
* Swahili: [[Wikt:sw:kazi|kazi]]
[[Sa:a̱lyoot]]
bj9zrkpui265gykposhzgudud2ml7ei
Module:languages
828
696
43497
43416
2026-05-13T23:09:08Z
Kambai Akau
8
43497
Scribunto
text/plain
--[==[ intro:
This module implements fetching of language-specific information and processing text in a given language.
===Types of languages===
There are two types of languages: full languages and etymology-only languages. The essential difference is that only
full languages appear in L2 headings in vocabulary entries, and hence categories like [[:Category:French nouns]] exist
only for full languages. Etymology-only languages have either a full language or another etymology-only language as
their parent (in the parent-child inheritance sense), and for etymology-only languages with another etymology-only
language as their parent, a full language can always be derived by following the parent links upwards. For example,
"Canadian French", code `fr-CA`, is an etymology-only language whose parent is the full language "French", code `fr`.
An example of an etymology-only language with another etymology-only parent is "Northumbrian Old English", code
`ang-nor`, which has "Anglian Old English", code `ang-ang` as its parent; this is an etymology-only language whose
parent is "Old English", code `ang`, which is a full language. (This is because Northumbrian Old English is considered
a variety of Anglian Old English.) Sometimes the parent is the "Undetermined" language, code `und`; this is the case,
for example, for "substrate" languages such as "Pre-Greek", code `qsb-grc`, and "the BMAC substrate", code `qsb-bma`.
It is important to distinguish language ''parents'' from language ''ancestors''. The parent-child relationship is one
of containment, i.e. if X is a child of Y, X is considered a variety of Y. On the other hand, the ancestor-descendant
relationship is one of descent in time. For example, "Classical Latin", code `la-cla`, and "Late Latin", code `la-lat`,
are both etymology-only languages with "Latin", code `la`, as their parents, because both of the former are varieties
of Latin. However, Late Latin does *NOT* have Classical Latin as its parent because Late Latin is *not* a variety of
Classical Latin; rather, it is a descendant. There is in fact a separate `ancestors` field that is used to express the
ancestor-descendant relationship, and Late Latin's ancestor is given as Classical Latin. It is also important to note
that sometimes an etymology-only language is actually the conceptual ancestor of its parent language. This happens,
for example, with "Old Italian" (code `roa-oit`), which is an etymology-only variant of full language "Italian" (code
`it`), and with "Old Latin" (code `itc-ola`), which is an etymology-only variant of Latin. In both cases, the full
language has the etymology-only variant listed as an ancestor. This allows a Latin term to inherit from Old Latin
using the {{tl|inh}} template (where in this template, "inheritance" refers to ancestral inheritance, i.e. inheritance
in time, rather than in the parent-child sense); likewise for Italian and Old Italian.
Full languages come in three subtypes:
* {regular}: This indicates a full language that is attested according to [[WT:CFI]] and therefore permitted in the
main namespace. There may also be reconstructed terms for the language, which are placed in the
{Reconstruction} namespace and must be prefixed with * to indicate a reconstruction. Most full languages
are natural (not constructed) languages, but a few constructed languages (e.g. Esperanto and Volapük,
among others) are also allowed in the mainspace and considered regular languages.
* {reconstructed}: This language is not attested according to [[WT:CFI]], and therefore is allowed only in the
{Reconstruction} namespace. All terms in this language are reconstructed, and must be prefixed with
*. Languages such as Proto-Indo-European and Proto-Germanic are in this category.
* {appendix-constructed}: This language is attested but does not meet the additional requirements set out for
constructed languages ([[WT:CFI#Constructed languages]]). Its entries must therefore be in
the Appendix namespace, but they are not reconstructed and therefore should not have *
prefixed in links. Most constructed languages are of this subtype.
Both full languages and etymology-only languages have a {Language} object associated with them, which is fetched using
the {getByCode} function in [[Module:languages]] to convert a language code to a {Language} object. Depending on the
options supplied to this function, etymology-only languages may or may not be accepted, and family codes may be
accepted (returning a {Family} object as described in [[Module:families]]). There are also separate {getByCanonicalName}
functions in [[Module:languages]] and [[Module:etymology languages]] to convert a language's canonical name to a
{Language} object (depending on whether the canonical name refers to a full or etymology-only language).
===Textual representations===
Textual strings belonging to a given language come in several different ''text variants'':
# The ''input text'' is what the user supplies in wikitext, in the parameters to {{tl|m}}, {{tl|l}}, {{tl|ux}},
{{tl|t}}, {{tl|lang}} and the like.
# The ''corrected input text'' is the input text with some corrections and/or normalizations applied, such as
bad-character replacements for certain languages, like replacing `l` or `1` to [[palochka]] in some languages written
in Cyrillic. (FIXME: This currently goes under the name ''display text'' but that will be repurposed below. Also,
[[User:Surjection]] suggests renaming this to ''normalized input text'', but "normalized" is used in a different sense
in [[Module:usex]].)
# The ''display text'' is the text in the form as it will be displayed to the user. This is what appears in headwords,
in usexes, in displayed internal links, etc. This can include accent marks that are removed to form the stripped
display text (see below), as well as embedded bracketed links that are variously processed further. The display text
is generated from the corrected input text by applying language-specific transformations; for most languages, there
will be no such transformations. The general reason for having a difference between input and display text is to allow
for extra information in the input text that is not displayed to the user but is sent to the transliteration module.
Note that having different display and input text is only supported currently through special-casing but will be
generalized. Examples of transformations are: (1) Removing the {{cd|^}} that is used in certain East Asian (and
possibly other unicameral) languages to indicate capitalization of the transliteration (which is currently
special-cased); (2) for Korean, removing or otherwise processing hyphens (which is currently special-cased); (3) for
Arabic, removing a ''sukūn'' diacritic placed over a ''tāʔ marbūṭa'' (like this: ةْ) to indicate that the
''tāʔ marbūṭa'' is pronounced and transliterated as /t/ instead of being silent [NOTE, NOT IMPLEMENTED YET]; (4) for
Thai and Khmer, converting space-separated words to bracketed words and resolving respelling substitutions such as
`[กรีน/กฺรีน]`, which indicate how to transliterate given words [NOTE, NOT IMPLEMENTED YET except in language-specific
templates like {{tl|th-usex}}].
## The ''right-resolved display text'' is the result of removing brackets around one-part embedded links and resolving
two-part embedded links into their right-hand components (i.e. converting two-part links into the displayed form).
The process of right-resolution is what happens when you call {{cd|remove_links()}} in [[Module:links]] on some text.
When applied to the display text, it produces exactly what the user sees, without any link markup.
# The ''stripped display text'' is the result of applying diacritic-stripping to the display text.
## The ''left-resolved stripped display text'' [NEED BETTER NAME] is the result of applying left-resolution to the
stripped display text, i.e. similar to right-resolution but resolving two-part embedded links into their left-hand
components (i.e. the linked-to page). If the display text refers to a single page, the resulting of applying
diacritic stripping and left-resolution produces the ''logical pagename''.
# The ''physical pagename text'' is the result of converting the stripped display text into physical page links. If the
stripped display text contains embedded links, the left side of those links is converted into physical page links;
otherwise, the entire text is considered a pagename and converted in the same fashion. The conversion does three
things: (1) converts characters not allowed in pagenames into their "unsupported title" representation, e.g.
{{cd|Unsupported titles/`gt`}} in place of the logical name {{cd|>}}; (2) handles certain special-cased
unsupported-title logical pagenames, such as {{cd|Unsupported titles/Space}} in place of {{cd|[space]}} and
{{cd|Unsupported titles/Ancient Greek dish}} in place of a very long Greek name for a gourmet dish as found in
Aristophanes; (3) converts "mammoth" pagenames such as [[a]] into their appropriate split component, e.g.
[[a/languages A to L]].
# The ''source translit text'' is the text as supplied to the language-specific {{cd|transliterate()}} method. The form
of the source translit text may need to be language-specific, e.g Thai and Khmer will need the corrected input text,
whereas other languages may need to work off the display text. [FIXME: It's still unclear to me how embedded bracketed
links are handled in the existing code.] In general, embedded links need to be right-resolved (see above), but when
this happens is unclear to me [FIXME]. Some languages have a chop-up-and-paste-together scheme that sends parts of the
text through the transliterate mechanism, and for others (those listed with "cont" in {{cd|substitution}} in
[[Module:languages/data]]) they receive the full input text, but preprocessed in certain ways. (The wisdom of this is
still unclear to me.)
# The ''transliterated text'' (or ''transliteration'') is the result of transliterating the source translit text. Unlike
for all the other text variants except the transcribed text, it is always in the Latin script.
# The ''transcribed text'' (or ''transcription'') is the result of transcribing the source translit text, where
"transcription" here means a close approximation to the phonetic form of the language in languages (e.g. Akkadian,
Sumerian, Ancient Egyptian, maybe Tibetan) that have a wide difference between the written letters and spoken form.
Unlike for all the other text variants other than the transliterated text, it is always in the Latin script.
Currently, the transcribed text is always supplied manually be the user; there is no such thing as a
{{cd|transcribe()}} method on language objects.
# The ''sort key'' is the text used in sort keys for determining the placing of pages in categories they belong to. The
sort key is generated from the pagename or a specified ''sort base'' by lowercasing, doing language-specific
transformations and then uppercasing the result. If the sort base is supplied and is generated from input text, it
needs to be converted to display text, have embedded links removed through right-resolution and have
diacritic-stripping applied.
# There are other text variants that occur in usexes (specifically, there are normalized variants of several of the
above text variants), but we can skip them for now.
The following methods exist on {Language} objects to convert between different text variants:
# {correctInputText} (currently called {makeDisplayText}): This converts input text to corrected input text.
# {stripDiacritics}: This converts to stripped display text. [FIXME: This needs some rethinking. In particular,
{stripDiacritics} is sometimes called on input text, corrected input text or display text (in various paths inside of
[[Module:links]], and, in the case of input text, usually from other modules). We need to make sure we don't try to
convert input text to display text twice, but at the same time we need to support calling it directly on input text
since so many modules do this. This means we need to add a parameter indicating whether the passed-in text is input,
corrected input, or display text; if the former two, we call {correctInputText} ourselves.]
# {logicalToPhysical}: This converts logical pagenames to physical pagenames.
# {transliterate}: This appears to convert input text with embedded brackets removed into a transliteration.
[FIXME: This needs some rethinking. In particular, it calls {processDisplayText} on its input, which won't work
for Thai and Khmer, so we may need language-specific flags indicating whether to pass the input text directly to the
language transliterate method. In addition, I'm not sure how embedded links are handled in the existing translit code;
a lot of callers remove the links themselves before calling {transliterate()}, which I assume is wrong.]
# {makeSortKey}: This converts display text (?) to a sort key. [FIXME: Clarify this.]
]==]
local export = {}
local debug_track_module = "Module:debug/track"
local etymology_languages_data_module = "Module:etymology languages/data"
local families_module = "Module:families"
local headword_page_module = "Module:headword/page"
local json_module = "Module:JSON"
local language_like_module = "Module:language-like"
local languages_data_module = "Module:languages/data"
local languages_data_patterns_module = "Module:languages/data/patterns"
local links_data_module = "Module:links/data"
local load_module = "Module:load"
local scripts_module = "Module:scripts"
local scripts_data_module = "Module:scripts/data"
local string_encode_entities_module = "Module:string/encode entities"
local string_pattern_escape_module = "Module:string/patternEscape"
local string_replacement_escape_module = "Module:string/replacementEscape"
local string_utilities_module = "Module:string utilities"
local table_module = "Module:table"
local utilities_module = "Module:utilities"
local wikimedia_languages_module = "Module:wikimedia languages"
local mw = mw
local string = string
local table = table
local char = string.char
local concat = table.concat
local find = string.find
local floor = math.floor
local get_by_code -- Defined below.
local get_data_module_name -- Defined below.
local get_extra_data_module_name -- Defined below.
local getmetatable = getmetatable
local gmatch = string.gmatch
local gsub = string.gsub
local insert = table.insert
local ipairs = ipairs
local is_known_language_tag = mw.language.isKnownLanguageTag
local make_object -- Defined below.
local match = string.match
local next = next
local pairs = pairs
local remove = table.remove
local require = require
local select = select
local setmetatable = setmetatable
local sub = string.sub
local type = type
local unstrip = mw.text.unstrip
-- Loaded as needed by findBestScript.
local Hans_chars
local Hant_chars
local function check_object(...)
check_object = require(utilities_module).check_object
return check_object(...)
end
local function debug_track(...)
debug_track = require(debug_track_module)
return debug_track(...)
end
local function decode_entities(...)
decode_entities = require(string_utilities_module).decode_entities
return decode_entities(...)
end
local function decode_uri(...)
decode_uri = require(string_utilities_module).decode_uri
return decode_uri(...)
end
local function deep_copy(...)
deep_copy = require(table_module).deepCopy
return deep_copy(...)
end
local function encode_entities(...)
encode_entities = require(string_encode_entities_module)
return encode_entities(...)
end
local function get_L2_sort_key(...)
get_L2_sort_key = require(headword_page_module).get_L2_sort_key
return get_L2_sort_key(...)
end
local function get_script(...)
get_script = require(scripts_module).getByCode
return get_script(...)
end
local function find_best_script_without_lang(...)
find_best_script_without_lang = require(scripts_module).findBestScriptWithoutLang
return find_best_script_without_lang(...)
end
local function get_family(...)
get_family = require(families_module).getByCode
return get_family(...)
end
local function get_plaintext(...)
get_plaintext = require(utilities_module).get_plaintext
return get_plaintext(...)
end
local function get_wikimedia_lang(...)
get_wikimedia_lang = require(wikimedia_languages_module).getByCode
return get_wikimedia_lang(...)
end
local function keys_to_list(...)
keys_to_list = require(table_module).keysToList
return keys_to_list(...)
end
local function list_to_set(...)
list_to_set = require(table_module).listToSet
return list_to_set(...)
end
local function load_data(...)
load_data = require(load_module).load_data
return load_data(...)
end
local function make_family_object(...)
make_family_object = require(families_module).makeObject
return make_family_object(...)
end
local function pattern_escape(...)
pattern_escape = require(string_pattern_escape_module)
return pattern_escape(...)
end
local function replacement_escape(...)
replacement_escape = require(string_replacement_escape_module)
return replacement_escape(...)
end
local function safe_require(...)
safe_require = require(load_module).safe_require
return safe_require(...)
end
local function shallow_copy(...)
shallow_copy = require(table_module).shallowCopy
return shallow_copy(...)
end
local function split(...)
split = require(string_utilities_module).split
return split(...)
end
local function to_json(...)
to_json = require(json_module).toJSON
return to_json(...)
end
local function u(...)
u = require(string_utilities_module).char
return u(...)
end
local function ugsub(...)
ugsub = require(string_utilities_module).gsub
return ugsub(...)
end
local function ulen(...)
ulen = require(string_utilities_module).len
return ulen(...)
end
local function ulower(...)
ulower = require(string_utilities_module).lower
return ulower(...)
end
local function umatch(...)
umatch = require(string_utilities_module).match
return umatch(...)
end
local function uupper(...)
uupper = require(string_utilities_module).upper
return uupper(...)
end
local function track(page)
debug_track("languages/" .. page)
return true
end
local function normalize_code(code)
return load_data(languages_data_module).aliases[code] or code
end
local function check_inputs(self, check, default, ...)
local n = select("#", ...)
if n == 0 then
return false
end
local ret = check(self, (...))
if ret ~= nil then
return ret
elseif n > 1 then
local inputs = {...}
for i = 2, n do
ret = check(self, inputs[i])
if ret ~= nil then
return ret
end
end
end
return default
end
local function make_link(self, target, display)
local prefix, main
if self:getFamilyCode() == "qfa-sub" then
prefix, main = display:match("^(the )(.*)")
if not prefix then
prefix, main = display:match("^(a )(.*)")
end
end
return (prefix or "") .. "[[" .. target .. "|" .. (main or display) .. "]]"
end
-- Convert risky characters to HTML entities, which minimizes interference once returned (e.g. for "sms:a", "<!-- -->" etc.).
local function escape_risky_characters(text)
-- Spacing characters in isolation generally need to be escaped in order to be properly processed by the MediaWiki software.
if umatch(text, "^%s*$") then
return encode_entities(text, text)
end
return encode_entities(text, "!#%&*+/:;<=>?@[\\]_{|}")
end
-- Temporarily convert various formatting characters to PUA to prevent them from being disrupted by the substitution process.
local function doTempSubstitutions(text, subbedChars, keepCarets, noTrim)
-- Clone so that we don't insert any extra patterns into the table in package.loaded. For some reason, using require seems to keep memory use down; probably because the table is always cloned.
local patterns = shallow_copy(require(languages_data_patterns_module))
if keepCarets then
insert(patterns, "((\\+)%^)")
insert(patterns, "((%^))")
end
-- Ensure any whitespace at the beginning and end is temp substituted, to prevent it from being accidentally trimmed. We only want to trim any final spaces added during the substitution process (e.g. by a module), which means we only do this during the first round of temp substitutions.
if not noTrim then
insert(patterns, "^([\128-\191\244]*(%s+))")
insert(patterns, "((%s+)[\128-\191\244]*)$")
end
-- Pre-substitution, of "[[" and "]]", which makes pattern matching more accurate.
text = gsub(text, "%f[%[]%[%[", "\1"):gsub("%f[%]]%]%]", "\2")
local i = #subbedChars
for _, pattern in ipairs(patterns) do
-- Patterns ending in \0 stand are for things like "[[" or "]]"), so the inserted PUA are treated as breaks between terms by modules that scrape info from pages.
local term_divider
pattern = gsub(pattern, "%z$", function(divider)
term_divider = divider == "\0"
return ""
end)
text = gsub(text, pattern, function(...)
local m = {...}
local m1New = m[1]
for k = 2, #m do
local n = i + k - 1
subbedChars[n] = m[k]
local byte2 = floor(n / 4096) % 64 + (term_divider and 128 or 136)
local byte3 = floor(n / 64) % 64 + 128
local byte4 = n % 64 + 128
m1New = gsub(m1New, pattern_escape(m[k]), "\244" .. char(byte2) .. char(byte3) .. char(byte4), 1)
end
i = i + #m - 1
return m1New
end)
end
text = gsub(text, "\1", "%[%["):gsub("\2", "%]%]")
return text, subbedChars
end
-- Reinsert any formatting that was temporarily substituted.
local function undoTempSubstitutions(text, subbedChars)
for i = 1, #subbedChars do
local byte2 = floor(i / 4096) % 64 + 128
local byte3 = floor(i / 64) % 64 + 128
local byte4 = i % 64 + 128
text = gsub(text, "\244[" .. char(byte2) .. char(byte2+8) .. "]" .. char(byte3) .. char(byte4),
replacement_escape(subbedChars[i]))
end
text = gsub(text, "\1", "%[%["):gsub("\2", "%]%]")
return text
end
-- Check if the raw text is an unsupported title, and if so return that. Otherwise, remove HTML entities. We do the pre-conversion to avoid loading the unsupported title list unnecessarily.
local function checkNoEntities(self, text)
local textNoEnc = decode_entities(text)
if textNoEnc ~= text and load_data(links_data_module).unsupported_titles[text] then
return text
else
return textNoEnc
end
end
-- If no script object is provided (or if it's invalid or None), get one.
local function checkScript(text, self, sc)
if not check_object("script", true, sc) or sc:getCode() == "None" then
return self:findBestScript(text)
end
return sc
end
local function normalize(text, sc)
text = sc:fixDiscouragedSequences(text)
return sc:toFixedNFD(text)
end
-- Subfunction of iterateSectionSubstitutions(). Process an individual chunk of text according to the specifications in
-- `substitution_data`. The input parameters are all as in the documentation of iterateSectionSubstitutions() except for
-- `recursed`, which is set to true if we called ourselves recursively to process a script-specific setting or
-- script-wide fallback. Returns two values: the processed text and the actual substitution data used to do the
-- substitutions (same as the `actual_substitution_data` return value to iterateSectionSubstitutions()).
local function doSubstitutions(self, text, sc, substitution_data, data_field, function_name, recursed)
-- BE CAREFUL in this function because the value at any level can be `false`, which causes no processing to be done
-- and blocks any further fallback processing.
local actual_substitution_data = substitution_data
-- If there are language-specific substitutes given in the data module, use those.
if type(substitution_data) == "table" then
-- If a script is specified, run this function with the script-specific data before continuing.
local sc_code = sc:getCode()
local has_substitution_data = false
if substitution_data[sc_code] ~= nil then
has_substitution_data = true
if substitution_data[sc_code] then
text, actual_substitution_data = doSubstitutions(self, text, sc, substitution_data[sc_code], data_field,
function_name, true)
end
-- Hant, Hans and Hani are usually treated the same, so add a special case to avoid having to specify each one
-- separately.
elseif sc_code:match("^Han") and substitution_data.Hani ~= nil then
has_substitution_data = true
if substitution_data.Hani then
text, actual_substitution_data = doSubstitutions(self, text, sc, substitution_data.Hani, data_field,
function_name, true)
end
-- Substitution data with key 1 in the outer table may be given as a fallback.
elseif substitution_data[1] ~= nil then
has_substitution_data = true
if substitution_data[1] then
text, actual_substitution_data = doSubstitutions(self, text, sc, substitution_data[1], data_field,
function_name, true)
end
end
-- Iterate over all strings in the "from" subtable, and gsub with the corresponding string in "to". We work with
-- the NFD decomposed forms, as this simplifies many substitutions.
if substitution_data.from then
has_substitution_data = true
for i, from in ipairs(substitution_data.from) do
-- Normalize each loop, to ensure multi-stage substitutions work correctly.
text = sc:toFixedNFD(text)
text = ugsub(text, sc:toFixedNFD(from), substitution_data.to[i] or "")
end
end
if substitution_data.remove_diacritics then
has_substitution_data = true
text = sc:toFixedNFD(text)
-- Convert exceptions to PUA.
local remove_exceptions, substitutes = substitution_data.remove_exceptions
if remove_exceptions then
substitutes = {}
local i = 0
for _, exception in ipairs(remove_exceptions) do
exception = sc:toFixedNFD(exception)
text = ugsub(text, exception, function(m)
i = i + 1
local subst = u(0x80000 + i)
substitutes[subst] = m
return subst
end)
end
end
-- Strip diacritics.
text = ugsub(text, "[" .. substitution_data.remove_diacritics .. "]", "")
-- Convert exceptions back.
if remove_exceptions then
text = text:gsub("\242[\128-\191]*", substitutes)
end
end
if not has_substitution_data and sc._data[data_field] then
-- If language-specific sort key (etc.) is nil, fall back to script-wide sort key (etc.).
text, actual_substitution_data = doSubstitutions(self, text, sc, sc._data[data_field], data_field,
function_name, true)
end
elseif type(substitution_data) == "string" then
-- If there is a dedicated function module, use that.
local module = safe_require("Module:" .. substitution_data)
if module then
-- TODO: translit functions should take objects, not codes.
-- TODO: translit functions should be called with form NFD.
if function_name == "tr" then
if not module[function_name] then
error(("Internal error: Module [[%s]] has no function named 'tr'"):format(substitution_data))
end
text = module[function_name](text, self._code, sc:getCode())
elseif function_name == "stripDiacritics" then
-- FIXME, get rid of this arm after renaming makeEntryName -> stripDiacritics.
if module[function_name] then
text = module[function_name](sc:toFixedNFD(text), self, sc)
elseif module.makeEntryName then
text = module.makeEntryName(sc:toFixedNFD(text), self, sc)
else
error(("Internal error: Module [[%s]] has no function named 'stripDiacritics' or 'makeEntryName'"
):format(substitution_data))
end
else
if not module[function_name] then
error(("Internal error: Module [[%s]] has no function named '%s'"):format(
substitution_data, function_name))
end
text = module[function_name](sc:toFixedNFD(text), self, sc)
end
else
error("Substitution data '" .. substitution_data .. "' does not match an existing module.")
end
elseif substitution_data == nil and sc._data[data_field] then
-- If language-specific sort key (etc.) is nil, fall back to script-wide sort key (etc.).
text, actual_substitution_data = doSubstitutions(self, text, sc, sc._data[data_field], data_field,
function_name, true)
end
-- Don't normalize to NFC if this is the inner loop or if a module returned nil.
if recursed or not text then
return text, actual_substitution_data
end
-- Fix any discouraged sequences created during the substitution process, and normalize into the final form.
return sc:toFixedNFC(sc:fixDiscouragedSequences(text)), actual_substitution_data
end
-- Split the text into sections, based on the presence of temporarily substituted formatting characters, then iterate
-- over each section to apply substitutions (e.g. transliteration or diacritic stripping). This avoids putting PUA
-- characters through language-specific modules, which may be unequipped for them. This function is passed the following
-- values:
-- * `self` (the Language object);
-- * `text` (the text to process);
-- * `sc` (the script of the text, which must be specified; callers should call checkScript() as needed to autodetect the
-- script of the text if not given explicitly by the user);
-- * `subbedChars` (an array of the same length as the text, indicating which characters have been substituted and by
-- what, or {nil} if no substitutions are to happen);
-- * `keepCarets` (DOCUMENT ME);
-- * `substitution_data` (the data indicating which substitutions to apply, taken directly from `data_field` in the
-- language's data structure in a submodule of [[Module:languages/data]]);
-- * `data_field` (the data field from which `substitution_data` was fetched, such as "sort_key" or "strip_diacritics");
-- * `function_name` (the name of the function to call to do the substitution, in case `substitution_data` specifies a
-- module to do the substitution);
-- * `notrim` (don't trim whitespace at the edges of `text`; set when computing the sort key, because whitespace at the
-- beginning of a sort key is significant and causes the resulting page to be sorted at the beginning of the category
-- it's in).
-- Returns three values:
-- (1) the processed text;
-- (2) the value of `subbedChars` that was passed in, possibly modified with additional character substitutions; will be
-- {nil} if {nil} was passed in;
-- (3) the actual substitution data that was used to apply substitutions to `text`; this may be different from the value
-- of `substitution_data` passed in if that value recursively specified script-specific substitutions or if no
-- substitution data could be found in the language-specific data (e.g. {nil} was passed in or a structure was passed
-- in that had no setting for the script given in `sc`), but a script-wide fallback value was set; currently it is
-- only used by makeSortKey().
local function iterateSectionSubstitutions(self, text, sc, subbedChars, keepCarets, substitution_data, data_field,
function_name, notrim)
local sections
-- See [[Module:languages/data]].
if not find(text, "\244") or load_data(languages_data_module).substitution[self._code] == "cont" then
sections = {text}
else
sections = split(text, "\244[\128-\143][\128-\191]*", true)
end
local actual_substitution_data
for _, section in ipairs(sections) do
-- Don't bother processing empty strings or whitespace (which may also not be handled well by dedicated
-- modules).
if gsub(section, "%s+", "") ~= "" then
local sub, this_actual_substitution_data = doSubstitutions(self, section, sc, substitution_data, data_field,
function_name)
actual_substitution_data = this_actual_substitution_data
-- Second round of temporary substitutions, in case any formatting was added by the main substitution
-- process. However, don't do this if the section contains formatting already (as it would have had to have
-- been escaped to reach this stage, and therefore should be given as raw text).
if sub and subbedChars then
local noSub
for _, pattern in ipairs(require(languages_data_patterns_module)) do
if match(section, pattern .. "%z?") then
noSub = true
end
end
if not noSub then
sub, subbedChars = doTempSubstitutions(sub, subbedChars, keepCarets, true)
end
end
if not sub then
text = sub
break
end
text = sub and gsub(text, pattern_escape(section), replacement_escape(sub), 1) or text
end
end
if not notrim then
-- Trim, unless there are only spacing characters, while ignoring any final formatting characters.
-- Do not trim sort keys because spaces at the beginning are significant.
text = text and text:gsub("^([\128-\191\244]*)%s+(%S)", "%1%2"):gsub("(%S)%s+([\128-\191\244]*)$", "%1%2") or
nil
end
return text, subbedChars, actual_substitution_data
end
-- Process carets (and any escapes). Default to simple removal, if no pattern/replacement is given.
local function processCarets(text, pattern, repl)
local rep
repeat
text, rep = gsub(text, "\\\\(\\*^)", "\3%1")
until rep == 0
return (text:gsub("\\^", "\4")
:gsub(pattern or "%^", repl or "")
:gsub("\3", "\\")
:gsub("\4", "^"))
end
-- Remove carets if they are used to capitalize parts of transliterations (unless they have been escaped).
local function removeCarets(text, sc)
if not sc:hasCapitalization() and sc:isTransliterated() and text:find("^", 1, true) then
return processCarets(text)
else
return text
end
end
local Language = {}
--[==[Returns the language code of the language. Example: {{code|lua|"fr"}} for French.]==]
function Language:getCode()
return self._code
end
--[==[Returns the canonical name of the language. This is the name used to represent that language on Wiktionary, and is guaranteed to be unique to that language alone. Example: {{code|lua|"French"}} for French.]==]
function Language:getCanonicalName()
local name = self._name
if name == nil then
name = self._data[1]
self._name = name
end
return name
end
--[==[
Return the display form of the language. The display form of a language, family or script is the form it takes when
appearing as the <code><var>source</var></code> in categories such as <code>English terms derived from
<var>source</var></code> or <code>English given names from <var>source</var></code>, and is also the displayed text
in {makeCategoryLink()} links. For full and etymology-only languages, this is the same as the canonical name, but
for families, it reads <code>"<var>name</var> languages"</code> (e.g. {"Indo-Iranian languages"}), and for scripts,
it reads <code>"<var>name</var> script"</code> (e.g. {"Arabic script"}).
]==]
function Language:getDisplayForm()
local form = self._displayForm
if form == nil then
form = self:getCanonicalName()
-- Add article and " substrate" to substrates that lack them.
if self:getFamilyCode() == "qfa-sub" then
if not (sub(form, 1, 4) == "the " or sub(form, 1, 2) == "a ") then
form = "a " .. form
end
if not match(form, " [Ss]ubstrate") then
form = form .. " substrate"
end
end
self._displayForm = form
end
return form
end
--[==[Returns the value which should be used in the HTML lang= attribute for tagged text in the language.]==]
function Language:getHTMLAttribute(sc, region)
local code = self._code
if not find(code, "-", 1, true) then
return code .. "-" .. sc:getCode() .. (region and "-" .. region or "")
end
local parent = self:getParent()
region = region or match(code, "%f[%u][%u-]+%f[%U]")
if parent then
return parent:getHTMLAttribute(sc, region)
end
-- TODO: ISO family codes can also be used.
return "mis-" .. sc:getCode() .. (region and "-" .. region or "")
end
--[==[Returns a table of the aliases that the language is known by, excluding the canonical name. Aliases are synonyms for the language in question. The names are not guaranteed to be unique, in that sometimes more than one language is known by the same name. Example: {{code|lua|{"High German", "New High German", "Deutsch"} }} for [[:Category:German language|German]].]==]
function Language:getAliases()
self:loadInExtraData()
return require(language_like_module).getAliases(self)
end
--[==[
Return a table of the known subvarieties of a given language, excluding subvarieties that have been given
explicit etymology-only language codes. The names are not guaranteed to be unique, in that sometimes a given name
refers to a subvariety of more than one language. Example: {{code|lua|{"Southern Aymara", "Central Aymara"} }} for
[[:Category:Aymara language|Aymara]]. Note that the returned value can have nested tables in it, when a subvariety
goes by more than one name. Example: {{code|lua|{"North Azerbaijani", "South Azerbaijani", {"Afshar", "Afshari",
"Afshar Azerbaijani", "Afchar"}, {"Qashqa'i", "Qashqai", "Kashkay"}, "Sonqor"} }} for
[[:Category:Azerbaijani language|Azerbaijani]]. Here, for example, Afshar, Afshari, Afshar Azerbaijani and Afchar
all refer to the same subvariety, whose preferred name is Afshar (the one listed first). To avoid a return value
with nested tables in it, specify a non-{{code|lua|nil}} value for the <code>flatten</code> parameter; in that case,
the return value would be {{code|lua|{"North Azerbaijani", "South Azerbaijani", "Afshar", "Afshari",
"Afshar Azerbaijani", "Afchar", "Qashqa'i", "Qashqai", "Kashkay", "Sonqor"} }}.
]==]
function Language:getVarieties(flatten)
self:loadInExtraData()
return require(language_like_module).getVarieties(self, flatten)
end
--[==[Returns a table of the "other names" that the language is known by, which are listed in the <code>otherNames</code> field. It should be noted that the <code>otherNames</code> field itself is deprecated, and entries listed there should eventually be moved to either <code>aliases</code> or <code>varieties</code>.]==]
function Language:getOtherNames() -- To be eventually removed, once there are no more uses of the `otherNames` field.
self:loadInExtraData()
return require(language_like_module).getOtherNames(self)
end
--[==[
Return a combined table of the canonical name, aliases, varieties and other names of a given language.]==]
function Language:getAllNames()
self:loadInExtraData()
return require(language_like_module).getAllNames(self)
end
--[==[Returns a table of types as a lookup table (with the types as keys).
The possible types are
* {language}: This is a language, either full or etymology-only.
* {full}: This is a "full" (not etymology-only) language, i.e. the union of {regular}, {reconstructed} and
{appendix-constructed}. Note that the types {full} and {etymology-only} also exist for families, so if you
want to check specifically for a full language and you have an object that might be a family, you should
use {{lua|hasType("language", "full")}} and not simply {{lua|hasType("full")}}.
* {etymology-only}: This is an etymology-only (not full) language, whose parent is another etymology-only
language or a full language. Note that the types {full} and {etymology-only} also exist for
families, so if you want to check specifically for an etymology-only language and you have an
object that might be a family, you should use {{lua|hasType("language", "etymology-only")}}
and not simply {{lua|hasType("etymology-only")}}.
* {regular}: This indicates a full language that is attested according to [[WT:CFI]] and therefore permitted
in the main namespace. There may also be reconstructed terms for the language, which are placed in
the {Reconstruction} namespace and must be prefixed with * to indicate a reconstruction. Most full
languages are natural (not constructed) languages, but a few constructed languages (e.g. Esperanto
and Volapük, among others) are also allowed in the mainspace and considered regular languages.
* {reconstructed}: This language is not attested according to [[WT:CFI]], and therefore is allowed only in the
{Reconstruction} namespace. All terms in this language are reconstructed, and must be prefixed
with *. Languages such as Proto-Indo-European and Proto-Germanic are in this category.
* {appendix-constructed}: This language is attested but does not meet the additional requirements set out for
constructed languages ([[WT:CFI#Constructed languages]]). Its entries must therefore
be in the Appendix namespace, but they are not reconstructed and therefore should
not have * prefixed in links.
]==]
function Language:getTypes()
local types = self._types
if types == nil then
types = {language = true}
if self:getFullCode() == self._code then
types.full = true
else
types["etymology-only"] = true
end
for t in gmatch(self._data.type, "[^,]+") do
types[t] = true
end
self._types = types
end
return types
end
--[==[Given a list of types as strings, returns true if the language has all of them.]==]
function Language:hasType(...)
Language.hasType = require(language_like_module).hasType
return self:hasType(...)
end
--[==[Returns a table containing <code>WikimediaLanguage</code> objects (see [[Module:wikimedia languages]]), which represent languages and their codes as they are used in Wikimedia projects for interwiki linking and such. More than one object may be returned, as a single Wiktionary language may correspond to multiple Wikimedia languages. For example, Wiktionary's single code <code>sh</code> (Serbo-Croatian) maps to four Wikimedia codes: <code>sh</code> (Serbo-Croatian), <code>bs</code> (Bosnian), <code>hr</code> (Croatian) and <code>sr</code> (Serbian).
The code for the Wikimedia language is retrieved from the <code>wikimedia_codes</code> property in the data modules. If that property is not present, the code of the current language is used. If none of the available codes is actually a valid Wikimedia code, an empty table is returned.]==]
function Language:getWikimediaLanguages()
local wm_langs = self._wikimediaLanguageObjects
if wm_langs == nil then
local codes = self:getWikimediaLanguageCodes()
wm_langs = {}
for i = 1, #codes do
wm_langs[i] = get_wikimedia_lang(codes[i])
end
self._wikimediaLanguageObjects = wm_langs
end
return wm_langs
end
function Language:getWikimediaLanguageCodes()
local wm_langs = self._wikimediaLanguageCodes
if wm_langs == nil then
wm_langs = self._data.wikimedia_codes
if wm_langs then
wm_langs = split(wm_langs, ",", true, true)
else
local code = self._code
if is_known_language_tag(code) then
wm_langs = {code}
else
-- Inherit, but only if no codes are specified in the data *and*
-- the language code isn't a valid Wikimedia language code.
local parent = self:getParent()
wm_langs = parent and parent:getWikimediaLanguageCodes() or {}
end
end
self._wikimediaLanguageCodes = wm_langs
end
return wm_langs
end
--[==[
Returns the name of the Wikipedia article for the language. `project` specifies the language and project to retrieve
the article from, defaulting to {"enwiki"} for the English Wikipedia. Normally if specified it should be the project
code for a specific-language Wikipedia e.g. "zhwiki" for the Chinese Wikipedia, but it can be any project, including
non-Wikipedia ones. If the project is the English Wikipedia and the property {wikipedia_article} is present in the data
module it will be used first. In all other cases, a sitelink will be generated from {:getWikidataItem} (if set). The
resulting value (or lack of value) is cached so that subsequent calls are fast. If no value could be determined, and
`noCategoryFallback` is {false}, {:getCategoryName} is used as fallback; otherwise, {nil} is returned. Note that if
`noCategoryFallback` is {nil} or omitted, it defaults to {false} if the project is the English Wikipedia, otherwise
to {true}. In other words, under normal circumstances, if the English Wikipedia article couldn't be retrieved, the
return value will fall back to a link to the language's category, but this won't normally happen for any other project.
]==]
function Language:getWikipediaArticle(noCategoryFallback, project)
Language.getWikipediaArticle = require(language_like_module).getWikipediaArticle
return self:getWikipediaArticle(noCategoryFallback, project)
end
function Language:makeWikipediaLink()
return make_link(self, "w:" .. self:getWikipediaArticle(), self:getCanonicalName())
end
--[==[Returns the name of the Wikimedia Commons category page for the language.]==]
function Language:getCommonsCategory()
Language.getCommonsCategory = require(language_like_module).getCommonsCategory
return self:getCommonsCategory()
end
--[==[Returns the Wikidata item id for the language or <code>nil</code>. This corresponds to the the second field in the data modules.]==]
function Language:getWikidataItem()
Language.getWikidataItem = require(language_like_module).getWikidataItem
return self:getWikidataItem()
end
--[==[Returns a table of <code>Script</code> objects for all scripts that the language is written in. See [[Module:scripts]].]==]
function Language:getScripts()
local scripts = self._scriptObjects
if scripts == nil then
local codes = self:getScriptCodes()
if codes[1] == "All" then
scripts = load_data(scripts_data_module)
else
scripts = {}
for i = 1, #codes do
scripts[i] = get_script(codes[i])
end
end
self._scriptObjects = scripts
end
return scripts
end
--[==[Returns the table of script codes in the language's data file.]==]
function Language:getScriptCodes()
local scripts = self._scriptCodes
if scripts == nil then
scripts = self._data[4]
if scripts then
local codes, n = {}, 0
for code in gmatch(scripts, "[^,]+") do
n = n + 1
-- Special handling of "Hants", which represents "Hani", "Hant" and "Hans" collectively.
if code == "Hants" then
codes[n] = "Hani"
codes[n + 1] = "Hant"
codes[n + 2] = "Hans"
n = n + 2
else
codes[n] = code
end
end
scripts = codes
else
scripts = {"None"}
end
self._scriptCodes = scripts
end
return scripts
end
--[==[Given some text, this function iterates through the scripts of a given language and tries to find the script that best matches the text. It returns a {{code|lua|Script}} object representing the script. If no match is found at all, it returns the {{code|lua|None}} script object.]==]
function Language:findBestScript(text, forceDetect)
if not text or text == "" or text == "-" then
return get_script("None")
end
-- Differs from table returned by getScriptCodes, as Hants is not normalized into its constituents.
local codes = self._bestScriptCodes
if codes == nil then
codes = self._data[4]
codes = codes and split(codes, ",", true, true) or {"None"}
self._bestScriptCodes = codes
end
local first_sc = codes[1]
if first_sc == "All" then
return find_best_script_without_lang(text)
end
local codes_len = #codes
if not (forceDetect or first_sc == "Hants" or codes_len > 1) then
first_sc = get_script(first_sc)
local charset = first_sc.characters
return charset and umatch(text, "[" .. charset .. "]") and first_sc or get_script("None")
end
-- Remove all formatting characters.
text = get_plaintext(text)
-- Remove all spaces and any ASCII punctuation. Some non-ASCII punctuation is script-specific, so can't be removed.
text = ugsub(text, "[%s!\"#%%&'()*,%-./:;?@[\\%]_{}]+", "")
if #text == 0 then
return get_script("None")
end
-- Try to match every script against the text,
-- and return the one with the most matching characters.
local bestcount, bestscript, length = 0
for i = 1, codes_len do
local sc = codes[i]
-- Special case for "Hants", which is a special code that represents whichever of "Hant" or "Hans" best matches, or "Hani" if they match equally. This avoids having to list all three. In addition, "Hants" will be treated as the best match if there is at least one matching character, under the assumption that a Han script is desirable in terms that contain a mix of Han and other scripts (not counting those which use Jpan or Kore).
if sc == "Hants" then
local Hani = get_script("Hani")
if not Hant_chars then
Hant_chars = load_data("Module:zh/data/ts")
Hans_chars = load_data("Module:zh/data/st")
end
local t, s, found = 0, 0
-- This is faster than using mw.ustring.gmatch directly.
for ch in gmatch((ugsub(text, "[" .. Hani.characters .. "]", "\255%0")), "\255(.[\128-\191]*)") do
found = true
if Hant_chars[ch] then
t = t + 1
if Hans_chars[ch] then
s = s + 1
end
elseif Hans_chars[ch] then
s = s + 1
else
t, s = t + 1, s + 1
end
end
if found then
if t == s then
return Hani
end
return get_script(t > s and "Hant" or "Hans")
end
else
sc = get_script(sc)
if not length then
length = ulen(text)
end
-- Count characters by removing everything in the script's charset and comparing to the original length.
local charset = sc.characters
local count = charset and length - ulen((ugsub(text, "[" .. charset .. "]+", ""))) or 0
if count >= length then
return sc
elseif count > bestcount then
bestcount = count
bestscript = sc
end
end
end
-- Return best matching script, or otherwise None.
return bestscript or get_script("None")
end
--[==[Returns a <code>Family</code> object for the language family that the language belongs to. See [[Module:families]].]==]
function Language:getFamily()
local family = self._familyObject
if family == nil then
family = self:getFamilyCode()
-- If the value is nil, it's cached as false.
family = family and get_family(family) or false
self._familyObject = family
end
return family or nil
end
--[==[Returns the family code in the language's data file.]==]
function Language:getFamilyCode()
local family = self._familyCode
if family == nil then
-- If the value is nil, it's cached as false.
family = self._data[3] or false
self._familyCode = family
end
return family or nil
end
function Language:getFamilyName()
local family = self._familyName
if family == nil then
family = self:getFamily()
-- If the value is nil, it's cached as false.
family = family and family:getCanonicalName() or false
self._familyName = family
end
return family or nil
end
do
local function check_family(self, family)
if type(family) == "table" then
family = family:getCode()
end
if self:getFamilyCode() == family then
return true
end
local self_family = self:getFamily()
if self_family:inFamily(family) then
return true
-- If the family isn't a real family (e.g. creoles) check any ancestors.
elseif self_family:inFamily("qfa-not") then
local ancestors = self:getAncestors()
for _, ancestor in ipairs(ancestors) do
if ancestor:inFamily(family) then
return true
end
end
end
end
--[==[Check whether the language belongs to `family` (which can be a family code or object). A list of objects can be given in place of `family`; in that case, return true if the language belongs to any of the specified families. Note that some languages (in particular, certain creoles) can have multiple immediate ancestors potentially belonging to different families; in that case, return true if the language belongs to any of the specified families.]==]
function Language:inFamily(...)
if self:getFamilyCode() == nil then
return false
end
return check_inputs(self, check_family, false, ...)
end
end
function Language:getParent()
local parent = self._parentObject
if parent == nil then
parent = self:getParentCode()
-- If the value is nil, it's cached as false.
parent = parent and get_by_code(parent, nil, true, true) or false
self._parentObject = parent
end
return parent or nil
end
function Language:getParentCode()
local parent = self._parentCode
if parent == nil then
-- If the value is nil, it's cached as false.
parent = self._data.parent or false
self._parentCode = parent
end
return parent or nil
end
function Language:getParentName()
local parent = self._parentName
if parent == nil then
parent = self:getParent()
-- If the value is nil, it's cached as false.
parent = parent and parent:getCanonicalName() or false
self._parentName = parent
end
return parent or nil
end
function Language:getParentChain()
local chain = self._parentChain
if chain == nil then
chain = {}
local parent, n = self:getParent(), 0
while parent do
n = n + 1
chain[n] = parent
parent = parent:getParent()
end
self._parentChain = chain
end
return chain
end
do
local function check_lang(self, lang)
for _, parent in ipairs(self:getParentChain()) do
if (type(lang) == "string" and lang or lang:getCode()) == parent:getCode() then
return true
end
end
end
function Language:hasParent(...)
return check_inputs(self, check_lang, false, ...)
end
end
--[==[
If the language is etymology-only, this iterates through parents until a full language or family is found, and the
corresponding object is returned. If the language is a full language, then it simply returns itself.
]==]
function Language:getFull()
local full = self._fullObject
if full == nil then
full = self:getFullCode()
full = full == self._code and self or get_by_code(full)
self._fullObject = full
end
return full
end
--[==[
If the language is an etymology-only language, this iterates through parents until a full language or family is
found, and the corresponding code is returned. If the language is a full language, then it simply returns the
language code.
]==]
function Language:getFullCode()
return self._fullCode or self._code
end
--[==[
If the language is an etymology-only language, this iterates through parents until a full language or family is
found, and the corresponding canonical name is returned. If the language is a full language, then it simply returns
the canonical name of the language.
]==]
function Language:getFullName()
local full = self._fullName
if full == nil then
full = self:getFull():getCanonicalName()
self._fullName = full
end
return full
end
--[==[Returns a table of <code class="nf">Language</code> objects for all languages that this language is directly descended from. Generally this is only a single language, but creoles, pidgins and mixed languages can have multiple ancestors.]==]
function Language:getAncestors()
local ancestors = self._ancestorObjects
if ancestors == nil then
ancestors = {}
local ancestor_codes = self:getAncestorCodes()
if #ancestor_codes > 0 then
for _, ancestor in ipairs(ancestor_codes) do
insert(ancestors, get_by_code(ancestor, nil, true))
end
else
local fam = self:getFamily()
local protoLang = fam and fam:getProtoLanguage() or nil
-- For the cases where the current language is the proto-language
-- of its family, or an etymology-only language that is ancestral to that
-- proto-language, we need to step up a level higher right from the
-- start.
if protoLang and (
protoLang:getCode() == self._code or
(self:hasType("etymology-only") and protoLang:hasAncestor(self))
) then
fam = fam:getFamily()
protoLang = fam and fam:getProtoLanguage() or nil
end
while not protoLang and not (not fam or fam:getCode() == "qfa-not") do
fam = fam:getFamily()
protoLang = fam and fam:getProtoLanguage() or nil
end
insert(ancestors, protoLang)
end
self._ancestorObjects = ancestors
end
return ancestors
end
do
-- Avoid a language being its own ancestor via class inheritance. We only need to check for this if the language has inherited an ancestor table from its parent, because we never want to drop ancestors that have been explicitly set in the data.
-- Recursively iterate over ancestors until we either find self or run out. If self is found, return true.
local function check_ancestor(self, lang)
local codes = lang:getAncestorCodes()
if not codes then
return nil
end
for i = 1, #codes do
local code = codes[i]
if code == self._code then
return true
end
local anc = get_by_code(code, nil, true)
if check_ancestor(self, anc) then
return true
end
end
end
--[==[Returns a table of <code class="nf">Language</code> codes for all languages that this language is directly descended from. Generally this is only a single language, but creoles, pidgins and mixed languages can have multiple ancestors.]==]
function Language:getAncestorCodes()
if self._ancestorCodes then
return self._ancestorCodes
end
local data = self._data
local codes = data.ancestors
if codes == nil then
codes = {}
self._ancestorCodes = codes
return codes
end
codes = split(codes, ",", true, true)
self._ancestorCodes = codes
-- If there are no codes or the ancestors weren't inherited data, there's nothing left to check.
if #codes == 0 or self:getData(false, "raw").ancestors ~= nil then
return codes
end
local i, code = 1
while i <= #codes do
code = codes[i]
if check_ancestor(self, self) then
remove(codes, i)
else
i = i + 1
end
end
return codes
end
end
--[==[Given a list of language objects or codes, returns true if at least one of them is an ancestor. This includes any etymology-only children of that ancestor. If the language's ancestor(s) are etymology-only languages, it will also return true for those language parent(s) (e.g. if Vulgar Latin is the ancestor, it will also return true for its parent, Latin). However, a parent is excluded from this if the ancestor is also ancestral to that parent (e.g. if Classical Persian is the ancestor, Persian would return false, because Classical Persian is also ancestral to Persian).]==]
function Language:hasAncestor(...)
local function iterateOverAncestorTree(node, func, parent_check)
local ancestors = node:getAncestors()
local ancestorsParents = {}
for _, ancestor in ipairs(ancestors) do
-- When checking the parents of the other language, and the ancestor is also a parent, skip to the next ancestor, so that we exclude any etymology-only children of that parent that are not directly related (see below).
local ret = (parent_check or not node:hasParent(ancestor)) and
func(ancestor) or iterateOverAncestorTree(ancestor, func, parent_check)
if ret then
return ret
end
end
-- Check the parents of any ancestors. We don't do this if checking the parents of the other language, so that we exclude any etymology-only children of those parents that are not directly related (e.g. if the ancestor is Vulgar Latin and we are checking New Latin, we want it to return false because they are on different ancestral branches. As such, if we're already checking the parent of New Latin (Latin) we don't want to compare it to the parent of the ancestor (Latin), as this would be a false positive; it should be one or the other).
if not parent_check then
return nil
end
for _, ancestor in ipairs(ancestors) do
local ancestorParents = ancestor:getParentChain()
for _, ancestorParent in ipairs(ancestorParents) do
if ancestorParent:getCode() == self._code or ancestorParent:hasAncestor(ancestor) then
break
else
insert(ancestorsParents, ancestorParent)
end
end
end
for _, ancestorParent in ipairs(ancestorsParents) do
local ret = func(ancestorParent)
if ret then
return ret
end
end
end
local function do_iteration(otherlang, parent_check)
-- otherlang can't be self
if (type(otherlang) == "string" and otherlang or otherlang:getCode()) == self._code then
return false
end
repeat
if iterateOverAncestorTree(
self,
function(ancestor)
return ancestor:getCode() == (type(otherlang) == "string" and otherlang or otherlang:getCode())
end,
parent_check
) then
return true
elseif type(otherlang) == "string" then
otherlang = get_by_code(otherlang, nil, true)
end
otherlang = otherlang:getParent()
parent_check = false
until not otherlang
end
local parent_check = true
for _, otherlang in ipairs{...} do
local ret = do_iteration(otherlang, parent_check)
if ret then
return true
end
end
return false
end
do
local function construct_node(lang, memo)
local branch, ancestors = {lang = lang:getCode()}
memo[lang:getCode()] = branch
for _, ancestor in ipairs(lang:getAncestors()) do
if ancestors == nil then
ancestors = {}
end
insert(ancestors, memo[ancestor:getCode()] or construct_node(ancestor, memo))
end
branch.ancestors = ancestors
return branch
end
function Language:getAncestorChain()
local chain = self._ancestorChain
if chain == nil then
chain = construct_node(self, {})
self._ancestorChain = chain
end
return chain
end
end
function Language:getAncestorChainOld()
local chain = self._ancestorChain
if chain == nil then
chain = {}
local step = self
while true do
local ancestors = step:getAncestors()
step = #ancestors == 1 and ancestors[1] or nil
if not step then
break
end
insert(chain, step)
end
self._ancestorChain = chain
end
return chain
end
local function fetch_descendants(self, fmt)
local descendants, family = {}, self:getFamily()
-- Iterate over all three datasets.
for _, data in ipairs{
require("Module:languages/code to canonical name"),
require("Module:etymology languages/code to canonical name"),
require("Module:families/code to canonical name"),
} do
for code in pairs(data) do
local lang = get_by_code(code, nil, true, true)
-- Test for a descendant. Earlier tests weed out most candidates, while the more intensive tests are only used sparingly.
if (
code ~= self._code and -- Not self.
lang:inFamily(family) and -- In the same family.
(
family:getProtoLanguageCode() == self._code or -- Self is the protolanguage.
self:hasDescendant(lang) or -- Full hasDescendant check.
(lang:getFullCode() == self._code and not self:hasAncestor(lang)) -- Etymology-only child which isn't an ancestor.
)
) then
if fmt == "object" then
insert(descendants, lang)
elseif fmt == "code" then
insert(descendants, code)
elseif fmt == "name" then
insert(descendants, lang:getCanonicalName())
end
end
end
end
return descendants
end
function Language:getDescendants()
local descendants = self._descendantObjects
if descendants == nil then
descendants = fetch_descendants(self, "object")
self._descendantObjects = descendants
end
return descendants
end
function Language:getDescendantCodes()
local descendants = self._descendantCodes
if descendants == nil then
descendants = fetch_descendants(self, "code")
self._descendantCodes = descendants
end
return descendants
end
function Language:getDescendantNames()
local descendants = self._descendantNames
if descendants == nil then
descendants = fetch_descendants(self, "name")
self._descendantNames = descendants
end
return descendants
end
do
local function check_lang(self, lang)
if type(lang) == "string" then
lang = get_by_code(lang, nil, true)
end
if lang:hasAncestor(self) then
return true
end
end
function Language:hasDescendant(...)
return check_inputs(self, check_lang, false, ...)
end
end
local function fetch_children(self, fmt)
local m_etym_data = require(etymology_languages_data_module)
local self_code, children = self._code, {}
for code, lang in pairs(m_etym_data) do
local _lang = lang
repeat
local parent = _lang.parent
if parent == self_code then
if fmt == "object" then
insert(children, get_by_code(code, nil, true))
elseif fmt == "code" then
insert(children, code)
elseif fmt == "name" then
insert(children, lang[1])
end
break
end
_lang = m_etym_data[parent]
until not _lang
end
return children
end
function Language:getChildren()
local children = self._childObjects
if children == nil then
children = fetch_children(self, "object")
self._childObjects = children
end
return children
end
function Language:getChildrenCodes()
local children = self._childCodes
if children == nil then
children = fetch_children(self, "code")
self._childCodes = children
end
return children
end
function Language:getChildrenNames()
local children = self._childNames
if children == nil then
children = fetch_children(self, "name")
self._childNames = children
end
return children
end
function Language:hasChild(...)
local lang = ...
if not lang then
return false
elseif type(lang) == "string" then
lang = get_by_code(lang, nil, true)
end
if lang:hasParent(self) then
return true
end
return self:hasChild(select(2, ...))
end
--[==[Returns the name of the main category of that language. Example: {{code|lua|"French language"}} for French, whose category is at [[:Category:French language]]. Unless optional argument <code>nocap</code> is given, the language name at the beginning of the returned value will be capitalized. This capitalization is correct for category names, but not if the language name is lowercase and the returned value of this function is used in the middle of a sentence.]==]
function Language:getCategoryName(nocap)
local name = self._categoryName
if name == nil then
name = self:getCanonicalName()
-- If a substrate, omit any leading article.
if self:getFamilyCode() == "qfa-sub" then
name = name:gsub("^the ", ""):gsub("^a ", "")
end
-- Only add " language" if a full language.
if self:hasType("full") then
-- Unless the canonical name already ends with "language", "lect" or their derivatives, add " language".
if not (match(name, "[Ll]anguage$") or match(name, "[Ll]ect$")) then
name = name .. " language"
end
end
self._categoryName = name
end
if nocap then
return name
end
return mw.getContentLanguage():ucfirst(name)
end
--[==[Creates a link to the category; the link text is the canonical name.]==]
function Language:makeCategoryLink()
return make_link(self, ":Category:" .. self:getCategoryName(), self:getDisplayForm())
end
function Language:getStandardCharacters(sc)
local standard_chars = self._data.standard_chars
if type(standard_chars) ~= "table" then
return standard_chars
elseif sc and type(sc) ~= "string" then
check_object("script", nil, sc)
sc = sc:getCode()
end
if (not sc) or sc == "None" then
local scripts = {}
for _, script in pairs(standard_chars) do
insert(scripts, script)
end
return concat(scripts)
end
if standard_chars[sc] then
return standard_chars[sc] .. (standard_chars[1] or "")
end
end
--[==[
Strip diacritics from display text `text` (in a language-specific fashion), which is in the script `sc`. If `sc` is
omitted or {nil}, the script is autodetected. This also strips certain punctuation characters from the end and (in the
case of Spanish upside-down question mark and exclamation points) from the beginning; strips any whitespace at the
end of the text or between the text and final stripped punctuation characters; and applies some language-specific
Unicode normalizations to replace discouraged characters with their prescribed alternatives. Return the stripped text.
]==]
function Language:stripDiacritics(text, sc)
if (not text) or text == "" then
return text
end
sc = checkScript(text, self, sc)
text = normalize(text, sc)
-- FIXME, rename makeEntryName to stripDiacritics and get rid of second and third return values
-- everywhere
text, _, _ = iterateSectionSubstitutions(self, text, sc, nil, nil,
self._data.strip_diacritics or self._data.entry_name, "strip_diacritics", "stripDiacritics")
text = umatch(text, "^[¿¡]?(.-[^%s%p].-)%s*[؟?!;՛՜ ՞ ՟?!︖︕।॥။၊་།]?$") or text
return text
end
--[==[
Convert a ''logical'' pagename (the pagename as it appears to the user, after diacritics and punctuation have been
stripped) to a ''physical'' pagename (the pagename as it appears in the MediaWiki database). Reasons for a difference
between the two are (a) unsupported titles such as `[ ]` (with square brackets in them), `#` (pound/hash sign) and
`¯\_(ツ)_/¯` (with underscores), as well as overly long titles of various sorts; (b) "mammoth" pages that are split into
parts (e.g. `a`, which is split into physical pagenames `a/languages A to L` and `a/languages M to Z`). For almost all
purposes, you should work with logical and not physical pagenames. But there are certain use cases that require physical
pagenames, such as checking the existence of a page or retrieving a page's contents.
`pagename` is the logical pagename to be converted. `is_reconstructed_or_appendix` indicates whether the page is in the
`Reconstruction` or `Appendix` namespaces. If it is omitted or has the value {nil}, the pagename is checked for an
initial asterisk, and if found, the page is assumed to be a `Reconstruction` page. Setting a value of `false` or `true`
to `is_reconstructed_or_appendix` disables this check and allows for mainspace pagenames that begin with an asterisk.
]==]
function Language:logicalToPhysical(pagename, is_reconstructed_or_appendix)
-- FIXME: This probably shouldn't happen but it happens when makeEntryName() receives nil.
if pagename == nil then
track("nil-passed-to-logicalToPhysical")
return nil
end
local initial_asterisk
if is_reconstructed_or_appendix == nil then
local pagename_minus_initial_asterisk
initial_asterisk, pagename_minus_initial_asterisk = pagename:match("^(%*)(.*)$")
if pagename_minus_initial_asterisk then
is_reconstructed_or_appendix = true
pagename = pagename_minus_initial_asterisk
elseif self:hasType("appendix-constructed") then
is_reconstructed_or_appendix = true
end
end
if not is_reconstructed_or_appendix then
-- Check if the pagename is a listed unsupported title.
local unsupportedTitles = load_data(links_data_module).unsupported_titles
if unsupportedTitles[pagename] then
return "Unsupported titles/" .. unsupportedTitles[pagename]
end
end
-- Set `unsupported` as true if certain conditions are met.
local unsupported
-- Check if there's an unsupported character. \239\191\189 is the replacement character U+FFFD, which can't be typed
-- directly here due to an abuse filter. Unix-style dot-slash notation is also unsupported, as it is used for
-- relative paths in links, as are 3 or more consecutive tildes. Note: match is faster with magic
-- characters/charsets; find is faster with plaintext.
if (
match(pagename, "[#<>%[%]_{|}]") or
find(pagename, "\239\191\189") or
match(pagename, "%f[^%z/]%.%.?%f[%z/]") or
find(pagename, "~~~")
) then
unsupported = true
-- If it looks like an interwiki link.
elseif find(pagename, ":") then
local prefix = gsub(pagename, "^:*(.-):.*", ulower)
if (
load_data("Module:data/namespaces")[prefix] or
load_data("Module:data/interwikis")[prefix]
) then
unsupported = true
end
end
-- Escape unsupported characters so they can be used in titles. ` is used as a delimiter for this, so a raw use of
-- it in an unsupported title is also escaped here to prevent interference; this is only done with unsupported
-- titles, though, so inclusion won't in itself mean a title is treated as unsupported (which is why it's excluded
-- from the earlier test).
if unsupported then
-- FIXME: This conversion needs to be different for reconstructed pages with unsupported characters. There
-- aren't any currently, but if there ever are, we need to fix this e.g. to put them in something like
-- Reconstruction:Proto-Indo-European/Unsupported titles/`lowbar``num`.
local unsupported_characters = load_data(links_data_module).unsupported_characters
pagename = pagename:gsub("[#<>%[%]_`{|}\239]\191?\189?", unsupported_characters)
:gsub("%f[^%z/]%.%.?%f[%z/]", function(m)
return (gsub(m, "%.", "`period`"))
end)
:gsub("~~~+", function(m)
return (gsub(m, "~", "`tilde`"))
end)
pagename = "Unsupported titles/" .. pagename
elseif not is_reconstructed_or_appendix then
-- Check if this is a mammoth page. If so, which subpage should we link to?
local m_links_data = load_data(links_data_module)
local mammoth_page_type = m_links_data.mammoth_pages[pagename]
if mammoth_page_type then
local canonical_name = self:getFullName()
if canonical_name ~= "Translingual" and canonical_name ~= "English" then
local this_subpage
local L2_sort_key = get_L2_sort_key(canonical_name)
for _, subpage_spec in ipairs(m_links_data.mammoth_page_subpage_types[mammoth_page_type]) do
-- unpack() fails utterly on data loaded using mw.loadData() even if offsets are given
local subpage, pattern = subpage_spec[1], subpage_spec[2]
if pattern == true or L2_sort_key:match(pattern) then
this_subpage = subpage
break
end
end
if not this_subpage then
error(("Internal error: Bad data in mammoth_page_subpage_pages in [[Module:links/data]] for mammoth page %s, type %s; last entry didn't have 'true' in it"):format(
pagename, mammoth_page_type))
end
pagename = pagename .. "/" .. this_subpage
end
end
end
return (initial_asterisk or "") .. pagename
end
--[==[
Strip the diacritics from a display pagename and convert the resulting logical pagename into a physical pagename.
This allows you, for example, to retrieve the contents of the page or check its existence. WARNING: This is deprecated
and will be going away. It is a simple composition of `self:stripDiacritics` and `self:logicalToPhysical`; most callers
only want the former, and if you need both, call them both yourself.
`text` and `sc` are as in `self:stripDiacritics`, and `is_reconstructed_or_appendix` is as in `self:logicalToPhysical`.
]==]
function Language:makeEntryName(text, sc, is_reconstructed_or_appendix)
return self:logicalToPhysical(self:stripDiacritics(text, sc), is_reconstructed_or_appendix)
end
--[==[Generates alternative forms using a specified method, and returns them as a table. If no method is specified, returns a table containing only the input term.]==]
function Language:generateForms(text, sc)
local generate_forms = self._data.generate_forms
if generate_forms == nil then
return {text}
end
sc = checkScript(text, self, sc)
return require("Module:" .. self._data.generate_forms).generateForms(text, self, sc)
end
--[==[Creates a sort key for the given stripped text, following the rules appropriate for the language. This removes
diacritical marks from the stripped text if they are not considered significant for sorting, and may perform some other
changes. Any initial hyphen is also removed, and anything in parentheses is removed as well.
The <code>sort_key</code> setting for each language in the data modules defines the replacements made by this function, or it gives the name of the module that takes the stripped text and returns a sortkey.]==]
function Language:makeSortKey(text, sc)
if (not text) or text == "" then
return text
end
if match(text, "<[^<>]+>") then
track("track HTML tag")
end
-- Remove directional characters, bold, italics, soft hyphens, strip markers and HTML tags.
-- FIXME: Partly duplicated with remove_formatting() in [[Module:links]].
text = ugsub(text, "[\194\173\226\128\170-\226\128\174\226\129\166-\226\129\169]", "")
text = text:gsub("('*)'''(.-'*)'''", "%1%2"):gsub("('*)''(.-'*)''", "%1%2")
text = gsub(unstrip(text), "<[^<>]+>", "")
text = decode_uri(text, "PATH")
text = checkNoEntities(self, text)
-- Remove initial hyphens and * unless the term only consists of spacing + punctuation characters.
text = ugsub(text, "^([-]*)[-־ـ᠊*]+([-]*)(.*[^%s%p].*)", "%1%2%3")
sc = checkScript(text, self, sc)
text = normalize(text, sc)
text = removeCarets(text, sc)
-- For languages with dotted dotless i, ensure that "İ" is sorted as "i", and "I" is sorted as "ı".
if self:hasDottedDotlessI() then
text = gsub(text, "I\204\135", "i") -- decomposed "İ"
:gsub("I", "ı")
text = sc:toFixedNFD(text)
end
-- Convert to lowercase, make the sortkey, then convert to uppercase. Where the language has dotted dotless i, it is
-- usually not necessary to convert "i" to "İ" and "ı" to "I" first, because "I" will always be interpreted as
-- conventional "I" (not dotless "İ") by any sorting algorithms, which will have been taken into account by the
-- sortkey substitutions themselves. However, if no sortkey substitutions have been specified, then conversion is
-- necessary so as to prevent "i" and "ı" both being sorted as "I".
--
-- An exception is made for scripts that (sometimes) sort by scraping page content, as that means they are sensitive
-- to changes in capitalization (as it changes the target page).
if not sc:sortByScraping() then
text = ulower(text)
end
local actual_substitution_data
-- Don't trim whitespace here because it's significant at the beginning of a sort key or sort base.
text, _, actual_substitution_data = iterateSectionSubstitutions(self, text, sc, nil, nil, self._data.sort_key,
"sort_key", "makeSortKey", "notrim")
if not sc:sortByScraping() then
if self:hasDottedDotlessI() and not actual_substitution_data then
text = text:gsub("ı", "I"):gsub("i", "İ")
text = sc:toFixedNFC(text)
end
text = uupper(text)
end
-- Remove parentheses, as long as they are either preceded or followed by something.
text = gsub(text, "(.)[()]+", "%1"):gsub("[()]+(.)", "%1")
text = escape_risky_characters(text)
return text
end
--[==[Create the form used as as a basis for display text and transliteration. FIXME: Rename to correctInputText().]==]
local function processDisplayText(text, self, sc, keepCarets, keepPrefixes)
local subbedChars = {}
text, subbedChars = doTempSubstitutions(text, subbedChars, keepCarets)
text = decode_uri(text, "PATH")
text = checkNoEntities(self, text)
sc = checkScript(text, self, sc)
text = normalize(text, sc)
text, subbedChars = iterateSectionSubstitutions(self, text, sc, subbedChars, keepCarets, self._data.display_text,
"display_text", "makeDisplayText")
text = removeCarets(text, sc)
-- Remove any interwiki link prefixes (unless they have been escaped or this has been disabled).
if find(text, ":") and not keepPrefixes then
local rep
repeat
text, rep = gsub(text, "\\\\(\\*:)", "\3%1")
until rep == 0
text = gsub(text, "\\:", "\4")
while true do
local prefix = gsub(text, "^(.-):.+", function(m1)
return (gsub(m1, "\244[\128-\191]*", ""))
end)
-- Check if the prefix is an interwiki, though ignore capitalised Wiktionary:, which is a namespace.
if not prefix or prefix == text or prefix == "Wiktionary"
or not (load_data("Module:data/interwikis")[ulower(prefix)] or prefix == "") then
break
end
text = gsub(text, "^(.-):(.*)", function(m1, m2)
local ret = {}
for subbedChar in gmatch(m1, "\244[\128-\191]*") do
insert(ret, subbedChar)
end
return concat(ret) .. m2
end)
end
text = gsub(text, "\3", "\\"):gsub("\4", ":")
end
return text, subbedChars
end
--[==[Make the display text (i.e. what is displayed on the page).]==]
function Language:makeDisplayText(text, sc, keepPrefixes)
if not text or text == "" then
return text
end
local subbedChars
text, subbedChars = processDisplayText(text, self, sc, nil, keepPrefixes)
text = escape_risky_characters(text)
return undoTempSubstitutions(text, subbedChars)
end
--[==[Transliterates the text from the given script into the Latin script (see
[[Wiktionary:Transliteration and romanization]]). The language must have the <code>translit</code> property for this to
work; if it is not present, {{code|lua|nil}} is returned.
The <code>sc</code> parameter is handled by the transliteration module, and how it is handled is specific to that
module. Some transliteration modules may tolerate {{code|lua|nil}} as the script, others require it to be one of the
possible scripts that the module can transliterate, and will throw an error if it's not one of them. For this reason,
the <code>sc</code> parameter should always be provided when writing non-language-specific code.
The <code>module_override</code> parameter is used to override the default module that is used to provide the
transliteration. This is useful in cases where you need to demonstrate a particular module in use, but there is no
default module yet, or you want to demonstrate an alternative version of a transliteration module before making it
official. It should not be used in real modules or templates, only for testing. All uses of this parameter are tracked
by [[Wiktionary:Tracking/languages/module_override]].
'''Known bugs''':
* This function assumes {tr(s1) .. tr(s2) == tr(s1 .. s2)}. When this assertion fails, wikitext markups like <nowiki>'''</nowiki> can cause wrong transliterations.
* HTML entities like <code>&apos;</code>, often used to escape wikitext markups, do not work.
]==]
function Language:transliterate(text, sc, module_override)
-- If there is no text, or the language doesn't have transliteration data and there's no override, return nil.
if not text or text == "" or text == "-" then
return text
end
-- If the script is not transliteratable (and no override is given), return nil.
sc = checkScript(text, self, sc)
if not (sc:isTransliterated() or module_override) then
-- temporary tracking to see if/when this gets triggered
track("non-transliterable")
track("non-transliterable/" .. self._code)
track("non-transliterable/" .. sc:getCode())
track("non-transliterable/" .. sc:getCode() .. "/" .. self._code)
return nil
end
-- Remove any strip markers.
text = unstrip(text)
-- Do not process the formatting into PUA characters for certain languages.
local processed = load_data(languages_data_module).substitution[self._code] ~= "none"
-- Get the display text with the keepCarets flag set.
local subbedChars
if processed then
text, subbedChars = processDisplayText(text, self, sc, true)
end
-- Transliterate (using the module override if applicable).
text, subbedChars = iterateSectionSubstitutions(self, text, sc, subbedChars, true, module_override or
self._data.translit, "translit", "tr")
if not text then
return nil
end
-- Incomplete transliterations return nil.
local charset = sc.characters
if charset and umatch(text, "[" .. charset .. "]") then
-- Remove any characters in Latin, which includes Latin characters also included in other scripts (as these are
-- false positives), as well as any PUA substitutions. Anything remaining should only be script code "None"
-- (e.g. numerals).
local check_text = ugsub(text, "[" .. get_script("Latn").characters .. "-]+", "")
-- Set none_is_last_resort_only flag, so that any non-None chars will cause a script other than "None" to be
-- returned.
if find_best_script_without_lang(check_text, true):getCode() ~= "None" then
return nil
end
end
if processed then
text = escape_risky_characters(text)
text = undoTempSubstitutions(text, subbedChars)
end
-- If the script does not use capitalization, then capitalize any letters of the transliteration which are
-- immediately preceded by a caret (and remove the caret).
if text and not sc:hasCapitalization() and text:find("^", 1, true) then
text = processCarets(text, "%^([\128-\191\244]*%*?)([^\128-\191\244][\128-\191]*)", function(m1, m2)
return m1 .. uupper(m2)
end)
end
-- Track module overrides.
if module_override ~= nil then
track("module_override")
end
return text
end
do
local function handle_language_spec(self, spec, sc)
local ret = self["_" .. spec]
if ret == nil then
ret = self._data[spec]
if type(ret) == "string" then
ret = list_to_set(split(ret, ",", true, true))
end
self["_" .. spec] = ret
end
if type(ret) == "table" then
ret = ret[sc:getCode()]
end
return not not ret
end
function Language:overrideManualTranslit(sc)
return handle_language_spec(self, "override_translit", sc)
end
function Language:link_tr(sc)
return handle_language_spec(self, "link_tr", sc)
end
end
--[==[Returns {{code|lua|true}} if the language has a transliteration module, or {{code|lua|false}} if it doesn't.]==]
function Language:hasTranslit()
return not not self._data.translit
end
--[==[Returns {{code|lua|true}} if the language uses the letters I/ı and İ/i, or {{code|lua|false}} if it doesn't.]==]
function Language:hasDottedDotlessI()
return not not self._data.dotted_dotless_i
end
function Language:toJSON(opts)
local strip_diacritics, strip_diacritics_patterns, strip_diacritics_remove_diacritics = self._data.strip_diacritics
if strip_diacritics then
if strip_diacritics.from then
strip_diacritics_patterns = {}
for i, from in ipairs(strip_diacritics.from) do
insert(strip_diacritics_patterns, {from = from, to = strip_diacritics.to[i] or ""})
end
end
strip_diacritics_remove_diacritics = strip_diacritics.remove_diacritics
end
-- mainCode should only end up non-nil if dontCanonicalizeAliases is passed to make_object().
-- props should either contain zero-argument functions to compute the value, or the value itself.
local props = {
ancestors = function() return self:getAncestorCodes() end,
canonicalName = function() return self:getCanonicalName() end,
categoryName = function() return self:getCategoryName("nocap") end,
code = self._code,
mainCode = self._mainCode,
parent = function() return self:getParentCode() end,
full = function() return self:getFullCode() end,
stripDiacriticsPatterns = strip_diacritics_patterns,
stripDiacriticsRemoveDiacritics = strip_diacritics_remove_diacritics,
family = function() return self:getFamilyCode() end,
aliases = function() return self:getAliases() end,
varieties = function() return self:getVarieties() end,
otherNames = function() return self:getOtherNames() end,
scripts = function() return self:getScriptCodes() end,
type = function() return keys_to_list(self:getTypes()) end,
wikimediaLanguages = function() return self:getWikimediaLanguageCodes() end,
wikidataItem = function() return self:getWikidataItem() end,
wikipediaArticle = function() return self:getWikipediaArticle(true) end,
}
local ret = {}
for prop, val in pairs(props) do
if not opts.skip_fields or not opts.skip_fields[prop] then
if type(val) == "function" then
ret[prop] = val()
else
ret[prop] = val
end
end
end
-- Use `deep_copy` when returning a table, so that there are no editing restrictions imposed by `mw.loadData`.
return opts and opts.lua_table and deep_copy(ret) or to_json(ret, opts)
end
function export.getDataModuleName(code)
local letter = match(code, "^(%l)%l%l?$")
return "Module:" .. (
letter == nil and "languages/data/exceptional" or
#code == 2 and "languages/data/2" or
"languages/data/3/" .. letter
)
end
get_data_module_name = export.getDataModuleName
function export.getExtraDataModuleName(code)
return get_data_module_name(code) .. "/extra"
end
get_extra_data_module_name = export.getExtraDataModuleName
do
local function make_stack(data)
local key_types = {
[2] = "unique",
aliases = "unique",
otherNames = "unique",
type = "append",
varieties = "unique",
wikipedia_article = "unique",
wikimedia_codes = "unique"
}
local function __index(self, k)
local stack, key_type = getmetatable(self), key_types[k]
-- Data that isn't inherited from the parent.
if key_type == "unique" then
local v = stack[stack[make_stack]][k]
if v == nil then
local layer = stack[0]
if layer then -- Could be false if there's no extra data.
v = layer[k]
end
end
return v
-- Data that is appended by each generation.
elseif key_type == "append" then
local parts, offset, n = {}, 0, stack[make_stack]
for i = 1, n do
local part = stack[i][k]
if part == nil then
offset = offset + 1
else
parts[i - offset] = part
end
end
return offset ~= n and concat(parts, ",") or nil
end
local n = stack[make_stack]
while true do
local layer = stack[n]
if not layer then -- Could be false if there's no extra data.
return nil
end
local v = layer[k]
if v ~= nil then
return v
end
n = n - 1
end
end
local function __newindex()
error("table is read-only")
end
local function __pairs(self)
-- Iterate down the stack, caching keys to avoid duplicate returns.
local stack, seen = getmetatable(self), {}
local n = stack[make_stack]
local iter, state, k, v = pairs(stack[n])
return function()
repeat
repeat
k = iter(state, k)
if k == nil then
n = n - 1
local layer = stack[n]
if not layer then -- Could be false if there's no extra data.
return nil
end
iter, state, k = pairs(layer)
end
until not (k == nil or seen[k])
-- Get the value via a lookup, as the one returned by the
-- iterator will be the raw value from the current layer,
-- which may not be the one __index will return for that
-- key. Also memoize the key in `seen` (even if the lookup
-- returns nil) so that it doesn't get looked up again.
-- TODO: store values in `self`, avoiding the need to create
-- the `seen` table. The iterator will need to iterate over
-- `self` with `next` first to find these on future loops.
v, seen[k] = self[k], true
until v ~= nil
return k, v
end
end
local __ipairs = require(table_module).indexIpairs
function make_stack(data)
local stack = {
data,
[make_stack] = 1, -- stores the length and acts as a sentinel to confirm a given metatable is a stack.
__index = __index,
__newindex = __newindex,
__pairs = __pairs,
__ipairs = __ipairs,
}
stack.__metatable = stack
return setmetatable({}, stack), stack
end
return make_stack(data)
end
local function get_stack(data)
local stack = getmetatable(data)
return stack and type(stack) == "table" and stack[make_stack] and stack or nil
end
--[==[
<span style="color: var(--wikt-palette-red,#BA0000)">This function is not for use in entries or other content pages.</span>
Returns a blob of data about the language. The format of this blob is undocumented, and perhaps unstable; it's intended for things like the module's own unit-tests, which are "close friends" with the module and will be kept up-to-date as the format changes. If `extra` is set, any extra data in the relevant `/extra` module will be included. (Note that it will be included anyway if it has already been loaded into the language object.) If `raw` is set, then the returned data will not contain any data inherited from parent objects.
-- Do NOT use these methods!
-- All uses should be pre-approved on the talk page!
]==]
function Language:getData(extra, raw)
if extra then
self:loadInExtraData()
end
local data = self._data
-- If raw is not set, just return the data.
if not raw then
return data
end
local stack = get_stack(data)
-- If there isn't a stack or its length is 1, return the data. Extra data (if any) will be included, as it's stored at key 0 and doesn't affect the reported length.
if stack == nil then
return data
end
local n = stack[make_stack]
if n == 1 then
return data
end
local extra = stack[0]
-- If there isn't any extra data, return the top layer of the stack.
if extra == nil then
return stack[n]
end
-- If there is, return a new stack which has the top layer at key 1 and the extra data at key 0.
data, stack = make_stack(stack[n])
stack[0] = extra
return data
end
function Language:loadInExtraData()
-- Only full languages have extra data.
if not self:hasType("language", "full") then
return
end
local data = self._data
-- If there's no stack, create one.
local stack = get_stack(self._data)
if stack == nil then
data, stack = make_stack(data)
-- If already loaded, return.
elseif stack[0] ~= nil then
return
end
self._data = data
-- Load extra data from the relevant module and add it to the stack at key 0, so that the __index and __pairs metamethods will pick it up, since they iterate down the stack until they run out of layers.
local code = self._code
local modulename = get_extra_data_module_name(code)
-- No data cached as false.
stack[0] = modulename and load_data(modulename)[code] or false
end
--[==[Returns the name of the module containing the language's data. Currently, this is always [[Module:scripts/data]].]==]
function Language:getDataModuleName()
local name = self._dataModuleName
if name == nil then
name = self:hasType("etymology-only") and etymology_languages_data_module or
get_data_module_name(self._mainCode or self._code)
self._dataModuleName = name
end
return name
end
--[==[Returns the name of the module containing the language's data. Currently, this is always [[Module:scripts/data]].]==]
function Language:getExtraDataModuleName()
local name = self._extraDataModuleName
if name == nil then
name = not self:hasType("etymology-only") and get_extra_data_module_name(self._mainCode or self._code) or false
self._extraDataModuleName = name
end
return name or nil
end
function export.makeObject(code, data, dontCanonicalizeAliases)
local data_type = type(data)
if data_type ~= "table" then
error(("bad argument #2 to 'makeObject' (table expected, got %s)"):format(data_type))
end
-- Convert any aliases.
local input_code = code
code = normalize_code(code)
input_code = dontCanonicalizeAliases and input_code or code
local parent
if data.parent then
parent = get_by_code(data.parent, nil, true, true)
else
parent = Language
end
parent.__index = parent
local lang = {_code = input_code}
-- This can only happen if dontCanonicalizeAliases is passed to make_object().
if code ~= input_code then
lang._mainCode = code
end
local parent_data = parent._data
if parent_data == nil then
-- Full code is the same as the code.
lang._fullCode = parent._code or code
else
-- Copy full code.
lang._fullCode = parent._fullCode
local stack = get_stack(parent_data)
if stack == nil then
parent_data, stack = make_stack(parent_data)
end
-- Insert the input data as the new top layer of the stack.
local n = stack[make_stack] + 1
data, stack[n], stack[make_stack] = parent_data, data, n
end
lang._data = data
return setmetatable(lang, parent)
end
make_object = export.makeObject
end
--[==[Finds the language whose code matches the one provided. If it exists, it returns a <code class="nf">Language</code> object representing the language. Otherwise, it returns {{code|lua|nil}}, unless <code class="n">paramForError</code> is given, in which case an error is generated. If <code class="n">paramForError</code> is {{code|lua|true}}, a generic error message mentioning the bad code is generated; otherwise <code class="n">paramForError</code> should be a string or number specifying the parameter that the code came from, and this parameter will be mentioned in the error message along with the bad code. If <code class="n">allowEtymLang</code> is specified, etymology-only language codes are allowed and looked up along with normal language codes. If <code class="n">allowFamily</code> is specified, language family codes are allowed and looked up along with normal language codes.]==]
function export.getByCode(code, paramForError, allowEtymLang, allowFamily)
-- Track uses of paramForError, ultimately so it can be removed, as error-handling should be done by [[Module:parameters]], not here.
if paramForError ~= nil then
track("paramForError")
end
if type(code) ~= "string" then
local typ
if not code then
typ = "nil"
elseif check_object("language", true, code) then
typ = "a language object"
elseif check_object("family", true, code) then
typ = "a family object"
else
typ = "a " .. type(code)
end
error("The function getByCode expects a string as its first argument, but received " .. typ .. ".")
end
local m_data = load_data(languages_data_module)
if m_data.aliases[code] or m_data.track[code] then
track(code)
end
local norm_code = normalize_code(code)
-- Get the data, checking for etymology-only languages if allowEtymLang is set.
local data = load_data(get_data_module_name(norm_code))[norm_code] or
allowEtymLang and load_data(etymology_languages_data_module)[norm_code]
-- If no data was found and allowFamily is set, check the family data. If the main family data was found, make the object with [[Module:families]] instead, as family objects have different methods. However, if it's an etymology-only family, use make_object in this module (which handles object inheritance), and the family-specific methods will be inherited from the parent object.
if data == nil and allowFamily then
data = load_data("Module:families/data")[norm_code]
if data ~= nil then
if data.parent == nil then
return make_family_object(norm_code, data)
elseif not allowEtymLang then
data = nil
end
end
end
local retval = code and data and make_object(code, data)
if not retval and paramForError then
require("Module:languages/errorGetBy").code(code, paramForError, allowEtymLang, allowFamily)
end
return retval
end
get_by_code = export.getByCode
--[==[Finds the language whose canonical name (the name used to represent that language on Wiktionary) or other name matches the one provided. If it exists, it returns a <code class="nf">Language</code> object representing the language. Otherwise, it returns {{code|lua|nil}}, unless <code class="n">paramForError</code> is given, in which case an error is generated. If <code class="n">allowEtymLang</code> is specified, etymology-only language codes are allowed and looked up along with normal language codes. If <code class="n">allowFamily</code> is specified, language family codes are allowed and looked up along with normal language codes.
The canonical name of languages should always be unique (it is an error for two languages on Wiktionary to share the same canonical name), so this is guaranteed to give at most one result.
This function is powered by [[Module:languages/canonical names]], which contains a pre-generated mapping of full-language canonical names to codes. It is generated by going through the [[:Category:Language data modules]] for full languages. When <code class="n">allowEtymLang</code> is specified for the above function, [[Module:etymology languages/canonical names]] may also be used, and when <code class="n">allowFamily</code> is specified for the above function, [[Module:families/canonical names]] may also be used.]==]
function export.getByCanonicalName(name, errorIfInvalid, allowEtymLang, allowFamily)
local byName = load_data("Module:languages/canonical names")
local code = byName and byName[name]
if not code and allowEtymLang then
byName = load_data("Module:etymology languages/canonical names")
code = byName and byName[name] or
byName[gsub(name, " [Ss]ubstrate$", "")] or
byName[gsub(name, "^a ", "")] or
byName[gsub(name, "^a ", ""):gsub(" [Ss]ubstrate$", "")] or
-- For etymology families like "ira-pro".
-- FIXME: This is not ideal, as it allows " languages" to be appended to any etymology-only language, too.
byName[match(name, "^(.*) languages$")]
end
if not code and allowFamily then
byName = load_data("Module:families/canonical names")
code = byName[name] or byName[match(name, "^(.*) languages$")]
end
local retval = code and get_by_code(code, errorIfInvalid, allowEtymLang, allowFamily)
if not retval and errorIfInvalid then
require("Module:languages/errorGetBy").canonicalName(name, allowEtymLang, allowFamily)
end
return retval
end
--[==[Used by [[Module:languages/data/2]] (et al.) and [[Module:etymology languages/data]], [[Module:families/data]], [[Module:scripts/data]] and [[Module:writing systems/data]] to finalize the data into the format that is actually returned.]==]
function export.finalizeData(data, main_type, variety)
local fields = {"type"}
if main_type == "language" then
insert(fields, 4) -- script codes
insert(fields, "ancestors")
insert(fields, "link_tr")
insert(fields, "override_translit")
insert(fields, "wikimedia_codes")
elseif main_type == "script" then
insert(fields, 3) -- writing system codes
end -- Families and writing systems have no extra fields to process.
local fields_len = #fields
for _, entity in next, data do
if variety then
-- Move parent from 3 to "parent" and family from "family" to 3. These are different for the sake of convenience, since very few varieties have the family specified, whereas all of them have a parent.
entity.parent, entity[3], entity.family = entity[3], entity.family
-- Give the type "regular" iff not a variety and no other types are assigned.
elseif not (entity.type or entity.parent) then
entity.type = "regular"
end
for i = 1, fields_len do
local key = fields[i]
local field = entity[key]
if field and type(field) == "string" then
entity[key] = gsub(field, "%s*,%s*", ",")
end
end
end
return data
end
--[==[For backwards compatibility only; modules should require the error themselves.]==]
function export.err(lang_code, param, code_desc, template_tag, not_real_lang)
return require("Module:languages/error")(lang_code, param, code_desc, template_tag, not_real_lang)
end
return export
hen6s6wbv7hthor9k3teh6x7me72099
uli
0
2881
43480
39307
2026-05-13T22:20:19Z
Kambai Akau
8
43480
wikitext
text/x-wiki
==Tyap (Gworog)==
===Yei===
* [[Za̱FBS]]: /ùlí/
* A̱nwuat:
===A̱lyoot===
'''uli''' ka (''á̱kpa'' '''usa''' na)
# [[a̱li]]
==Tyap (Ta̱kad, Tyuku)==
===Yei===
* [[Za̱FBS]]: /ùlí/
* A̱nwuat:
===A̱lyoot===
'''uli''' ki (''á̱kpa'' ?'''ka̱sa''' ni)
# [[a̱li]]
[[Sa:Gworog]]
[[Sa:Ta̱kad]]
[[Sa:Tyuku]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project Part 2]]
sjs4e1klqw7v8y98n1t75xj5oa6xbwv
43481
43480
2026-05-13T22:20:35Z
Kambai Akau
8
43481
wikitext
text/x-wiki
==Tyap (Gworog)==
===Yei===
* [[Za̱FBS]]: /ùlí/
* A̱nwuat:
===A̱lyoot===
'''uli''' ka (''á̱kpa'' '''usa''' na)
# [[a̱li]]
==Tyap (Ta̱kad, Tyuku)==
===Yei===
* [[Za̱FBS]]: /ùlí/
* A̱nwuat:
===A̱lyoot===
'''uli''' ki (''á̱kpa'' '''usa''' ni)
# [[a̱li]]
[[Sa:Gworog]]
[[Sa:Ta̱kad]]
[[Sa:Tyuku]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project Part 2]]
da7b3y4hid3rt92vggwkwa3893ccohg
Ta̱mpi̱let:interproject-box
10
3053
43501
39967
2026-05-13T23:23:40Z
Kambai Akau
8
/* */
43501
wikitext
text/x-wiki
<includeonly><div class="interproject-box {{#if:{{{sisterclass|}}}|sister-{{{sisterclass}}}}} sister-project noprint floatright"><div style="float: left;" class="interproject-box-logo">[[File:{{{logo}}}|{{{logosize|x40px}}}|none|link={{{logolink|}}}|alt={{{logoalt|}}}]]</div><div style="margin-left: 60px;">{{{intro}}}:<div style="margin-left: 10px;">{{#if:{{{nobold|}}}|<span {{#if:{{{nolang|}}}|| class="Latn" lang="en"}}>|<b {{#if:{{{nolang|}}}|| class="Latn" lang="en"}}>}}{{#if:{{{noitalic|}}}||<i>}}{{{link|}}}{{#if:{{{noitalic|}}}||</i>}}{{#if:{{{nobold|}}}|</span>|</b>}}</div></div>{{#if:{{{interprojectlink|}}}|<span class="interProject">{{{interprojectlink}}}</span>}}</div><templatestyles src="Module:interproject/style.css" /></includeonly><noinclude>{{interproject-box|logo=Commons-emblem-success.svg|logoalt=Hello, World!|intro=Ta̱mpi̱let Swánga̱lyiatwuki huni byia̱ nkhang jhyang mi̱ {{FULLPAGENAME}}|link=[[#documentation|{{FULLPAGENAME}}]]}}{{documentation}}</noinclude>
8i931qs5pda3us76y9d0hqwga3e7dfs
a̱nyeang
0
3251
43475
43457
2026-05-13T22:10:25Z
Kambai Akau
8
43475
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|/əɲeʲaŋ/}}
** {{audio|kcg|Kcg-a̱nyeang.ogg|a=}}
** {{hyphenation|kcg|a̱|nyeang}} (a̱ka̱tyong 2)
===A̱lyoot===
'''a̱nyeang''' [[wu#Ati̱kut|wu]] (''á̱kpa'' '''[[a̱nyeang]]''' [[ba#Ati̱kut|ba]])
# '''A̱nyeang''' yet nggwon-a̱nab kya ka̱ nat nyeang a̱ si shyia̱ a̱yang a̱tyok wu ku a̱tyia̱ a̱tyok nka wu a̱ni .
#* ''A̱nyeang ang wu nshia̱ a?.''
#* ''A̱yang kasang cat a̱nyeang wu bat.''
#* ''A̱nyeang a̱mali wu ni cat ta̱m ba.''
#* A̱yang A̱tyusan tsot a̱nyeang ngu wu lyuo.
====Bwan di̱ lilyem ghyáng====
{{trans-top|Nggwon-a̱nap ka̱ ka̱n nat nyeang a̱ni}}
* [[Dot]]: {{t+|nl|schoondochter}}
* [[Shong]]: {{t+|en|Daughter-inlaw}}
{{trans-bottom}}
====A̱ya̱fang====
<references/>
{{wikipedia}}
{{wikidata lexeme|L1565258}}
[[Sa:a̱lyoot]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
c3ne45ipriyrzxj1iqx44m6meavo7p6
43476
43475
2026-05-13T22:11:18Z
Kambai Akau
8
43476
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|/əɲeʲaŋ/}}
** {{audio|kcg|Kcg-a̱nyeang.ogg|a=}}
** {{hyphenation|kcg|a̱|nyeang}} (a̱ka̱tyong 2)
===A̱lyoot===
'''a̱nyeang''' [[wu#Ati̱kut|wu]] (''á̱kpa'' '''[[a̱nyeang]]''' [[ba#Ati̱kut|ba]])
# '''A̱nyeang''' yet nggwon-a̱nab kya ka̱ nat nyeang a̱ si shyia̱ a̱yang a̱tyok wu ku a̱tyia̱ a̱tyok nka wu a̱ni .
#* ''A̱nyeang ang wu nshia̱ a?.''
#* ''A̱yang kasang cat a̱nyeang wu bat.''
#* ''A̱nyeang a̱mali wu ni cat ta̱m ba.''
#* A̱yang A̱tyusan tsot a̱nyeang ngu wu lyuo.
====Bwan di̱ lilyem ghyáng====
{{trans-top|Nggwon-a̱nap ka̱ ka̱n nat nyeang a̱ni}}
* [[Dot]]: {{t+|nl|schoondochter}}
* [[Shong]]: {{t+|en|daughter-in-law}}
{{trans-bottom}}
====A̱ya̱fang====
<references/>
{{wikipedia}}
{{wikidata lexeme|L1565258}}
[[Sa:a̱lyoot]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
8ywmtbge7wrtkglv62iiuidf6phfrg8
43477
43476
2026-05-13T22:13:56Z
Kambai Akau
8
43477
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|/əɲeʲaŋ/}}
** {{audio|kcg|Kcg-a̱nyeang.ogg|a=}}
** {{hyphenation|kcg|a̱|nyeang}} (a̱ka̱tyong 2)
===A̱lyoot===
'''a̱nyeang''' [[wu#Ati̱kut|wu]] (''á̱kpa'' '''[[a̱nyeang]]''' [[ba#Ati̱kut|ba]])
# '''A̱nyeang''' yet nggwon-a̱nap kya ka̱ ka̱n nat nyeang a̱ si̱ shyia̱ a̱yang a̱tyok wu ku a̱tyia̱ a̱tyok nka wu a̱ni.
#* '''''A̱nyeang''' ang wu nshyia̱ a?''
#* ''A̱yang Kasang cat '''a̱nyeang''' nggu wu ba̱t!''
#* '''''A̱nyeang''' a̱mali wuni cat ta̱m bah.''
#* ''A̱yang A̱tyusan tsot '''a̱nyeang''' nggu wu lyuo.''
====Bwan di̱ lilyem ghyáng====
{{trans-top|Nggwon-a̱nap ka̱ ka̱n nat nyeang a̱ni}}
* [[Dot]]: {{t+|nl|schoondochter}}
* [[Shong]]: {{t+|en|daughter-in-law}}
{{trans-bottom}}
====A̱ya̱fang====
<references/>
{{wikipedia}}
{{wikidata lexeme|L1565258}}
[[Sa:a̱lyoot]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
a8tv7gcj1kc1qil32cw8idrsnjczw7r
43478
43477
2026-05-13T22:14:41Z
Kambai Akau
8
43478
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|/əɲeʲaŋ/}}
** {{audio|kcg|Kcg-a̱nyeang.ogg|a=}}
** {{hyphenation|kcg|a̱|nyeang}} (a̱ka̱tyong 2)
===A̱lyoot===
'''a̱nyeang''' [[wu#Ati̱kut|wu]] (''á̱kpa'' '''[[á̱nyeang]]''' [[ba#Ati̱kut|ba]])
# '''A̱nyeang''' yet nggwon-a̱nap kya ka̱ ka̱n nat nyeang a̱ si̱ shyia̱ a̱yang a̱tyok wu ku a̱tyia̱ a̱tyok nka wu a̱ni.
#* '''''A̱nyeang''' ang wu nshyia̱ a?''
#* ''A̱yang Kasang cat '''a̱nyeang''' nggu wu ba̱t!''
#* '''''A̱nyeang''' a̱mali wuni cat ta̱m bah.''
#* ''A̱yang A̱tyusan tsot '''a̱nyeang''' nggu wu lyuo.''
====Bwan di̱ lilyem ghyáng====
{{trans-top|Nggwon-a̱nap ka̱ ka̱n nat nyeang a̱ni}}
* [[Dot]]: {{t+|nl|schoondochter}}
* [[Shong]]: {{t+|en|daughter-in-law}}
{{trans-bottom}}
====A̱ya̱fang====
<references/>
{{wikipedia}}
{{wikidata lexeme|L1565258}}
[[Sa:a̱lyoot]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
b8goh92yv347140b5yqwg2xnk1dzbrs
á̱nyeang
0
3252
43479
42652
2026-05-13T22:16:28Z
Kambai Akau
8
43479
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|/əɲeʲaŋ/}}
** {{audio|kcg|Kcg-á̱nyeang.ogg|a=}}
** {{hyphenation|kcg|a̱|nyeang}} (a̱ka̱tyong 2)
===A̱lyoot===
# ''a̱nyiung'' '''[[a̱nyeang]]''' [[wu#Ati̱kut|wu]]
====Bwan di̱ lilyem ghyáng====
{{trans-top|Mman á̱nap ba̱ ka̱n nyia̱ nyeang a̱ni}}
* [[Dot]]: {{t+|nl|schoondochters}}
* [[Shong]]: {{t+|en|daughters-in-law}}
{{trans-bottom}}
[[Sa:a̱lyoot]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
d9rqkbxpo48wett4xzt4liv6wiojbii
Module:usex
828
3649
43482
2026-05-13T22:34:28Z
Kambai Akau
8
Created page with "local export = {} local debug_track_module = "Module:debug/track" local links_module = "Module:links" local scripts_module = "Module:scripts" local script_utilities_module = "Module:script utilities" local string_utilities_module = "Module:string utilities" local usex_data_module = "Module:usex/data" local m_str_utils = require(string_utilities_module) local rsubn = m_str_utils.gsub local rsplit = m_str_utils.split local rfind = m_str_utils.find local uupper = m_str_u..."
43482
Scribunto
text/plain
local export = {}
local debug_track_module = "Module:debug/track"
local links_module = "Module:links"
local scripts_module = "Module:scripts"
local script_utilities_module = "Module:script utilities"
local string_utilities_module = "Module:string utilities"
local usex_data_module = "Module:usex/data"
local m_str_utils = require(string_utilities_module)
local rsubn = m_str_utils.gsub
local rsplit = m_str_utils.split
local rfind = m_str_utils.find
local uupper = m_str_utils.upper
local ulen = m_str_utils.len
local u = m_str_utils.char
local translit_data = mw.loadData("Module:transliteration/data")
local needs_translit = translit_data[1]
local BRACKET_SUB = u(0xFFF0)
local original_text = "<small>''original:''</small> "
-- 100 more or less corresponds to the setting of 30 for the example text alone as formerly used in
-- {{hi-x}} and {{ur-x}}, taking into account transliteration, gloss and formatting characters.
-- FIXME: We should have different widths for desktop vs. mobile and generate the appropriate CSS so
-- both are handled correctly.
local MAX_INLINE_WIDTH = 100 -- In characters. HACK! FIXME! Do this a better way.
-- List of scripts whose characters are double-width/full-width.
local double_width_scripts = {"Hani", "Hrkt", "Hang"}
-- microformat2 classes, see https://phabricator.wikimedia.org/T138709
local css_classes = {
container_ux = 'h-usage-example',
container_quotation = 'h-quotation',
example = 'e-example',
quotation = 'e-quotation',
quotation_with_citation = 'e-quotation cited-passage',
translation = 'e-translation',
-- The following are added by [[Module:script utilities]], using [[Module:script utilities/data]]
-- transliteration = 'e-transliteration',
-- transcription = 'e-transcription',
normalization = 'e-normalization',
literally = 'e-literally',
qualifier = 'e-qualifier',
source = 'e-source',
footer = 'e-footer'
}
-- helper functions
local function track(page, code)
local tracking_page = "usex/" .. page
local debug_track = require(debug_track_module)
debug_track(tracking_page)
if code then
debug_track(tracking_page .. "/" .. code)
end
return true
end
-- version of rsubn() that discards all but the first return value
local function rsub(term, foo, bar)
local retval = rsubn(term, foo, bar)
return retval
end
local function wrap(tag, class, text, lang)
if lang then
lang = ' lang="' .. lang .. '"'
else
lang = ""
end
if text and class then
return table.concat{'<', tag, ' class="', class, '"', lang, '>', text, '</', tag, '>'}
else
return nil
end
end
local function span(class, text) return wrap('span', class, text) end
local function div(class, text) return wrap('div', class, text) end
-- Remove any HTML from the formatted text and resolve links, since the extra characters don't contribute to the
-- displayed length.
local function convert_to_raw_text(text)
text = rsub(text, "<.->", "")
if text:find("%[%[") then
text = require(links_module).remove_links(text)
end
return text
end
local function get_character_width(text)
local charsets = {}
for _, script in ipairs(double_width_scripts) do
table.insert(charsets, require(scripts_module).getByCode(script):getCharacters())
end
local single_width_chars = ulen(rsub(text, "[" .. table.concat(charsets) .. "]", ""))
local total_chars = ulen(text)
local double_width_chars = total_chars - single_width_chars
return single_width_chars + 2 * double_width_chars
end
--[==[
Apply the substitutions in `subst` (from the {{para|subst}} parameter or similar) to the example or quotation in
`usex` after removing links, returning the resulting text. `track`, if supplied, is a function of one argument that is
used to insert tracking categories: one for any call to this function, another if a single / is used in the `subst`
argument.
]==]
function export.apply_subst(usex, subst, track)
local subbed_usex = require(links_module).remove_links(usex)
local function do_track(page)
if track then
track(page)
end
return true
end
if subst then
-- [[Special:WhatLinksHere/Wiktionary:Tracking/usex/subst]]
do_track("subst")
subst = rsplit(subst, ",")
for _, subpair in ipairs(subst) do
-- [[Special:WhatLinksHere/Wiktionary:Tracking/usex/subst-single-slash]]
local subsplit = rsplit(subpair, rfind(subpair, "//") and "//" or do_track("subst-single-slash") and "/")
subbed_usex = rsub(subbed_usex, subsplit[1], subsplit[2])
end
end
return subbed_usex
end
--[=[
Process parameters for usex text (either the primary text or the original text) and associated annotations. On input,
the following fields are recognized in `data` (all are optional except as marked):
* `lang`: Language object of text; may be an etymology language (REQUIRED).
* `termlang`: The language object of the term being illustrated, which may be different from the language of the main
quotation text and should always be based off of the main text, not the original text. Used for
categories. May be an etymology language (REQUIRED).
* `usex`: Text of usex/quotation.
* `sc`: Script object of text.
* `tr`: Manual transliteration.
* `ts`: Transcription.
* `norm`: Normalized version of text.
* `normsc`: Script object of normalized version of text, or "auto".
* `subst`: String of substitutions for transliteration purposes.
* `quote`: If non-nil, this is a quotation (using {{tl|quote}} or {{tl|quote-*}}) instead of a usage example (using
{{tl|usex}}). If it has the specific value "quote-meta", this is a quotation with citation (invoked from
{{tl|quote-*}}). This controls the CSS class used to display the quotation, as well as the face used to tag the usex
(which in turn results in the usex being upright text if a quotation, and italic text if a usage example).
* `title`: Title object of the current page (REQUIRED).
* `q`: List of left qualifiers.
* `qq`: List of right qualifiers.
* `ref`: String to display directly after any right qualifier, with no space. (FIXME: Should be converted into
an actual ref.)
* `nocat`: Overall `data.nocat` value.
* `categories`: List to insert categories into (REQUIRED).
* `example_type`: Either "quotation" (if `quote` specified) or "usage example" (otherwise) (REQUIRED).
On output, return an object with four fields:
* `usex`: Formatted usex, including qualifiers attached to both sides and `ref` attached to the right. Always specified.
* `tr`: Formatted transliteration; may be nil.
* `ts`: Formatted transcription; may be nil.
* `norm`: Formatted normalized version of usex; may be nil.
]=]
local function process_usex_text(data)
local lang = data.lang
local termlang = data.termlang
local usex = data.usex
local sc = data.sc
local sc_explicit = sc
local tr = data.tr
local ts = data.ts
local norm = data.norm
local normsc = data.normsc
local subst = data.subst
local quote = data.quote
local leftq = data.q
local rightq = data.qq
local ref = data.ref
local nocat = data.nocat
local categories = data.categories
local example_type = data.example_type
local title = data.title
if normsc == "auto" then
normsc = nil
elseif not normsc then
normsc = sc
end
if not sc then
sc = lang:findBestScript(usex)
end
if not normsc and norm then
normsc = lang:findBestScript(norm)
end
local langcode = lang:getFullCode()
-- tr=- means omit transliteration altogether
if tr == "-" then
tr = nil
else
-- Try to auto-transliterate.
if tr then
-- [[Special:WhatLinksHere/Wiktionary:Tracking/usex/manual-tr]]
-- [[Special:WhatLinksHere/Wiktionary:Tracking/usex/manual-tr/LANGCODE]]
track("manual-tr", langcode)
else
-- First, try transliterating the normalization, if supplied.
if norm and normsc then
local normsc_code = normsc:getCode()
if normsc_code ~= "None" and not normsc_code:find("Lat") then -- Latn, Latf, Latg, pjt-Latn
local subbed_norm = export.apply_subst(norm, subst, track)
tr = lang:transliterate(subbed_norm, normsc)
end
end
-- If no normalization, or the normalization is in a Latin script, or the transliteration of the
-- normalization failed, fall back to transliterating the usex.
if not tr and usex then
local subbed_usex = export.apply_subst(usex, subst, track)
tr = lang:transliterate(subbed_usex, sc)
end
-- If the language doesn't have capitalization and is specified in [[Module:usex/data]], then capitalize any sentences.
-- Exclamation marks and question marks need to be unescaped then re-escaped.
if tr and mw.loadData(usex_data_module).capitalize_sentences[langcode] then
tr = tr:gsub("!", "!")
:gsub("?", "?")
tr = rsub(tr, "%f[^%z%p%s](.)(.-[%.%?!‽])", function(m1, m2)
return uupper(m1) .. m2
end)
tr = tr:gsub("!", "!")
:gsub("%?", "?")
end
end
-- If there is still no transliteration, then add a cleanup category.
if not tr and needs_translit[langcode] then
local sccode = sc:getCode()
if sccode ~= "None" and not sccode:find("Lat") then
table.insert(categories, ("Requests for transliteration of %s %ss"):format(lang:getCanonicalName(),
example_type))
end
end
end
if tr and norm then
track("tr-and-norm")
end
if tr then
tr = require(script_utilities_module).tag_translit(tr, langcode, "usex")
end
if ts then
ts = require(script_utilities_module).tag_transcription(ts, langcode, "usex")
ts = "/" .. ts .. "/"
end
local function do_language_and_script_tagging(usex, lang, sc, css_class)
usex = require(links_module).embedded_language_links{term = usex, lang = lang, sc = sc}
local face
if quote then
face = nil
else
face = "term"
end
usex = require(script_utilities_module).tag_text(usex, lang, sc, face, css_class)
return usex
end
if usex then
usex = do_language_and_script_tagging(usex, lang, sc,
quote == "quote-meta" and css_classes.quotation_with_citation or
quote and css_classes.quotation or css_classes.example)
if not nocat then
-- Only add [[Citations:foo]] to [[:Category:LANG terms with quotations]] if [[foo]] exists.
local ok_to_add_cat
if title.nsText ~= "Citations" then
ok_to_add_cat = true
else
-- Here we don't want to use the subpage text because we check [[Citations:foo]] against [[foo]] and
-- if there's a slash in what follows 'Citations:', we want to check against the full page with the
-- slash.
local mainspace_title = mw.title.new(title.text)
if mainspace_title and mainspace_title.exists then
ok_to_add_cat = true
end
end
if ok_to_add_cat then
-- Categories beginning with the language name should use full languages as that's what the poscat
-- system requires, but 'Requests for' categories can use etymology-only languages.
table.insert(categories, ("%s terms with %ss"):format(termlang:getFullName(), example_type))
end
end
else
if tr then
table.insert(categories, ("Requests for %s in %s %ss"):format(
sc_explicit and sc_explicit:getDisplayForm() or "native script",
lang:getCanonicalName(), example_type))
end
-- TODO: Trigger some kind of error here
usex = "<small>(please add the primary text of this " .. example_type .. ")</small>"
end
if norm then
-- Use brackets in HTML entity format just to make sure we don't interfere with links; add brackets before
-- script tagging so that if the script tagging increases the font size, the brackets get increased too.
norm = "[" .. norm .. "]"
norm = do_language_and_script_tagging(norm, lang, normsc, css_classes.normalization)
end
local result = {}
if leftq and #leftq > 0 then
table.insert(result, span(css_classes.qualifier, require("Module:qualifier").format_qualifier(leftq)) .. " ")
end
table.insert(result, usex)
if rightq and #rightq > 0 then
table.insert(result, " " .. span(css_classes.qualifier, require("Module:qualifier").format_qualifier(rightq)))
end
if ref and ref ~= "" then
track("ref")
table.insert(result, ref)
end
return {
usex = table.concat(result),
tr = tr,
ts = ts,
norm = norm
}
end
local function format_audio(audio)
if audio then
return " [[File:" .. audio .. "|25px]]"
else
return ""
end
end
--[==[
Format a usex or quotation. Implementation of {{tl|ux}}, {{tl|quote}} and {{tl|quote-*}} templates (e.g.
{{tl|quote-book}}, {{tl|quote-journal}}, {{tl|quote-web}}, etc.). FIXME: Should also be used by {{tl|Q}} and
[[Module:Quotations]].
Takes a single object `data`, containining the following fields:
* `usex`: The text of the usex or quotation to format. Semi-mandatory (a maintenance line is displayed if missing).
* `lang`: The language object of the text. Mandatory. May be an etymology language.
* `termlang`: The language object of the term, which may be different from the language of the text. Defaults to `lang`.
Used for categories. May be an etymology language.
* `sc`: The script object of the text. Autodetected if not given.
* `quote`: If specified, this is a quotation rather than a usex (uses a different CSS class that affects formatting).
* `inline`: If specified, format the usex or quotation inline (on one line).
* `translation`: Translation of the usex or quotation, if in a foreign language.
* `lit`: Literal translation (if the translation in `translation` is idiomatic and differs significantly from the
literal translation).
* `normalization`: Normalized version of the usex or quotation (esp. for older languages where nonstandard spellings
were common).
* `normsc`: Script object of the normalized text. If unspecified, use the script object given in `sc` if any, otherwise
do script detection on the normalized text. If "auto", do script detection on the normalized text even if
a script was specified in `sc`.
* `transliteration`: Transliteration of the usex. If unspecified, transliterate the normalization if specified and not
in a Latin script and transliterable, otherwise fall back to transliterating the usex text.
* `transcription`: Transcription of the usex, for languages where the transliteration differs significantly from the
pronunciation.
* `subst`: String indicating substitutions to perform on the usex/quotation and normalization prior to transliterating
them. Multiple substs are comma-separated and individual substs are of the form FROM//TO where FROM is a
Lua pattern and TO is a Lua replacement spec. (FROM/TO is also recognized if no // is present in the
substitution.)
* `q`: If specified, a list of left qualifiers to display before the usex/quotation text.
* `qq`: If specified, a list of right qualifiers to display after the usex/quotation text.
* `qualifiers`: If specified, a list of right qualifiers to display after the usex/quotation text, for compatibility
purposes.
* `ref`: Reference text to display directly after the right qualifiers. (FIXME: Instead, this should be actual
references.)
* `audio`: Name of the audio file containing the usex in spoken form.
* `orig`: Original text, if the primary text of the usex or quotation is a translation.
* `origlang`: The language object of the original text. Mandatory if original text given. May be an etymology language.
* `origsc`: The script object of the original text. Autodetected if not given.
* `orignorm`: Normalized version of the original text (esp. for older languages where nonstandard spellings were
common).
* `orignormsc`: Script object of the normalized original text. If unspecified, use the script object given in `origsc`
if any, otherwise do script detection on the normalized original text. If "auto", do script detection
on the normalized text even if a script was specified in `origsc`.
* `origtr`: Transliteration of the original text. If unspecified, transliterate the normalized original text if
specified and not in a Latin script and transliterable, otherwise fall back to transliterating the original
text.
* `origts`: Transcription of the original text, for languages where the transliteration differs significantly from the
pronunciation.
* `origsubst`: String indicating substitutions to perform on the original text and normalization thereof prior to
transliterating them. Multiple substs are comma-separated and individual substs are of the form FROM//TO
where FROM is a Lua pattern and TO is a Lua replacement spec. (FROM/TO is also recognized if no // is
present in the substitution.)
* `origq`: If specified, a list of left qualifiers to display before the original text.
* `origqq`: If specified, a list of right qualifiers to display after the original text.
* `origref`: Reference text to display directly after the right qualifiers of the original text. (FIXME: Instead, this
should be actual references.)
* `source`: Source of the quotation, displayed in parens after the quotation text.
* `footer`: Footer displaying miscellaneous information, shown after the quotation. (Typically this should be in a
small font.)
* `nocat`: Suppress categorization.
* `noreq`: Suppress request for translation when no translation provided.
* `sortkey`: Sort key for categories.
* `brackets`: If specified, show a bracket at the end (used with brackets= in {{tl|quote-*}} templates, which show the
bracket at the beginning, to indicate a mention rather than a use).
* `class`: Additional CSS class surrounding the entire formatted text.
]==]
function export.format_usex(data)
local lang = data.lang
local termlang = data.termlang or lang
local translation = data.translation
local quote = data.quote
local lit = data.lit
local audio = data.audio
local source = data.source
local brackets = data.brackets
local footer = data.footer
local sortkey = data.sortkey
local noreq = data.noreq
local title
if data.pagename then -- for testing, doc pages, etc.
title = mw.title.new(data.pagename)
if not title then
error(("Bad value for `data.pagename`: '%s'"):format(data.pagename))
end
else
title = mw.title.getCurrentTitle()
end
--[[
if title.nsText == "Reconstruction" or lang:hasType("reconstructed") then
error("Reconstructed languages and reconstructed terms cannot have usage examples, as we have no record of their use.")
end
]]
if lit then
lit = "(literally, “" .. span(css_classes.literally, lit) .. "”)"
end
if source then
source = "(" .. span(css_classes.source, source) .. ")"
end
if footer then
footer = span(css_classes.footer, footer)
end
local example_type = quote and "quotation" or "usage example" -- used in error messages and categories
local categories = {}
local usex_obj = process_usex_text {
lang = lang,
termlang = termlang,
usex = data.usex,
sc = data.sc,
tr = data.transliteration,
ts = data.transcription,
norm = data.normalization,
normsc = data.normsc,
subst = data.subst,
quote = data.quote,
title = title,
q = data.q,
qq = data.qq,
ref = data.ref,
nocat = data.nocat,
categories = categories,
example_type = example_type,
}
local orig_obj = data.orig and process_usex_text {
lang = data.origlang,
-- Any categories derived from the original text should use the language of the main text or the term inside it,
-- not the language of the original text.
termlang = termlang,
usex = data.orig,
sc = data.origsc,
tr = data.origtr,
ts = data.origts,
norm = data.orignorm,
normsc = data.orignormsc,
subst = data.origsubst,
quote = data.quote,
title = title,
q = data.origq,
qq = data.origqq,
ref = data.origref,
nocat = data.nocat,
categories = categories,
example_type = example_type,
} or nil
if translation == "-" then
translation = nil
table.insert(categories, ("%s %ss with omitted translation"):format(lang:getFullName(),
example_type))
elseif translation then
translation = span(css_classes.translation, translation)
elseif not noreq then
local langcode = lang:getFullCode()
local origlangcode = data.origlang and data.origlang:getFullCode()
if langcode ~= "en" and langcode ~= "mul" and langcode ~= "und" and origlangcode ~= "en" then
-- add trreq category if translation is unspecified and language is not english, translingual or
-- undetermined
table.insert(categories, ("Requests for translations of %s %ss"):format(lang:getCanonicalName(),
example_type))
if quote then
translation = "<small>(please [[WT:Quotations#Adding translations to quotations|add an English translation]] of this "
.. example_type .. ")</small>"
else
translation = "<small>(please add an English translation of this " .. example_type .. ")</small>"
end
end
end
local function generate_inline_usex()
local result = {}
local function ins(text)
table.insert(result, text)
end
ins(usex_obj.usex)
ins(format_audio(audio))
local function insert_annotations(obj)
if obj.norm then
ins(" " .. obj.norm)
end
if obj.tr or obj.ts then
ins(" ―")
if obj.tr then
ins(" " .. obj.tr)
end
if obj.ts then
ins(" " .. obj.ts)
end
end
end
insert_annotations(usex_obj)
if orig_obj then
ins(" (")
ins("[" .. original_text .. orig_obj.usex .. "]")
insert_annotations(orig_obj)
ins(")")
end
if translation then
ins(" ― " .. translation)
end
if lit then
ins(" " .. lit)
end
if source then
ins(" " .. source)
end
if footer then
ins(" " .. footer)
end
if data.brackets then
ins("]")
end
return table.concat(result)
end
local function generate_multiline_usex()
local result = {}
local function ins(text)
table.insert(result, text)
end
ins(usex_obj.usex)
ins(format_audio(audio))
local any_usex_annotations = usex_obj.tr or usex_obj.ts or usex_obj.norm or translation or lit
local any_orig_annotations = orig_obj and (orig_obj.tr or orig_obj.ts or orig_obj.norm)
if any_usex_annotations or orig_obj or source or footer then
ins("<dl>")
local function insert_dd(text)
if text then
ins("<dd>")
ins(text)
if data.brackets then
ins(BRACKET_SUB)
end
ins("</dd>")
end
end
insert_dd(usex_obj.norm)
insert_dd(usex_obj.tr)
insert_dd(usex_obj.ts)
if orig_obj then
insert_dd("[" .. original_text .. orig_obj.usex .. "]")
if any_orig_annotations then
ins("<dd><dl>")
insert_dd(orig_obj.norm)
insert_dd(orig_obj.tr)
insert_dd(orig_obj.ts)
ins("</dl></dd>")
end
end
insert_dd(translation)
insert_dd(lit)
if source or footer then
if any_usex_annotations then
ins("<dd><dl>")
end
insert_dd(source)
insert_dd(footer)
if any_usex_annotations then
ins("</dl></dd>")
end
end
ins("</dl>")
elseif data.brackets then
ins(BRACKET_SUB)
end
result = table.concat(result)
if data.brackets then
result = result:gsub("^(.*)" .. BRACKET_SUB, "%1]"):gsub(BRACKET_SUB, "")
end
return result
end
local is_inline
if data.inline == "auto" then
result = generate_inline_usex()
if get_character_width(convert_to_raw_text(result)) > MAX_INLINE_WIDTH then
result = generate_multiline_usex()
is_inline = false
else
is_inline = true
end
elseif data.inline then
result = generate_inline_usex()
is_inline = true
else
result = generate_multiline_usex()
is_inline = false
end
local class = quote and css_classes.container_quotation or css_classes.container_ux
if data.class then
class = class .. " " .. data.class
end
result = (is_inline and span or div)(class, result)
return result .. require("Module:utilities").format_categories(categories, lang, sortkey)
end
return export
o14rj399xkfd16bbul4vgyizscnixhy
Ta̱mpi̱let:cd
10
3650
43483
2026-05-13T22:35:47Z
Kambai Akau
8
Redirected page to [[Ta̱mpi̱let:code]]
43483
wikitext
text/x-wiki
#REDIRECT [[Template:code]]
kaxcai41ou8uprnp6jghkf8869oymbt
Module:languages/error
828
3651
43484
2026-05-13T22:36:47Z
Kambai Akau
8
Created page with "--[=[ Throw an error for an invalid language code or script code. `code` (required) is the bad code and can be nil or a non-string. `param` (required) is the name of the parameter in which the code was contained. It can be a string, a number (for a numeric param, in which case the param will show up in the error message as an ordinal such as "first" or "second"), or `true` if no parameter can be clearly identified. It can also be a function of one argument (an error..."
43484
Scribunto
text/plain
--[=[
Throw an error for an invalid language code or script code.
`code` (required) is the bad code and can be nil or a non-string.
`param` (required) is the name of the parameter in which the code was contained. It can be a string, a number (for a
numeric param, in which case the param will show up in the error message as an ordinal such as "first" or "second"),
or `true` if no parameter can be clearly identified. It can also be a function of one argument (an error message) or
two arguments (an error message and a number of stack frames to ignore, same as in the error() function itself), in
which case the function will be called with the same error message as if `true` were passed in. (This is useful e.g.
when parsing inline modifiers, where the parameter name may not be available or may not be so clearly identifiable,
but an error function is often available.)
`code_desc` (optional) is text describing what the code is; by default, "language code".
`template_text` (optional) is a string specifying the template that generated the error, or a function
to generate this string. If given, it will be displayed in the error message.
]=]
local dump = mw.dumpObject
local function internal_err(param, types, param_type)
error(("Internal error: the parameter %s must be %s, but received %s"):format(dump(param), types, param_type))
end
return function(code, param, code_desc, template_tag)
local is_callable = require("Module:fun").is_callable
if param == nil then
param = true
end
if not code_desc then
code_desc = "language code"
end
if template_tag then
if is_callable(template_tag) then
template_tag = template_tag()
else
local template_tag_type = type(template_tag)
if template_tag_type ~= "string" then
internal_err("template_tag", "a function, callable table or nil", "a " .. template_tag_type)
end
end
template_tag = (" (original template: %s)"):format(template_tag)
else
template_tag = ""
end
local param_is_callable = is_callable(param)
local function err(msg)
msg = msg .. template_tag
if not(param_is_callable) then
error(require("Module:string utilities").ucfirst(msg) .. ".", 3)
end
param(msg, 3)
end
if not (param == true or param_is_callable) then
local param_type = type(param)
if not (param_type == "string" or param_type == "number") then
internal_err("param", "a number, string, function, callable table, true or nil", param == false and "false" or "a " .. param_type)
end
param = ("parameter %s "):format(dump(param))
end
if code == nil or code == "" then
if param == true or param_is_callable then
err(code_desc .. " is missing")
else
err(param .. "(" .. code_desc .. ") is missing")
end
else
local code_type = type(code)
if code_type ~= "string" then
internal_err("code", "a string or nil", "a " .. code_type)
end
end
err(("%smust be a valid %s; the value %s is not valid (see [[Wiktionary:List of languages]])"):format(
(param == true or param_is_callable) and "" or param, code_desc, dump(code)
))
end
68uzi96qzi6p7xo9o94htn1wi1zdfbk
Ta̱mpi̱let:th-usex
10
3652
43485
2026-05-13T22:38:12Z
Kambai Akau
8
Created page with "<includeonly>{{#invoke:th|usex}}[[Category:Thai terms with usage examples]]</includeonly><noinclude>{{documentation}}</noinclude>"
43485
wikitext
text/x-wiki
<includeonly>{{#invoke:th|usex}}[[Category:Thai terms with usage examples]]</includeonly><noinclude>{{documentation}}</noinclude>
0nm02qs8y9xqwprd0mhrwqvn1emo4uq
Module:scripts/canonical names
828
3653
43486
2026-05-13T22:42:26Z
Kambai Akau
8
Created page with "return { ["Adlam"] = "Adlm", ["Afaka"] = "Afak", ["Ahom"] = "Ahom", ["Anatolian hieroglyphic"] = "Hluw", ["Ancient North Arabian"] = "Narb", ["Ancient South Arabian"] = "Sarb", ["Arabic"] = "Arab", ["Armenian"] = "Armn", ["Assamese"] = "as-Beng", ["Avestan"] = "Avst", ["Balinese"] = "Bali", ["Bamum"] = "Bamu", ["Bassa"] = "Bass", ["Batak"] = "Batk", ["Baybayin"] = "Tglg", ["Bengali"] = "Beng", ["Bhaiksuki"] = "Bhks", ["Blissymbolic"] = "Blis", ["Book P..."
43486
Scribunto
text/plain
return {
["Adlam"] = "Adlm",
["Afaka"] = "Afak",
["Ahom"] = "Ahom",
["Anatolian hieroglyphic"] = "Hluw",
["Ancient North Arabian"] = "Narb",
["Ancient South Arabian"] = "Sarb",
["Arabic"] = "Arab",
["Armenian"] = "Armn",
["Assamese"] = "as-Beng",
["Avestan"] = "Avst",
["Balinese"] = "Bali",
["Bamum"] = "Bamu",
["Bassa"] = "Bass",
["Batak"] = "Batk",
["Baybayin"] = "Tglg",
["Bengali"] = "Beng",
["Bhaiksuki"] = "Bhks",
["Blissymbolic"] = "Blis",
["Book Pahlavi"] = "Phlv",
["Brahmi"] = "Brah",
["Braille"] = "Brai",
["Buhid"] = "Buhd",
["Burmese"] = "Mymr",
["Canadian syllabic"] = "Cans",
["Carian"] = "Cari",
["Caucasian Albanian"] = "Aghb",
["Chakma"] = "Cakm",
["Cham"] = "Cham",
["Cherokee"] = "Cher",
["Chisoi"] = "Chis",
["Clear Script"] = "xwo-Mong",
["Coptic"] = "Copt",
["Cuneiform"] = "Xsux",
["Cypriot"] = "Cprt",
["Cypro-Minoan"] = "Cpmn",
["Cyrillic"] = "Cyrl",
["Demotic"] = "Egyd",
["Deseret"] = "Dsrt",
["Devanagari"] = "Deva",
["Dhives Akuru"] = "Diak",
["Dogra"] = "Dogr",
["Dongba"] = "Nkdb",
["Duployan"] = "Dupl",
["Egyptian hieroglyphic"] = "Egyp",
["Elbasan"] = "Elba",
["Elymaic"] = "Elym",
["Ethiopic"] = "Ethi",
["Fraktur"] = "Latf",
["Fraser"] = "Lisu",
["Gaelic"] = "Latg",
["Garay"] = "Gara",
["Geba"] = "Nkgb",
["Georgian"] = "Geor",
["Glagolitic"] = "Glag",
["Gothic"] = "Goth",
["Grantha"] = "Gran",
["Greek"] = "Grek",
["Gujarati"] = "Gujr",
["Gunjala Gondi"] = "Gong",
["Gurmukhi"] = "Guru",
["Han"] = "Hani",
["Hangul"] = "Hang",
["Hanifi Rohingya"] = "Rohg",
["Hanunoo"] = "Hano",
["Hatran"] = "Hatr",
["Hebrew"] = "Hebr",
["Hieratic"] = "Egyh",
["Hiragana"] = "Hira",
["Image-rendered"] = "Image",
["Imperial Aramaic"] = "Armi",
["Indus"] = "Inds",
["Inscriptional Pahlavi"] = "Phli",
["Inscriptional Parthian"] = "Prti",
["International Phonetic Alphabet"] = "Ipach",
["Japanese"] = "Jpan",
["Javanese"] = "Java",
["Jurchen"] = "Jurc",
["Kaithi"] = "Kthi",
["Kana"] = "Hrkt",
["Kannada"] = "Knda",
["Katakana"] = "Kana",
["Kawi"] = "Kawi",
["Kayah Li"] = "Kali",
["Kharoshthi"] = "Khar",
["Khema"] = "Gukh",
["Khitan large"] = "Kitl",
["Khitan small"] = "Kits",
["Khmer"] = "Khmr",
["Khojki"] = "Khoj",
["Khom Thai"] = "Khomt",
["Khudabadi"] = "Sind",
["Khutsuri"] = "Geok",
["Khwarezmian"] = "Chrs",
["Kirat Rai"] = "Krai",
["Korean"] = "Kore",
["Kpelle"] = "Kpel",
["Kulitan"] = "Kulit",
["Lai Tay"] = "Tayo",
["Lao"] = "Laoo",
["Latin"] = "Latn",
["Leke"] = "Leke",
["Lepcha"] = "Lepc",
["Limbu"] = "Limb",
["Linear A"] = "Lina",
["Linear B"] = "Linb",
["Loma"] = "Loma",
["Lontara"] = "Bugi",
["Lycian"] = "Lyci",
["Lydian"] = "Lydi",
["Mahajani"] = "Mahj",
["Makasar"] = "Maka",
["Malayalam"] = "Mlym",
["Manchu"] = "mnc-Mong",
["Mandaic"] = "Mand",
["Manichaean"] = "Mani",
["Marchen"] = "Marc",
["Masaram Gondi"] = "Gonm",
["Maya"] = "Maya",
["Medefaidrin"] = "Medf",
["Meitei Mayek"] = "Mtei",
["Mende"] = "Mend",
["Meroitic cursive"] = "Merc",
["Meroitic hieroglyphic"] = "Mero",
["Modi"] = "Modi",
["Mongolian"] = "Mong",
["Moon"] = "Moon",
["Morse code"] = "Morse",
["Mru"] = "Mroo",
["Multani"] = "Mult",
["Mundari Bani"] = "Nagm",
["N'Ko"] = "Nkoo",
["Nabataean"] = "Nbat",
["Nandinagari"] = "Nand",
["New Tai Lue"] = "Talu",
["Newa"] = "Newa",
["Northeastern Iberian"] = "Ibrnn",
["Nyiakeng Puachue Hmong"] = "Hmnp",
["Nüshu"] = "Nshu",
["Odia"] = "Orya",
["Ogham"] = "Ogam",
["Ol Chiki"] = "Olck",
["Ol Onal"] = "Onao",
["Old Cyrillic"] = "Cyrs",
["Old Hungarian"] = "Hung",
["Old Italic"] = "Ital",
["Old Permic"] = "Perm",
["Old Persian"] = "Xpeo",
["Old Sogdian"] = "Sogo",
["Old Turkic"] = "Orkh",
["Old Uyghur"] = "Ougr",
["Osage"] = "Osge",
["Osmanya"] = "Osma",
["Pahawh Hmong"] = "Hmng",
["Palmyrene"] = "Palm",
["Pau Cin Hau"] = "Pauc",
["Pazend"] = "pal-Avst",
["Phags-pa"] = "Phag",
["Phoenician"] = "Phnx",
["Pollard"] = "Plrd",
["Proto-Cuneiform"] = "Pcun",
["Proto-Elamite"] = "Pelm",
["Proto-Sinaitic"] = "Psin",
["Psalter Pahlavi"] = "Phlp",
["Ranjana"] = "Ranj",
["Rejang"] = "Rjng",
["Rongorongo"] = "Roro",
["Rumi numerals"] = "Rumin",
["Runic"] = "Runr",
["Samaritan"] = "Samr",
["Saurashtra"] = "Saur",
["Shahmukhi"] = "pa-Arab",
["Sharada"] = "Shrd",
["Shavian"] = "Shaw",
["Siddham"] = "Sidd",
["Sidetic"] = "Sidt",
["SignWriting"] = "Sgnw",
["Simplified Han"] = "Hans",
["Sinhalese"] = "Sinh",
["Sogdian"] = "Sogd",
["Sorang Sompeng"] = "Sora",
["Southeastern Iberian"] = "Ibrns",
["Soyombo"] = "Soyo",
["Sui"] = "Shui",
["Sundanese"] = "Sund",
["Sunuwar"] = "Sunu",
["Sylheti Nagri"] = "Sylo",
["Syriac"] = "Syrc",
["Tagbanwa"] = "Tagb",
["Tai Nüa"] = "Tale",
["Tai Tham"] = "Lana",
["Tai Viet"] = "Tavt",
["Takri"] = "Takr",
["Tamil"] = "Taml",
["Tamyig"] = "sit-tam-Tibt",
["Tangsa"] = "Tnsa",
["Tangut"] = "Tang",
["Telugu"] = "Telu",
["Tengwar"] = "Teng",
["Thaana"] = "Thaa",
["Thai"] = "Thai",
["Tibetan"] = "Tibt",
["Tifinagh"] = "Tfng",
["Tigalari"] = "Tutg",
["Tirhuta"] = "Tirh",
["Todhri"] = "Todr",
["Tolong Siki"] = "Tols",
["Toto"] = "Toto",
["Traditional Han"] = "Hant",
["Ugaritic"] = "Ugar",
["Vai"] = "Vaii",
["Varang Kshiti"] = "Wara",
["Visible Speech"] = "Visp",
["Vithkuqi"] = "Vith",
["Wancho"] = "Wcho",
["Woleai"] = "Wole",
["Xibe"] = "sjo-Mong",
["Yezidi"] = "Yezi",
["Yi"] = "Yiii",
["Zanabazar Square"] = "Zanb",
["Zhuyin"] = "Bopo",
["Znamenny musical notation"] = "Zname",
["flag semaphore"] = "Semap",
["mathematical notation"] = "Zmth",
["musical notation"] = "Music",
["symbolic"] = "Zsym",
["uncoded"] = "Zzzz",
["undetermined"] = "Zyyy",
["unspecified"] = "None",
}
cu4r200bvy8ibvz2mtty0ijbsh3nm5x
Module:string/removeComments
828
3654
43487
2026-05-13T22:50:12Z
Kambai Akau
8
Created page with "local concat = table.concat local find = string.find local sub = string.sub local function remove_comments(str, pre) local pos1 = find(str, "<!--", nil, true) if not pos1 then return str end local pos2 = find(str, "-->", pos1 + 4, true) if not pos2 then return pre and sub(str, 1, pos1 - 1) or str end local ret = sub(str, 1, pos1 - 1) pos1 = pos2 + 3 pos2 = find(str, "<!--", pos1, true) if not pos2 then return ret .. sub(str, pos1) end local pos3 = find..."
43487
Scribunto
text/plain
local concat = table.concat
local find = string.find
local sub = string.sub
local function remove_comments(str, pre)
local pos1 = find(str, "<!--", nil, true)
if not pos1 then
return str
end
local pos2 = find(str, "-->", pos1 + 4, true)
if not pos2 then
return pre and sub(str, 1, pos1 - 1) or str
end
local ret = sub(str, 1, pos1 - 1)
pos1 = pos2 + 3
pos2 = find(str, "<!--", pos1, true)
if not pos2 then
return ret .. sub(str, pos1)
end
local pos3 = find(str, "-->", pos2 + 4, true)
if not pos3 then
return ret .. sub(str, pos1, pre and pos2 - 1 or nil)
end
ret = {ret, sub(str, pos1, pos2 - 1)}
local n = 2
while true do
pos1 = pos3 + 3
pos2 = find(str, "<!--", pos1, true)
if not pos2 then
return concat(ret) .. sub(str, pos1)
end
pos3 = find(str, "-->", pos2 + 4, true)
if not pos3 then
return concat(ret) .. sub(str, pos1, pre and pos2 - 1 or nil)
end
n = n + 1
ret[n] = sub(str, pos1, pos2 - 1)
end
end
--[==[Removes any HTML comments from the input text. `stage` can be one of three options:
* {"PRE"} (default) applies the method used by MediaWiki's preprocessor: all {{code|html|<nowiki><!-- ... --></nowiki>}} pairs are removed, as well as any text after an unclosed {{code|html|<nowiki><!--</nowiki>}}. This is generally suitable when parsing raw template or [[mw:Parser extension tags|parser extension tag]] code. (Note, however, that the actual method used by the preprocessor is considerably more complex and differs under certain conditions (e.g. comments inside nowiki tags); if full accuracy is absolutely necessary, use [[Module:template parser]] instead).
* {"POST"} applies the method used to generate the final page output once all templates have been expanded: it loops over the text, removing any {{code|html|<nowiki><!-- ... --></nowiki>}} pairs until no more are found (e.g. {{code|html|<nowiki><!-<!-- ... -->- ... --></nowiki>}} would be fully removed), but any unclosed {{code|html|<nowiki><!--</nowiki>}} is ignored. This is suitable for handling links embedded in template inputs, where the {"PRE"} method will have already been applied by the native parser.
* {"BOTH"} applies {"PRE"} then {"POST"}.]==]
return function(str, stage)
if stage == nil or stage == "PRE" then
return remove_comments(str, true)
elseif stage == "BOTH" then
str = remove_comments(str, true)
elseif stage ~= "POST" then
error("bad argument #2 to 'remove_comments' (expected PRE, POST, or BOTH)", 2)
end
while true do
local processed = remove_comments(str)
if processed == str then
return str
end
str = processed
end
end
6q2u94c4jook0g0vegx0uyg6x3jd1ms
Module:grc-pronunciation/data
828
3655
43488
2026-05-13T22:55:18Z
Kambai Akau
8
Created page with "local ipairs = ipairs local pairs = pairs local toNFD = mw.ustring.toNFD local U = require("Module:string utilities").char local m_utils_data = require("Module:grc-utilities/data") local diacritics = m_utils_data.diacritics local nonsyllabic = U(0x32F) local high = U(0x341) -- combining acute tone mark local low = U(0x340) -- combining grave tone mark local midHigh = U(0x1DC4) -- mid–high pitch local midLow = U(0x1DC6) -- mid–low pitch local highMid = U(0x1DC7) --..."
43488
Scribunto
text/plain
local ipairs = ipairs
local pairs = pairs
local toNFD = mw.ustring.toNFD
local U = require("Module:string utilities").char
local m_utils_data = require("Module:grc-utilities/data")
local diacritics = m_utils_data.diacritics
local nonsyllabic = U(0x32F)
local high = U(0x341) -- combining acute tone mark
local low = U(0x340) -- combining grave tone mark
local midHigh = U(0x1DC4) -- mid–high pitch
local midLow = U(0x1DC6) -- mid–low pitch
local highMid = U(0x1DC7) -- high–mid pitch
local rising = U(0x30C) -- combining caron
local falling = diacritics.Latin_circum -- combining circumflex
local voiceless = U(0x325) -- combining ring below
local aspirated = "ʰ"
local stress_mark = "ˈ"
local long = "ː"
local macron = diacritics.spacing_macron
local breve = diacritics.spacing_breve
local circumflex_on_long_vowel = falling
local acute_on_long_vowel = rising
local acute_on_short_vowel = high
local grave_pitch_mark = low
local pre_hasMacronBreve = { { "0~hasMacronBreve", 1 }, 0 }
local pre_hasSubI = { { "0~hasSubI", 1 }, 0 }
local pre_hasMacronBreve_hasSubI = { { "0~hasMacronBreve/0~hasSubI", 1 }, 0 }
local data = {}
-- Letters which can be treated as allographs of other letters.
local aliases = {}
for k, v in pairs{
["κͱ"] = "χ",
["ξ"] = "κσ",
["ϝͱ"] = "ϝ̔",
["λͱ"] = "λ̔",
["πͱ"] = "φ",
["ϻ"] = "σ",
["ϙ"] = "κ",
["ϙͱ"] = "χ",
["ς"] = "σ",
["τͱ"] = "θ",
["ψ"] = "πσ"
} do
aliases[toNFD(k)] = toNFD(v)
end
data.aliases = aliases
data[" "] = {
["p"] = {
["cla"] = " ",
["koi1"] = " ",
["koi2"] = " ",
["byz1"] = " ",
["byz2"] = " ",
},
}
data["β"] = {
["p"] = {
["cla"] = "b",
["koi1"] = "b",
["koi2"] = {
{ "-1=μ", "b" },
"β"
},
["byz1"] = {
{ "-1=μ", "b" },
"v"
},
["byz2"] = {
{ "1=β", "" },
{ "-1=μ", "b" },
"v",
},
},
}
data["γ"] = {
["p"] = {
["cla"] = {
{ "1.dorsal/1=μ", "ŋ" },
"ɡ",
},
["koi1"] = {
{ "1.dorsal", "ŋ" },
"ɡ",
},
["koi2"] = {
{ "1.dorsal",
{
{ "1~preFront", "ɲ" },
"ŋ",
},
},
{ "0~preFront",
{
{ "-1=γ/-1=ν", "ɟ" },
"ʝ",
},
},
{ "-1=γ/-1=ν", "ɡ" },
"ɣ", },
["byz1"] = {
{ "1.dorsal",
{
{ "1~preFront", "ɲ" },
"ŋ",
},
},
{ "0~preFront",
{
{ "-1=γ/-1=ν", "ɟ" },
"ʝ",
},
},
{ "-1=γ/-1=ν", "ɡ" },
"ɣ",
},
["byz2"] = {
{ "1.dorsal",
{
{ "1~preFront", "ɲ" },
"ŋ",
},
},
{ "0~preFront",
{
{ "-1=γ/-1=ν", "ɟ" },
"ʝ",
},
},
{ "-1=γ/-1=ν", "ɡ" },
"ɣ",
},
},
}
data["δ"] = {
["p"] = {
["cla"] = "d",
["koi1"] = "d",
["koi2"] = {
{ "-1=ν", "d" },
"ð",
},
["byz1"] = {
{ "-1=ν", "d" },
"ð",
},
["byz2"] = {
{ "1=δ", "" },
{ "-1=ν", "d" },
"ð",
},
},
}
data["ϝ"] = {
["pre"] = {
{ "1=̓/1=̔", 1 },
0,
},
["p"] = {
-- "w" in (early) Epic and Aeolic
["cla"] = "",
["koi1"] = "",
["koi2"] = "",
["byz1"] = "",
["byz2"] = "",
},
}
data["ϝ̓"] = {
["p"] = data["ϝ"]["p"]
}
data["ϝ̔"] = {
["p"] = {
-- "ʍ" in (early) Epic and Aeolic
["cla"] = "h",
["koi1"] = "(h)",
["koi2"] = data["ϝ̓"].p.koi2,
["byz1"] = data["ϝ̓"].p.byz1,
["byz2"] = data["ϝ̓"].p.byz2,
},
}
data["ζ"] = {
["p"] = {
["cla"] = "zd",
["koi1"] = "z",
["koi2"] = "z",
["byz1"] = "z",
["byz2"] = {
{ "1=ζ", "" },
"z",
},
},
}
data["ͱ"] = {
["p"] = {
["cla"] = "h",
["koi1"] = "(h)",
["koi2"] = "",
["byz1"] = "",
["byz2"] = "",
},
}
data["θ"] = {
["p"] = {
["cla"] = "tʰ",
["koi1"] = "tʰ",
["koi2"] = "θ",
["byz1"] = "θ",
["byz2"] = {
{ "1=θ", "" },
"θ",
},
},
}
data["ϳ"] = {
["p"] = {
["cla"] = "j",
["koi1"] = "j",
["koi2"] = "j",
["byz1"] = "j",
["byz2"] = "j",
},
}
data["κ"] = {
["p"] = {
["cla"] = {
{ "1.voiced+1.stop", "ɡ" },
{ "1.aspirated", "kʰ" },
"k",
},
["koi1"] = {
{ "1.voiced+1.stop", "ɡ" },
"k",
},
["koi2"] = {
{ "1=κ", "" },
{ "1.voiced+1.stop", "ɡ" },
{ "-1=γ/-1=ν",
{
{ "0~preFront", "ɟ" },
{ "1.unvoiced", "k" },
"ɡ",
},
},
{ "0~preFront", "c" },
"k",
},
["byz1"] = {
{ "1=κ", "" },
{ "1.voiced+1.stop", "ɡ" },
{ "-1=γ/-1=ν",
{
{ "0~preFront", "ɟ" },
{ "1.unvoiced", "k" },
"ɡ",
},
},
{ "0~preFront", "c" },
"k",
},
["byz2"] = {
{ "1=κ", "" },
{ "1.voiced+1.stop", "ɡ" },
{ "-1=γ/-1=ν",
{
{ "0~preFront", "ɟ" },
{ "1.unvoiced", "k" },
"ɡ",
},
},
{ "0~preFront", "c" },
"k",
},
},
}
data["λ"] = {
["pre"] = {
{ "1=̓/1=̔", 1 },
0,
},
["p"] = {
["cla"] = {
{ "1=λ",
{
{ "2=̔", "l̥" },
"l",
},
},
{ "-1=̔",
{
{ "-2=λ", "l̥" },
"l",
},
},
"l",
},
["koi1"] = "l",
["koi2"] = "l",
["byz1"] = "l",
["byz2"] = {
{ "1=λ", "" },
"l",
},
},
}
data["λ̓"] = {
["p"] = {
["cla"] = {
{ "2=λ",
{
{ "3=̔", "l̥" },
"l",
},
},
"l",
},
["koi1"] = data["λ"].p.koi1,
["koi2"] = data["λ"].p.koi2,
["byz1"] = data["λ"].p.byz1,
["byz2"] = {
{ "2=λ", "" },
"l",
},
},
}
data["λ̔"] = {
["p"] = {
["cla"] = "l̥",
["koi1"] = data["λ̓"].p.koi1,
["koi2"] = data["λ̓"].p.koi2,
["byz1"] = data["λ̓"].p.byz1,
["byz2"] = data["λ̓"].p.byz2,
},
}
data["μ"] = {
["p"] = {
["cla"] = "m",
["koi1"] = "m",
["koi2"] = "m",
["byz1"] = "m",
["byz2"] = {
{ "1=μ", "" },
"m",
},
},
}
data["ν"] = {
["p"] = {
["cla"] = "n",
["koi1"] = "n",
["koi2"] = "n",
["byz1"] = "n",
["byz2"] = {
{ "1=ν", "" },
"n",
},
},
}
data["π"] = {
["p"] = {
["cla"] = {
{ "1.aspirated", "pʰ" },
"p",
},
["koi1"] = "p",
["koi2"] = "p",
["byz1"] = "p",
["byz2"] = {
{ "-1=μ",
{
{ "1.unvoiced", "p" },
"b",
},
},
{ "1=π", "" },
"p",
},
},
}
data["ρ"] = {
["p"] = {
["cla"] = {
{ "1=ρ/1=ῥ/-1=ρ/-1=ῤ/-1=ῥ", "r̥" },
"r",
},
["koi1"] = "r",
["koi2"] = "r",
["byz1"] = "r",
["byz2"] = {
{ "1=ρ/1=ῤ/1=ῥ", "" },
"r",
},
},
}
data["ῤ"] = {
["p"] = {
["cla"] = {
{ "1=ρ/1=ῥ", "r̥" },
"r",
},
["koi1"] = data["ρ"].p.koi1,
["koi2"] = data["ρ"].p.koi2,
["byz1"] = data["ρ"].p.byz1,
["byz2"] = data["ρ"].p.byz2,
},
}
data["ῥ"] = {
["p"] = {
["cla"] = "r̥",
["koi1"] = data["ῤ"].p.koi1,
["koi2"] = data["ῤ"].p.koi2,
["byz1"] = data["ῤ"].p.byz1,
["byz2"] = data["ῤ"].p.byz2,
},
}
data["σ"] = {
["p"] = {
["cla"] = {
{ "1.voiced", "z" },
"s",
},
["koi1"] = {
{ "1.voiced", "z" },
"s",
},
["koi2"] = {
{ "1.voiced", "z" },
"s",
},
["byz1"] = {
{ "1.voiced", "z" },
"s",
},
["byz2"] = {
{ "1=σ", "" },
{ "1.voiced", "z" },
"s",
},
},
}
data["τ"] = {
["pre"] = {
{ "1=ζ/1=σ", 1 },
0,
},
["p"] = {
["cla"] = {
{ "1.aspirated", "tʰ" },
"t",
},
["koi1"] = "t",
["koi2"] = "t",
["byz1"] = {
{ "1=ζ", "d" },
"t",
},
["byz2"] = {
{ "-1=ν", "d" },
{ "1=τ", "" },
"t",
},
},
}
data["τζ"] = {
["p"] = {
["cla"] = {
{ "2.unvoiced", "st" },
{ "-1=ν/2.voiced", "zd" },
"st",
},
["koi1"] = {
{ "2.unvoiced", "ts" },
{ "-1=ν/2.voiced", "dz" },
"ts",
},
["koi2"] = {
{ "2.unvoiced", "ts" },
{ "-1=ν/2.voiced", "dz" },
"ts",
},
["byz1"] = {
{ "2.unvoiced", "ts" },
{ "-1=ν/2.voiced", "dz" },
"ts",
},
["byz2"] = {
{ "2=ζ", "d" },
{ "2.unvoiced", "ts" },
{ "-1=ν/2.voiced", "dz" },
"ts",
},
},
}
data["τσ"] = {
["p"] = {
["cla"] = {
{ "2.voiced", "z" },
"s",
},
["koi1"] = {
{ "2.voiced", "dz" },
"ts",
},
["koi2"] = {
{ "2.voiced", "dz" },
"ts",
},
["byz1"] = {
{ "2.voiced", "dz" },
"ts",
},
["byz2"] = {
{ "2=σ", "t" },
{ "2.voiced", "dz" },
"ts",
},
},
}
data["φ"] = {
["p"] = {
["cla"] = "pʰ",
["koi1"] = "pʰ",
["koi2"] = "ɸ",
["byz1"] = "f",
["byz2"] = {
{ "1=φ", "" },
"f",
},
},
}
data["χ"] = {
["p"] = {
["cla"] = "kʰ",
["koi1"] = "kʰ",
["koi2"] = {
{ "1=χ", "" },
{ "0~preFront", "ç" },
"x" },
["byz1"] = {
{ "1=χ", "" },
{ "0~preFront", "ç" },
"x" },
["byz2"] = {
{ "1=χ", "" },
{ "0~preFront", "ç" },
"x",
},
},
}
data["α"] = {
["pre"] = {
{ "0~hasMacronBreve",
{
{ "1~isIDiphth/1~isUDiphth", 2 },
1,
},
},
{ "0~isIDiphth/0~isUDiphth", 1 },
0,
},
["p"] = {
["cla"] = "a",
["koi1"] = "a",
["koi2"] = "a",
["byz1"] = "a",
["byz2"] = "a",
},
}
data["ε"] = {
["pre"] = {
{ "0~isIDiphth/0~isUDiphth", 1 },
0,
},
["p"] = {
["cla"] = "e",
["koi1"] = "e",
["koi2"] = "e",
["byz1"] = "e",
["byz2"] = "e",
},
}
data["η"] = {
["pre"] = {
{ "0~isIDiphth/0~isUDiphth", 1 },
0,
},
["p"] = {
["cla"] = "ɛ",
["koi1"] = "e̝",
["koi2"] = "i",
["byz1"] = "i",
["byz2"] = "i",
},
}
data["ι"] = {
["p"] = {
["cla"] = "i",
["koi1"] = "i",
["koi2"] = "i",
["byz1"] = "i",
["byz2"] = "i",
},
}
data["ο"] = {
["pre"] = {
{ "0~isIDiphth/0~isUDiphth", 1 },
0,
},
["p"] = {
["cla"] = "o",
["koi1"] = "o",
["koi2"] = "o",
["byz1"] = "o",
["byz2"] = "o",
},
}
data["υ"] = {
["pre"] = {
{ "0~hasMacronBreve",
{
{ "1~isIDiphth", 2 },
1,
},
},
{ "0~isIDiphth/0~hasSubI", 1 },
0,
},
["p"] = {
["cla"] = "y",
["koi1"] = "y",
["koi2"] = "y",
["byz1"] = "y",
["byz2"] = "i",
},
}
data["ω"] = {
["pre"] = {
{ "0~isIDiphth/0~isUDiphth", 1 },
0,
},
["p"] = {
["cla"] = "ɔ",
["koi1"] = "o",
["koi2"] = "o",
["byz1"] = "o",
["byz2"] = "o",
},
}
local categories = {
[1] = {
["stop"] = { "β", "γ", "δ", "θ", "κ", "π", "τ", "φ", "χ" },
["dorsal"] = { "γ", "κ", "χ" },
["voiced"] = { "β", "γ", "δ", "ζ", "λ", "λ̓", "μ", "ν", "ρ", "ῤ" },
["unvoiced"] = { "ͱ", "θ", "κ", "λ̔", "π", "ῥ", "σ", "τ", "φ", "χ" },
["aspirated"] = { "ͱ", "θ", "φ", "χ" },
["diaer"] = { "ϊ", "ΐ", "ῒ", "ῗ", "ϋ", "ΰ", "ῢ", "ῧ", "ϋͅ", "ΰͅ", "ῢͅ", "ῧͅ" },
["subi"] = { "ᾳ", "ᾴ", "ᾲ", "ᾷ", "ᾀ", "ᾄ", "ᾂ", "ᾆ", "ᾁ", "ᾅ", "ᾃ", "ᾇ", "ῃ", "ῄ", "ῂ", "ῇ", "ᾐ", "ᾔ", "ᾒ", "ᾖ", "ᾑ", "ᾕ", "ᾓ", "ᾗ", "υͅ", "ύͅ", "ὺͅ", "ῦͅ", "ὐͅ", "ὔͅ", "ὒͅ", "ὖͅ", "ὑͅ", "ὕͅ", "ὓͅ", "ὗͅ", "ϋͅ", "ΰͅ", "ῢͅ", "ῧͅ", "ῳ", "ῴ", "ῲ", "ῷ", "ᾠ", "ᾤ", "ᾢ", "ᾦ", "ᾡ", "ᾥ", "ᾣ", "ᾧ" },
},
["type"] = {
["vowel"] = { "α", "ε", "η", "ι", "ο", "υ", "ω" }, -- Not currently used; if it were, it might need to include all the accented vowel characters.
["consonant"] = { "β", "γ", "δ", "ζ", "ͱ", "θ", "ϳ", "κ", "λ", "μ", "ν", "π", "ρ", "σ", "τ", "φ", "χ" },
["long"] = { "ᾱ", "η", "ῑ", "ῡ", "ω" },
["short"] = { "ᾰ", "ε", "ῐ", "ο", "ῠ" },
["either"] = { "α", "ι", "υ" },
["diacritic"] = { diacritics.macron, diacritics.spacing_macron, diacritics.modifier_macron, diacritics.breve, diacritics.spacing_breve, diacritics.rough, diacritics.smooth, diacritics.diaeresis, diacritics.acute, diacritics.grave, diacritics.circum, diacritics.Latin_circum, diacritics.coronis, diacritics.subscript },
},
["accent"] = {
["acute"] = { "ά", "ᾴ", "ἄ", "ᾄ", "ἅ", "ᾅ", "έ", "ἔ", "ἕ", "ή", "ῄ", "ἤ", "ᾔ", "ἥ", "ᾕ", "ί", "ἴ", "ἵ", "ΐ", "ό", "ὄ", "ὅ", "ύ", "ύͅ", "ὔ", "ὔͅ", "ὕ", "ὕͅ", "ΰ", "ΰͅ", "ώ", "ῴ", "ὤ", "ᾤ", "ὥ", "ᾥ" },
["grave"] = { "ὰ", "ᾲ", "ἂ", "ᾂ", "ἃ", "ᾃ", "ὲ", "ἒ", "ἓ", "ὴ", "ῂ", "ἢ", "ᾒ", "ἣ", "ᾓ", "ὶ", "ἲ", "ἳ", "ῒ", "ὸ", "ὂ", "ὃ", "ὺ", "ὺͅ", "ὒ", "ὒͅ", "ὓ", "ὓͅ", "ῢ", "ῢͅ", "ὼ", "ῲ", "ὢ", "ᾢ", "ὣ", "ᾣ" },
["circum"] = { "ᾶ", "ᾷ", "ἆ", "ᾆ", "ἇ", "ᾇ", "ῆ", "ῇ", "ἦ", "ᾖ", "ἧ", "ᾗ", "ῖ", "ἶ", "ἷ", "ῗ", "ῦ", "ῦͅ", "ὖ", "ὖͅ", "ὗ", "ὗͅ", "ῧ", "ῧͅ", "ῶ", "ῷ", "ὦ", "ᾦ", "ὧ", "ᾧ" },
},
["breath"] = {
["rough"] = { "ἁ", "ᾁ", "ἅ", "ᾅ", "ἃ", "ᾃ", "ἇ", "ᾇ", "ἑ", "ἕ", "ἓ", "ϝ̔", "ἡ", "ᾑ", "ἥ", "ᾕ", "ἣ", "ᾓ", "ἧ", "ᾗ", "ἱ", "ἵ", "ἳ", "ἷ", "λ̔", "ὁ", "ὅ", "ὃ", "ῥ", "ὑ", "ὑͅ", "ὕ", "ὕͅ", "ὓ", "ὓͅ", "ὗ", "ὗͅ", "ὡ", "ᾡ", "ὥ", "ᾥ", "ὣ", "ᾣ", "ὧ", "ᾧ" },
["smooth"] = { "ἀ", "ᾀ", "ἄ", "ᾄ", "ἂ", "ᾂ", "ἆ", "ᾆ", "ἐ", "ἔ", "ἒ", "ϝ̓", "ἠ", "ᾐ", "ἤ", "ᾔ", "ἢ", "ᾒ", "ἦ", "ᾖ", "ἰ", "ἴ", "ἲ", "ἶ", "λ̓", "ὀ", "ὄ", "ὂ", "ῤ", "ὐ", "ὐͅ", "ὔ", "ὔͅ", "ὒ", "ὒͅ", "ὖ", "ὖͅ", "ὠ", "ᾠ", "ὤ", "ᾤ", "ὢ", "ᾢ", "ὦ", "ᾦ" },
},
}
for key1, list in pairs(categories) do
for key2, letters in pairs(list) do
if type(key1) == "number" then
for _, letter in ipairs(letters) do
if not data[letter] then
data[letter] = {}
end
data[letter][key2] = true
end
elseif type(key1) == "string" then
for _, letter in ipairs(letters) do
if not data[letter] then
data[letter] = {}
end
data[letter][key1] = key2
end
end
end
end
local rough_breathing = { "h", "(h)" }
local smooth_breathing = { "", "" }
local function get_breathing(breathing)
return breathing == "rough" and rough_breathing or smooth_breathing
end
local function get_pitch_marks(accent_type, long)
if accent_type == "acute" then
if long then
return acute_on_long_vowel
else
return acute_on_short_vowel
end
elseif accent_type == "grave" then
return grave_pitch_mark
elseif accent_type == "circum" then
return circumflex_on_long_vowel
end
return ""
end
local function epsilon(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, false)
return {
["cla"] = breathing[1] .. "e" .. pitch,
["koi1"] = breathing[2] .. stress .. "e",
["koi2"] = stress .. "e",
["byz1"] = stress .. "e",
["byz2"] = stress .. "e"
}
end
for _, letter in ipairs{ "ε", "έ", "ὲ", "ἐ", "ἔ", "ἒ", "ἑ", "ἕ", "ἓ" } do
local l_data = data[letter]
l_data.p = epsilon(l_data.breath, l_data.accent)
end
local function omicron(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, false)
return {
["cla"] = breathing[1] .. "o" .. pitch,
["koi1"] = breathing[2] .. stress .. "o",
["koi2"] = stress .. "o",
["byz1"] = stress .. "o",
["byz2"] = stress .. "o"
}
end
for _, letter in ipairs{ "ο", "ό", "ὸ", "ὀ", "ὄ", "ὂ", "ὁ", "ὅ", "ὃ" } do
local l_data = data[letter]
l_data.p = omicron(l_data.breath, l_data.accent)
end
local function eta(breathing, accent, iota)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
local offglide = iota and ("i" .. nonsyllabic) or ""
return {
["cla"] = breathing[1] .. "ɛ" .. pitch .. long .. offglide,
["koi1"] = breathing[2] .. stress .. "e̝",
["koi2"] = stress .. "i",
["byz1"] = stress .. "i",
["byz2"] = stress .. "i"
}
end
for _, letter in ipairs{ "η", "ῃ", "ή", "ῄ", "ὴ", "ῂ", "ῆ", "ῇ", "ἠ", "ᾐ", "ἤ", "ᾔ", "ἢ", "ᾒ", "ἦ", "ᾖ", "ἡ", "ᾑ", "ἥ", "ᾕ", "ἣ", "ᾓ", "ἧ", "ᾗ" } do
local l_data = data[letter]
l_data.p = eta(l_data.breath, l_data.accent, l_data.subi)
end
local function omega(breathing, accent, iota)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
local offglide = iota and ("i" .. nonsyllabic) or ""
return {
["cla"] = breathing[1] .. "ɔ" .. pitch .. long .. offglide,
["koi1"] = breathing[2] .. stress .. "o",
["koi2"] = stress .. "o",
["byz1"] = stress .. "o",
["byz2"] = stress .. "o"
}
end
for _, letter in ipairs{ "ω", "ῳ", "ώ", "ῴ", "ὼ", "ῲ", "ῶ", "ῷ", "ὠ", "ᾠ", "ὤ", "ᾤ", "ὢ", "ᾢ", "ὦ", "ᾦ", "ὡ", "ᾡ", "ὥ", "ᾥ", "ὣ", "ᾣ", "ὧ", "ᾧ" } do
local l_data = data[letter]
l_data.p = omega(l_data.breath, l_data.accent, l_data.subi)
end
local function alpha(breathing, accent, iota, isLong)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, isLong)
local length = (isLong or iota or accent == "circum") and long or ""
local offglide = iota and ("i" .. nonsyllabic) or ""
return {
["cla"] = breathing[1] .. "a" .. pitch .. length .. offglide,
["koi1"] = breathing[2] .. stress .. "a",
["koi2"] = stress .. "a",
["byz1"] = stress .. "a",
["byz2"] = stress .. "a"
}
end
for _, letter in ipairs{ "α", "ᾳ", "ά", "ᾴ", "ὰ", "ᾲ", "ᾶ", "ᾷ", "ἀ", "ᾀ", "ἄ", "ᾄ", "ἂ", "ᾂ", "ἆ", "ᾆ", "ἁ", "ᾁ", "ἅ", "ᾅ", "ἃ", "ᾃ", "ἇ", "ᾇ" } do
local l_data = data[letter]
l_data.p = alpha(l_data.breath, l_data.accent, l_data.subi)
if not (l_data.subi or l_data.accent == "circum") then
if not l_data.pre then
l_data.pre = pre_hasMacronBreve
end
data[letter .. breve] = { p = l_data.p }
data[letter .. macron] = { p = alpha(l_data.breath, l_data.accent, false, true) }
end
end
local function iota(breathing, accent, isLong)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, isLong)
local length = (isLong or accent == "circum") and long or ""
return {
["cla"] = breathing[1] .. "i" .. pitch .. length,
["koi1"] = breathing[2] .. stress .. "i",
["koi2"] = stress .. "i",
["byz1"] = stress .. "i",
["byz2"] = stress .. "i"
}
end
local function ai(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "a" .. pitch .. "i" .. nonsyllabic,
["koi1"] = breathing[2] .. stress .. "ɛ",
["koi2"] = stress .. "ɛ",
["byz1"] = stress .. "e",
["byz2"] = stress .. "e"
}
end
local function ei(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "e" .. pitch .. long,
["koi1"] = breathing[2] .. stress .. "i",
["koi2"] = stress .. "i",
["byz1"] = stress .. "i",
["byz2"] = stress .. "i"
}
end
local function oi(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "o" .. pitch .. "i" .. nonsyllabic,
["koi1"] = breathing[2] .. stress .. "y",
["koi2"] = stress .. "y",
["byz1"] = stress .. "y",
["byz2"] = stress .. "i"
}
end
local function ui(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "y" .. pitch .. long,
["koi1"] = breathing[2] .. stress .. "y",
["koi2"] = stress .. "y",
["byz1"] = stress .. "y",
["byz2"] = stress .. "i"
}
end
for _, letter in ipairs{ "ι", "ί", "ὶ", "ῖ", "ἰ", "ἴ", "ἲ", "ἶ", "ἱ", "ἵ", "ἳ", "ἷ", "ϊ", "ΐ", "ῒ", "ῗ" } do
local l_data = data[letter]
l_data.p = iota(l_data.breath, l_data.accent)
if l_data.accent ~= "circum" then
l_data.pre = pre_hasMacronBreve
data[letter .. breve] = { p = l_data.p }
data[letter .. macron] = { p = iota(l_data.breath, l_data.accent, true) }
end
if not l_data.diar then
local ai_data = { p = ai(l_data.breath, l_data.accent) }
data["α" .. letter] = ai_data
data["α" .. breve .. letter] = ai_data
data["α" .. macron .. letter] = { p = alpha(l_data.breath, l_data.accent, true, true) }
data["ε" .. letter] = { p = ei(l_data.breath, l_data.accent) }
data["η" .. letter] = { p = eta(l_data.breath, l_data.accent, true) }
data["ο" .. letter] = { p = oi(l_data.breath, l_data.accent) }
local ui_data = { p = ui(l_data.breath, l_data.accent) }
data["υ" .. letter] = ui_data
data["υ" .. breve .. letter] = ui_data
data["υ" .. macron .. letter] = ui_data
data["ϋ" .. letter] = ui_data
data["ϋ" .. breve .. letter] = ui_data
data["ϋ" .. macron .. letter] = ui_data
data["ω" .. letter] = { p = omega(l_data.breath, l_data.accent, true) }
end
end
local function upsilon(breathing, accent, iota, isLong)
if iota then
return ui(breathing, accent)
end
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, isLong)
local length = (isLong or accent == "circum") and long or ""
return {
["cla"] = breathing[1] .. "y" .. pitch .. length,
["koi1"] = breathing[2] .. stress .. "y",
["koi2"] = stress .. "y",
["byz1"] = stress .. "y",
["byz2"] = stress .. "i"
}
end
local function au(breathing, accent, marked)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
if marked then
return {
["cla"] = breathing[1] .. "a" .. pitch .. "u" .. nonsyllabic,
["koi1"] = {
{ "3=σ+4.voiced", breathing[2] .. stress .. "a" .. "w" },
{ "3.unvoiced", breathing[2] .. stress .. "a" .. "ʍ" },
breathing[2] .. stress .. "a" .. "w",
},
["koi2"] = {
{ "3=σ+4.voiced", stress .. "aβ" },
{ "3.unvoiced", stress .. "aɸ" },
stress .. "aβ",
},
["byz1"] = {
{ "3=σ+4.voiced/3=τ+4=ζ", stress .. "av" },
{ "3.unvoiced", stress .. "af" },
stress .. "av",
},
["byz2"] = {
{ "3=σ+4.voiced/3=τ+4=ζ", stress .. "av" },
{ "3.unvoiced", stress .. "af" },
stress .. "av",
},
}
end
return {
["cla"] = breathing[1] .. "a" .. pitch .. "u" .. nonsyllabic,
["koi1"] = {
{ "2=σ+3.voiced", breathing[2] .. stress .. "a" .. "w" },
{ "2.unvoiced", breathing[2] .. stress .. "a" .. "ʍ" },
breathing[2] .. stress .. "a" .. "w",
},
["koi2"] = {
{ "2=σ+3.voiced", stress .. "aβ" },
{ "2.unvoiced", stress .. "aɸ" },
stress .. "aβ",
},
["byz1"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "av" },
{ "2.unvoiced", stress .. "af" },
stress .. "av",
},
["byz2"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "av" },
{ "2.unvoiced", stress .. "af" },
stress .. "av",
},
}
end
local function aau(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "a" .. pitch .. "ːu" .. nonsyllabic,
["koi1"] = {
{ "3=σ+4.voiced", breathing[2] .. stress .. "a" .. "w" },
{ "3.unvoiced", breathing[2] .. stress .. "a" .. "ʍ" },
breathing[2] .. stress .. "a" .. "w",
},
["koi2"] = {
{ "3=σ+4.voiced", stress .. "aβ" },
{ "3.unvoiced", stress .. "aɸ" },
stress .. "aβ",
},
["byz1"] = {
{ "3=σ+4.voiced/3=τ+4=ζ", stress .. "av" },
{ "3.unvoiced", stress .. "af" },
stress .. "av",
},
["byz2"] = {
{ "3=σ+4.voiced/3=τ+4=ζ", stress .. "av" },
{ "3.unvoiced", stress .. "af" },
stress .. "av",
},
}
end
local function eu(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "e" .. pitch .. "u" .. nonsyllabic,
["koi1"] = {
{ "2=σ+3.voiced", breathing[2] .. stress .. "e" .. "w" },
{ "2.unvoiced", breathing[2] .. stress .. "e" .. "ʍ" },
breathing[2] .. stress .. "e" .. "w",
},
["koi2"] = {
{ "2=σ+3.voiced", stress .. "eβ" },
{ "2.unvoiced", stress .. "eɸ" },
stress .. "eβ",
},
["byz1"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "ev" },
{ "2.unvoiced", stress .. "ef" },
stress .. "ev",
},
["byz2"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "ev" },
{ "2.unvoiced", stress .. "ef" },
stress .. "ev",
},
}
end
local function hu(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "ɛ" .. pitch .. "ːu" .. nonsyllabic,
["koi1"] = {
{ "2=σ+3.voiced", breathing[2] .. stress .. "e̝" .. "w" },
{ "2.unvoiced", breathing[2] .. stress .. "e̝" .. "ʍ" },
breathing[2] .. stress .. "e̝" .. "w",
},
["koi2"] = {
{ "2=σ+3.voiced", stress .. "iβ" },
{ "2.unvoiced", stress .. "iɸ" },
stress .. "iβ",
},
["byz1"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "iv" },
{ "2.unvoiced", stress .. "if" },
stress .. "iv",
},
["byz2"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "iv" },
{ "2.unvoiced", stress .. "if" },
stress .. "iv",
},
}
end
local function ou(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "u" .. pitch .. long,
["koi1"] = breathing[2] .. stress .. "u",
["koi2"] = stress .. "u",
["byz1"] = stress .. "u",
["byz2"] = stress .. "u"
}
end
local function wu(breathing, accent)
local breathing = get_breathing(breathing)
local stress = accent and stress_mark or ""
local pitch = get_pitch_marks(accent, true)
return {
["cla"] = breathing[1] .. "ɔ" .. pitch .. "ːu" .. nonsyllabic,
["koi1"] = {
{ "2=σ+3.voiced", breathing[2] .. stress .. "o" .. "w" },
{ "2.unvoiced", breathing[2] .. stress .. "o" .. "ʍ" },
breathing[2] .. stress .. "o" .. "w",
},
["koi2"] = {
{ "2=σ+3.voiced", stress .. "oβ" },
{ "2.unvoiced", stress .. "oɸ" },
stress .. "oβ",
},
["byz1"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "ov" },
{ "2.unvoiced", stress .. "of" },
stress .. "ov",
},
["byz2"] = {
{ "2=σ+3.voiced/2=τ+3=ζ", stress .. "ov" },
{ "2.unvoiced", stress .. "of" },
stress .. "ov",
},
}
end
for _, letter in ipairs{ "υ", "υͅ", "ύ", "ύͅ", "ὺ", "ὺͅ", "ῦ", "ῦͅ", "ὐ", "ὐͅ", "ὔ", "ὔͅ", "ὒ", "ὒͅ", "ὖ", "ὖͅ", "ὑ", "ὑͅ", "ὕ", "ὕͅ", "ὓ", "ὓͅ", "ὗ", "ὗͅ", "ϋ", "ϋͅ", "ΰ", "ΰͅ", "ῢ", "ῢͅ", "ῧ", "ῧͅ" } do
local l_data = data[letter]
l_data.p = upsilon(l_data.breath, l_data.accent, l_data.subi)
if not l_data.subi then
if l_data.accent == "circum" then
l_data.pre = pre_hasSubI
else
if letter == "ϋ" then
l_data.pre = data["υ"].pre
elseif letter ~= "υ" then
l_data.pre = pre_hasMacronBreve_hasSubI
end
data[letter .. breve] = { p = l_data.p }
data[letter .. macron] = { p = upsilon(l_data.breath, l_data.accent, false, true) }
end
if not l_data.diar then
data["α" .. letter] = { p = au(l_data.breath, l_data.accent) }
data["α" .. breve .. letter] = { p = au(l_data.breath, l_data.accent, true) }
data["α" .. macron .. letter] = { p = aau(l_data.breath, l_data.accent) }
data["ε" .. letter] = { p = eu(l_data.breath, l_data.accent) }
data["η" .. letter] = { p = hu(l_data.breath, l_data.accent) }
data["ο" .. letter] = { p = ou(l_data.breath, l_data.accent) }
data["ω" .. letter] = { p = wu(l_data.breath, l_data.accent) }
end
end
end
data["chars"] = {
["cons"] = "bɡŋdzklmnprstβðɣɸθxfvɟʝcçwʍj",
["vowel"] = "aeiouyɛɔ",
["diacritic"] = high .. low .. midHigh .. midLow .. highMid .. long .. aspirated .. voiceless .. nonsyllabic .. rising .. falling,
["liquid"] = "rln",
["obst"] = "bɡdkptβðɣɸθxfv",
["frontDiphth"] = "[αο]ι",
["frontVowel"] = "ιηευ",
["iDiaer"] = "ϊΐῒῗ",
["long"] = "ηω",
["short"] = "εο",
["ambig"] = "αιυ",
["uDiphth"] = "αᾰᾱεηοω", -- first members for diphthongs ending in "υ"
["iDiphth"] = "αᾰᾱεηουῠῡω", -- first members for diphthongs ending in "ι"
["Greekdiacritic"] = m_utils_data.all,
["Greekconsonant"] = m_utils_data.consonants
}
data.chars.frontDiphthong = data.chars.frontDiphth
return data
rld1xydw59s6kaigbuu34l3lbs6j8k7
Module:accent qualifier
828
3656
43489
2026-05-13T22:56:22Z
Kambai Akau
8
Created page with "local export = {} local labels_module = "Module:labels" --[==[ Format accent qualifiers. Implements the {{tl|a}} (shortcut for {{tl|accent}}) template. This template is now virtually identical to a non-categorizing {{tl|lb}}, although a few labels display and/or link differently (e.g. Egyptian-language label 'Old Egyptian' displays as "reconstructed Old Egyptian" instead of just "Old Egyptian", and English-language label 'Australia' displays as "General Australian" ins..."
43489
Scribunto
text/plain
local export = {}
local labels_module = "Module:labels"
--[==[
Format accent qualifiers. Implements the {{tl|a}} (shortcut for {{tl|accent}}) template. This template is now virtually
identical to a non-categorizing {{tl|lb}}, although a few labels display and/or link differently (e.g. Egyptian-language
label 'Old Egyptian' displays as "reconstructed Old Egyptian" instead of just "Old Egyptian", and English-language
label 'Australia' displays as "General Australian" instead of just "Australia", and links to
[[w:Australian English phonology]] instead of [[w:Australian English]]).
]==]
function export.format_qualifiers(lang, qualifiers)
return require(labels_module).show_labels {
lang = lang,
labels = qualifiers,
nocat = true,
mode = "accent",
}
end
--[==[
External entry point that implements {{tl|accent}} and {{tl|a}}.
]==]
function export.show(frame)
if not frame.getParent then
error("When calling [[Module:accent qualifier]] internally, use format_qualifiers() not show()")
end
local parent_args = frame:getParent().args
local params = {
[1] = {type = "language", default = "und"},
[2] = {list = true, required = true, default = "{{{2}}}"},
}
local args = require("Module:parameters").process(parent_args, params)
return export.format_qualifiers(args[1], args[2])
end
return export
0rcst5n5s8nsw4jv6sx0zdfg8in23w6
Module:grc-utilities/templates
828
3657
43490
2026-05-13T22:57:49Z
Kambai Akau
8
Created page with "local export = {} local m_table = require('Module:table') local m_utils = require('Module:grc-utilities') local m_data = require('Module:grc-utilities/data') local tag = m_utils.tag local link = m_utils.link local tokenize = m_utils.tokenize local diacritic = m_data.diacritic local U = mw.ustring.char local toNFD = mw.ustring.toNFD local gsub = mw.ustring.gsub local dottedCircle = U(0x25CC) export.addDottedCircle = require("Module:Unicode data").add_dotted_circle fu..."
43490
Scribunto
text/plain
local export = {}
local m_table = require('Module:table')
local m_utils = require('Module:grc-utilities')
local m_data = require('Module:grc-utilities/data')
local tag = m_utils.tag
local link = m_utils.link
local tokenize = m_utils.tokenize
local diacritic = m_data.diacritic
local U = mw.ustring.char
local toNFD = mw.ustring.toNFD
local gsub = mw.ustring.gsub
local dottedCircle = U(0x25CC)
export.addDottedCircle = require("Module:Unicode data").add_dotted_circle
function export.printTokens(frame)
text = frame.args[1]
local token_format = '<span class="Polyt" style="background-color: var(--wikt-palette-grey-2,#E9E9E9);>%s</span>'
local spacing = {
["\n"] = "¶",
["\r"] = "¶",
[" "] = " ",
}
if text then
local tokens = m_table.shallowCopy(tokenize(text))
for i, token in pairs(tokens) do
tokens[i] = token_format:format(string.gsub(token, "%s", spacing))
end
return "|-\n| " .. tag(text) .. " || " .. tag(table.concat(tokens, ", "))
else
error("Provide text to tokenize in first parameter.")
end
end
function export.printDiacritics(frame)
local functionToPrint = frame.args[1] or error('Specify a function in the first parameter.')
local term = frame.args[2] or error('Add text in the second parameter.')
local result = m_utils[functionToPrint](term)
-- Show diacritics above or below a dotted circle.
content = {
term = tag(term),
term_decomposition = tag(export.addDottedCircle(toNFD(term))),
result = tag(result),
result_decomposition = tag(export.addDottedCircle(result)),
}
local output = [[ term (term_decomposition) → result (result_decomposition)]]
local function addContent(item)
return content[item] or ""
end
return (output:gsub("[%a_]+", addContent))
end
function export.decompose(frame)
local params = {
[1] = {},
["link"] = { type = "boolean" },
}
args = require("Module:parameters").process(frame.args, params)
local text = args[1]
text = toNFD(text)
local link = args.link
local composed
if link then
composed = link(text, nil, nil, "-")
else
composed = tag(text)
end
local decomposed = export.addDottedCircle(text)
if link then
local result = {}
for seat, letter in gmatch(decomposed, "(" .. dottedCircle .. "?)(.)") do
local link
if letter then
link = linkNoTag(letter, seat .. letter)
end
table.insert(result, link)
end
decomposed = table.concat(result)
end
decomposed = tag(decomposed)
return composed .. " (" .. decomposed .. ")"
end
function export.tokenize(frame)
local map = require("Module:fun").map
local token_format = '<span class="Polyt" style="background-color: var(--wikt-palette-lightergrey,#EFEFEF);>%s</span>'
local spacing = {
["\n"] = "¶",
["\r"] = "¶",
[" "] = " ",
}
local _tokenize = tokenize
local function tokenize(word, ...)
return _tokenize(word)
end
local function print_tokens(tokens)
if type(tokens) == "string" then
return tokens
end
local output = {}
for i, token in ipairs(tokens) do
output[i] = string.format(token_format, string.gsub(token, "%s", spacing))
end
return table.concat(output, " ")
end
return table.concat(map(print_tokens, map(tokenize, frame.args)), "<br>")
end
return export
b9z9ex8m90uhqsixcrkxldxlb7c9icx
Module:Unicode data/combining classes
828
3658
43491
2026-05-13T22:59:11Z
Kambai Akau
8
Created page with "-- Generated by [[Module:Unicode data/build]] combining_classes_formatted (Unicode version 17.0) return { ["\204\128"] = 230, -- U+0300 ["\204\129"] = 230, -- U+0301 ["\204\130"] = 230, -- U+0302 ["\204\131"] = 230, -- U+0303 ["\204\132"] = 230, -- U+0304 ["\204\133"] = 230, -- U+0305 ["\204\134"] = 230, -- U+0306 ["\204\135"] = 230, -- U+0307 ["\204\136"] = 230, -- U+0308 ["\204\137"] = 230, -- U+0309 ["\204\138"] = 230, -- U+030A ["\204\139"] = 230, -- U+03..."
43491
Scribunto
text/plain
-- Generated by [[Module:Unicode data/build]] combining_classes_formatted (Unicode version 17.0)
return {
["\204\128"] = 230, -- U+0300
["\204\129"] = 230, -- U+0301
["\204\130"] = 230, -- U+0302
["\204\131"] = 230, -- U+0303
["\204\132"] = 230, -- U+0304
["\204\133"] = 230, -- U+0305
["\204\134"] = 230, -- U+0306
["\204\135"] = 230, -- U+0307
["\204\136"] = 230, -- U+0308
["\204\137"] = 230, -- U+0309
["\204\138"] = 230, -- U+030A
["\204\139"] = 230, -- U+030B
["\204\140"] = 230, -- U+030C
["\204\141"] = 230, -- U+030D
["\204\142"] = 230, -- U+030E
["\204\143"] = 230, -- U+030F
["\204\144"] = 230, -- U+0310
["\204\145"] = 230, -- U+0311
["\204\146"] = 230, -- U+0312
["\204\147"] = 230, -- U+0313
["\204\148"] = 230, -- U+0314
["\204\149"] = 232, -- U+0315
["\204\150"] = 220, -- U+0316
["\204\151"] = 220, -- U+0317
["\204\152"] = 220, -- U+0318
["\204\153"] = 220, -- U+0319
["\204\154"] = 232, -- U+031A
["\204\155"] = 216, -- U+031B
["\204\156"] = 220, -- U+031C
["\204\157"] = 220, -- U+031D
["\204\158"] = 220, -- U+031E
["\204\159"] = 220, -- U+031F
["\204\160"] = 220, -- U+0320
["\204\161"] = 202, -- U+0321
["\204\162"] = 202, -- U+0322
["\204\163"] = 220, -- U+0323
["\204\164"] = 220, -- U+0324
["\204\165"] = 220, -- U+0325
["\204\166"] = 220, -- U+0326
["\204\167"] = 202, -- U+0327
["\204\168"] = 202, -- U+0328
["\204\169"] = 220, -- U+0329
["\204\170"] = 220, -- U+032A
["\204\171"] = 220, -- U+032B
["\204\172"] = 220, -- U+032C
["\204\173"] = 220, -- U+032D
["\204\174"] = 220, -- U+032E
["\204\175"] = 220, -- U+032F
["\204\176"] = 220, -- U+0330
["\204\177"] = 220, -- U+0331
["\204\178"] = 220, -- U+0332
["\204\179"] = 220, -- U+0333
["\204\180"] = 1, -- U+0334
["\204\181"] = 1, -- U+0335
["\204\182"] = 1, -- U+0336
["\204\183"] = 1, -- U+0337
["\204\184"] = 1, -- U+0338
["\204\185"] = 220, -- U+0339
["\204\186"] = 220, -- U+033A
["\204\187"] = 220, -- U+033B
["\204\188"] = 220, -- U+033C
["\204\189"] = 230, -- U+033D
["\204\190"] = 230, -- U+033E
["\204\191"] = 230, -- U+033F
["\205\128"] = 230, -- U+0340
["\205\129"] = 230, -- U+0341
["\205\130"] = 230, -- U+0342
["\205\131"] = 230, -- U+0343
["\205\132"] = 230, -- U+0344
["\205\133"] = 240, -- U+0345
["\205\134"] = 230, -- U+0346
["\205\135"] = 220, -- U+0347
["\205\136"] = 220, -- U+0348
["\205\137"] = 220, -- U+0349
["\205\138"] = 230, -- U+034A
["\205\139"] = 230, -- U+034B
["\205\140"] = 230, -- U+034C
["\205\141"] = 220, -- U+034D
["\205\142"] = 220, -- U+034E
["\205\144"] = 230, -- U+0350
["\205\145"] = 230, -- U+0351
["\205\146"] = 230, -- U+0352
["\205\147"] = 220, -- U+0353
["\205\148"] = 220, -- U+0354
["\205\149"] = 220, -- U+0355
["\205\150"] = 220, -- U+0356
["\205\151"] = 230, -- U+0357
["\205\152"] = 232, -- U+0358
["\205\153"] = 220, -- U+0359
["\205\154"] = 220, -- U+035A
["\205\155"] = 230, -- U+035B
["\205\156"] = 233, -- U+035C
["\205\157"] = 234, -- U+035D
["\205\158"] = 234, -- U+035E
["\205\159"] = 233, -- U+035F
["\205\160"] = 234, -- U+0360
["\205\161"] = 234, -- U+0361
["\205\162"] = 233, -- U+0362
["\205\163"] = 230, -- U+0363
["\205\164"] = 230, -- U+0364
["\205\165"] = 230, -- U+0365
["\205\166"] = 230, -- U+0366
["\205\167"] = 230, -- U+0367
["\205\168"] = 230, -- U+0368
["\205\169"] = 230, -- U+0369
["\205\170"] = 230, -- U+036A
["\205\171"] = 230, -- U+036B
["\205\172"] = 230, -- U+036C
["\205\173"] = 230, -- U+036D
["\205\174"] = 230, -- U+036E
["\205\175"] = 230, -- U+036F
["\210\131"] = 230, -- U+0483
["\210\132"] = 230, -- U+0484
["\210\133"] = 230, -- U+0485
["\210\134"] = 230, -- U+0486
["\210\135"] = 230, -- U+0487
["\214\145"] = 220, -- U+0591
["\214\146"] = 230, -- U+0592
["\214\147"] = 230, -- U+0593
["\214\148"] = 230, -- U+0594
["\214\149"] = 230, -- U+0595
["\214\150"] = 220, -- U+0596
["\214\151"] = 230, -- U+0597
["\214\152"] = 230, -- U+0598
["\214\153"] = 230, -- U+0599
["\214\154"] = 222, -- U+059A
["\214\155"] = 220, -- U+059B
["\214\156"] = 230, -- U+059C
["\214\157"] = 230, -- U+059D
["\214\158"] = 230, -- U+059E
["\214\159"] = 230, -- U+059F
["\214\160"] = 230, -- U+05A0
["\214\161"] = 230, -- U+05A1
["\214\162"] = 220, -- U+05A2
["\214\163"] = 220, -- U+05A3
["\214\164"] = 220, -- U+05A4
["\214\165"] = 220, -- U+05A5
["\214\166"] = 220, -- U+05A6
["\214\167"] = 220, -- U+05A7
["\214\168"] = 230, -- U+05A8
["\214\169"] = 230, -- U+05A9
["\214\170"] = 220, -- U+05AA
["\214\171"] = 230, -- U+05AB
["\214\172"] = 230, -- U+05AC
["\214\173"] = 222, -- U+05AD
["\214\174"] = 228, -- U+05AE
["\214\175"] = 230, -- U+05AF
["\214\176"] = 10, -- U+05B0
["\214\177"] = 11, -- U+05B1
["\214\178"] = 12, -- U+05B2
["\214\179"] = 13, -- U+05B3
["\214\180"] = 14, -- U+05B4
["\214\181"] = 15, -- U+05B5
["\214\182"] = 16, -- U+05B6
["\214\183"] = 17, -- U+05B7
["\214\184"] = 18, -- U+05B8
["\214\185"] = 19, -- U+05B9
["\214\186"] = 19, -- U+05BA
["\214\187"] = 20, -- U+05BB
["\214\188"] = 21, -- U+05BC
["\214\189"] = 22, -- U+05BD
["\214\191"] = 23, -- U+05BF
["\215\129"] = 24, -- U+05C1
["\215\130"] = 25, -- U+05C2
["\215\132"] = 230, -- U+05C4
["\215\133"] = 220, -- U+05C5
["\215\135"] = 18, -- U+05C7
["\216\144"] = 230, -- U+0610
["\216\145"] = 230, -- U+0611
["\216\146"] = 230, -- U+0612
["\216\147"] = 230, -- U+0613
["\216\148"] = 230, -- U+0614
["\216\149"] = 230, -- U+0615
["\216\150"] = 230, -- U+0616
["\216\151"] = 230, -- U+0617
["\216\152"] = 30, -- U+0618
["\216\153"] = 31, -- U+0619
["\216\154"] = 32, -- U+061A
["\217\139"] = 27, -- U+064B
["\217\140"] = 28, -- U+064C
["\217\141"] = 29, -- U+064D
["\217\142"] = 30, -- U+064E
["\217\143"] = 31, -- U+064F
["\217\144"] = 32, -- U+0650
["\217\145"] = 33, -- U+0651
["\217\146"] = 34, -- U+0652
["\217\147"] = 230, -- U+0653
["\217\148"] = 230, -- U+0654
["\217\149"] = 220, -- U+0655
["\217\150"] = 220, -- U+0656
["\217\151"] = 230, -- U+0657
["\217\152"] = 230, -- U+0658
["\217\153"] = 230, -- U+0659
["\217\154"] = 230, -- U+065A
["\217\155"] = 230, -- U+065B
["\217\156"] = 220, -- U+065C
["\217\157"] = 230, -- U+065D
["\217\158"] = 230, -- U+065E
["\217\159"] = 220, -- U+065F
["\217\176"] = 35, -- U+0670
["\219\150"] = 230, -- U+06D6
["\219\151"] = 230, -- U+06D7
["\219\152"] = 230, -- U+06D8
["\219\153"] = 230, -- U+06D9
["\219\154"] = 230, -- U+06DA
["\219\155"] = 230, -- U+06DB
["\219\156"] = 230, -- U+06DC
["\219\159"] = 230, -- U+06DF
["\219\160"] = 230, -- U+06E0
["\219\161"] = 230, -- U+06E1
["\219\162"] = 230, -- U+06E2
["\219\163"] = 220, -- U+06E3
["\219\164"] = 230, -- U+06E4
["\219\167"] = 230, -- U+06E7
["\219\168"] = 230, -- U+06E8
["\219\170"] = 220, -- U+06EA
["\219\171"] = 230, -- U+06EB
["\219\172"] = 230, -- U+06EC
["\219\173"] = 220, -- U+06ED
["\220\145"] = 36, -- U+0711
["\220\176"] = 230, -- U+0730
["\220\177"] = 220, -- U+0731
["\220\178"] = 230, -- U+0732
["\220\179"] = 230, -- U+0733
["\220\180"] = 220, -- U+0734
["\220\181"] = 230, -- U+0735
["\220\182"] = 230, -- U+0736
["\220\183"] = 220, -- U+0737
["\220\184"] = 220, -- U+0738
["\220\185"] = 220, -- U+0739
["\220\186"] = 230, -- U+073A
["\220\187"] = 220, -- U+073B
["\220\188"] = 220, -- U+073C
["\220\189"] = 230, -- U+073D
["\220\190"] = 220, -- U+073E
["\220\191"] = 230, -- U+073F
["\221\128"] = 230, -- U+0740
["\221\129"] = 230, -- U+0741
["\221\130"] = 220, -- U+0742
["\221\131"] = 230, -- U+0743
["\221\132"] = 220, -- U+0744
["\221\133"] = 230, -- U+0745
["\221\134"] = 220, -- U+0746
["\221\135"] = 230, -- U+0747
["\221\136"] = 220, -- U+0748
["\221\137"] = 230, -- U+0749
["\221\138"] = 230, -- U+074A
["\223\171"] = 230, -- U+07EB
["\223\172"] = 230, -- U+07EC
["\223\173"] = 230, -- U+07ED
["\223\174"] = 230, -- U+07EE
["\223\175"] = 230, -- U+07EF
["\223\176"] = 230, -- U+07F0
["\223\177"] = 230, -- U+07F1
["\223\178"] = 220, -- U+07F2
["\223\179"] = 230, -- U+07F3
["\223\189"] = 220, -- U+07FD
["\224\160\150"] = 230, -- U+0816
["\224\160\151"] = 230, -- U+0817
["\224\160\152"] = 230, -- U+0818
["\224\160\153"] = 230, -- U+0819
["\224\160\155"] = 230, -- U+081B
["\224\160\156"] = 230, -- U+081C
["\224\160\157"] = 230, -- U+081D
["\224\160\158"] = 230, -- U+081E
["\224\160\159"] = 230, -- U+081F
["\224\160\160"] = 230, -- U+0820
["\224\160\161"] = 230, -- U+0821
["\224\160\162"] = 230, -- U+0822
["\224\160\163"] = 230, -- U+0823
["\224\160\165"] = 230, -- U+0825
["\224\160\166"] = 230, -- U+0826
["\224\160\167"] = 230, -- U+0827
["\224\160\169"] = 230, -- U+0829
["\224\160\170"] = 230, -- U+082A
["\224\160\171"] = 230, -- U+082B
["\224\160\172"] = 230, -- U+082C
["\224\160\173"] = 230, -- U+082D
["\224\161\153"] = 220, -- U+0859
["\224\161\154"] = 220, -- U+085A
["\224\161\155"] = 220, -- U+085B
["\224\162\151"] = 230, -- U+0897
["\224\162\152"] = 230, -- U+0898
["\224\162\153"] = 220, -- U+0899
["\224\162\154"] = 220, -- U+089A
["\224\162\155"] = 220, -- U+089B
["\224\162\156"] = 230, -- U+089C
["\224\162\157"] = 230, -- U+089D
["\224\162\158"] = 230, -- U+089E
["\224\162\159"] = 230, -- U+089F
["\224\163\138"] = 230, -- U+08CA
["\224\163\139"] = 230, -- U+08CB
["\224\163\140"] = 230, -- U+08CC
["\224\163\141"] = 230, -- U+08CD
["\224\163\142"] = 230, -- U+08CE
["\224\163\143"] = 220, -- U+08CF
["\224\163\144"] = 220, -- U+08D0
["\224\163\145"] = 220, -- U+08D1
["\224\163\146"] = 220, -- U+08D2
["\224\163\147"] = 220, -- U+08D3
["\224\163\148"] = 230, -- U+08D4
["\224\163\149"] = 230, -- U+08D5
["\224\163\150"] = 230, -- U+08D6
["\224\163\151"] = 230, -- U+08D7
["\224\163\152"] = 230, -- U+08D8
["\224\163\153"] = 230, -- U+08D9
["\224\163\154"] = 230, -- U+08DA
["\224\163\155"] = 230, -- U+08DB
["\224\163\156"] = 230, -- U+08DC
["\224\163\157"] = 230, -- U+08DD
["\224\163\158"] = 230, -- U+08DE
["\224\163\159"] = 230, -- U+08DF
["\224\163\160"] = 230, -- U+08E0
["\224\163\161"] = 230, -- U+08E1
["\224\163\163"] = 220, -- U+08E3
["\224\163\164"] = 230, -- U+08E4
["\224\163\165"] = 230, -- U+08E5
["\224\163\166"] = 220, -- U+08E6
["\224\163\167"] = 230, -- U+08E7
["\224\163\168"] = 230, -- U+08E8
["\224\163\169"] = 220, -- U+08E9
["\224\163\170"] = 230, -- U+08EA
["\224\163\171"] = 230, -- U+08EB
["\224\163\172"] = 230, -- U+08EC
["\224\163\173"] = 220, -- U+08ED
["\224\163\174"] = 220, -- U+08EE
["\224\163\175"] = 220, -- U+08EF
["\224\163\176"] = 27, -- U+08F0
["\224\163\177"] = 28, -- U+08F1
["\224\163\178"] = 29, -- U+08F2
["\224\163\179"] = 230, -- U+08F3
["\224\163\180"] = 230, -- U+08F4
["\224\163\181"] = 230, -- U+08F5
["\224\163\182"] = 220, -- U+08F6
["\224\163\183"] = 230, -- U+08F7
["\224\163\184"] = 230, -- U+08F8
["\224\163\185"] = 220, -- U+08F9
["\224\163\186"] = 220, -- U+08FA
["\224\163\187"] = 230, -- U+08FB
["\224\163\188"] = 230, -- U+08FC
["\224\163\189"] = 230, -- U+08FD
["\224\163\190"] = 230, -- U+08FE
["\224\163\191"] = 230, -- U+08FF
["\224\164\188"] = 7, -- U+093C
["\224\165\141"] = 9, -- U+094D
["\224\165\145"] = 230, -- U+0951
["\224\165\146"] = 220, -- U+0952
["\224\165\147"] = 230, -- U+0953
["\224\165\148"] = 230, -- U+0954
["\224\166\188"] = 7, -- U+09BC
["\224\167\141"] = 9, -- U+09CD
["\224\167\190"] = 230, -- U+09FE
["\224\168\188"] = 7, -- U+0A3C
["\224\169\141"] = 9, -- U+0A4D
["\224\170\188"] = 7, -- U+0ABC
["\224\171\141"] = 9, -- U+0ACD
["\224\172\188"] = 7, -- U+0B3C
["\224\173\141"] = 9, -- U+0B4D
["\224\175\141"] = 9, -- U+0BCD
["\224\176\188"] = 7, -- U+0C3C
["\224\177\141"] = 9, -- U+0C4D
["\224\177\149"] = 84, -- U+0C55
["\224\177\150"] = 91, -- U+0C56
["\224\178\188"] = 7, -- U+0CBC
["\224\179\141"] = 9, -- U+0CCD
["\224\180\187"] = 9, -- U+0D3B
["\224\180\188"] = 9, -- U+0D3C
["\224\181\141"] = 9, -- U+0D4D
["\224\183\138"] = 9, -- U+0DCA
["\224\184\184"] = 103, -- U+0E38
["\224\184\185"] = 103, -- U+0E39
["\224\184\186"] = 9, -- U+0E3A
["\224\185\136"] = 107, -- U+0E48
["\224\185\137"] = 107, -- U+0E49
["\224\185\138"] = 107, -- U+0E4A
["\224\185\139"] = 107, -- U+0E4B
["\224\186\184"] = 118, -- U+0EB8
["\224\186\185"] = 118, -- U+0EB9
["\224\186\186"] = 9, -- U+0EBA
["\224\187\136"] = 122, -- U+0EC8
["\224\187\137"] = 122, -- U+0EC9
["\224\187\138"] = 122, -- U+0ECA
["\224\187\139"] = 122, -- U+0ECB
["\224\188\152"] = 220, -- U+0F18
["\224\188\153"] = 220, -- U+0F19
["\224\188\181"] = 220, -- U+0F35
["\224\188\183"] = 220, -- U+0F37
["\224\188\185"] = 216, -- U+0F39
["\224\189\177"] = 129, -- U+0F71
["\224\189\178"] = 130, -- U+0F72
["\224\189\180"] = 132, -- U+0F74
["\224\189\186"] = 130, -- U+0F7A
["\224\189\187"] = 130, -- U+0F7B
["\224\189\188"] = 130, -- U+0F7C
["\224\189\189"] = 130, -- U+0F7D
["\224\190\128"] = 130, -- U+0F80
["\224\190\130"] = 230, -- U+0F82
["\224\190\131"] = 230, -- U+0F83
["\224\190\132"] = 9, -- U+0F84
["\224\190\134"] = 230, -- U+0F86
["\224\190\135"] = 230, -- U+0F87
["\224\191\134"] = 220, -- U+0FC6
["\225\128\183"] = 7, -- U+1037
["\225\128\185"] = 9, -- U+1039
["\225\128\186"] = 9, -- U+103A
["\225\130\141"] = 220, -- U+108D
["\225\141\157"] = 230, -- U+135D
["\225\141\158"] = 230, -- U+135E
["\225\141\159"] = 230, -- U+135F
["\225\156\148"] = 9, -- U+1714
["\225\156\149"] = 9, -- U+1715
["\225\156\180"] = 9, -- U+1734
["\225\159\146"] = 9, -- U+17D2
["\225\159\157"] = 230, -- U+17DD
["\225\162\169"] = 228, -- U+18A9
["\225\164\185"] = 222, -- U+1939
["\225\164\186"] = 230, -- U+193A
["\225\164\187"] = 220, -- U+193B
["\225\168\151"] = 230, -- U+1A17
["\225\168\152"] = 220, -- U+1A18
["\225\169\160"] = 9, -- U+1A60
["\225\169\181"] = 230, -- U+1A75
["\225\169\182"] = 230, -- U+1A76
["\225\169\183"] = 230, -- U+1A77
["\225\169\184"] = 230, -- U+1A78
["\225\169\185"] = 230, -- U+1A79
["\225\169\186"] = 230, -- U+1A7A
["\225\169\187"] = 230, -- U+1A7B
["\225\169\188"] = 230, -- U+1A7C
["\225\169\191"] = 220, -- U+1A7F
["\225\170\176"] = 230, -- U+1AB0
["\225\170\177"] = 230, -- U+1AB1
["\225\170\178"] = 230, -- U+1AB2
["\225\170\179"] = 230, -- U+1AB3
["\225\170\180"] = 230, -- U+1AB4
["\225\170\181"] = 220, -- U+1AB5
["\225\170\182"] = 220, -- U+1AB6
["\225\170\183"] = 220, -- U+1AB7
["\225\170\184"] = 220, -- U+1AB8
["\225\170\185"] = 220, -- U+1AB9
["\225\170\186"] = 220, -- U+1ABA
["\225\170\187"] = 230, -- U+1ABB
["\225\170\188"] = 230, -- U+1ABC
["\225\170\189"] = 220, -- U+1ABD
["\225\170\191"] = 220, -- U+1ABF
["\225\171\128"] = 220, -- U+1AC0
["\225\171\129"] = 230, -- U+1AC1
["\225\171\130"] = 230, -- U+1AC2
["\225\171\131"] = 220, -- U+1AC3
["\225\171\132"] = 220, -- U+1AC4
["\225\171\133"] = 230, -- U+1AC5
["\225\171\134"] = 230, -- U+1AC6
["\225\171\135"] = 230, -- U+1AC7
["\225\171\136"] = 230, -- U+1AC8
["\225\171\137"] = 230, -- U+1AC9
["\225\171\138"] = 220, -- U+1ACA
["\225\171\139"] = 230, -- U+1ACB
["\225\171\140"] = 230, -- U+1ACC
["\225\171\141"] = 230, -- U+1ACD
["\225\171\142"] = 230, -- U+1ACE
["\225\172\180"] = 7, -- U+1B34
["\225\173\132"] = 9, -- U+1B44
["\225\173\171"] = 230, -- U+1B6B
["\225\173\172"] = 220, -- U+1B6C
["\225\173\173"] = 230, -- U+1B6D
["\225\173\174"] = 230, -- U+1B6E
["\225\173\175"] = 230, -- U+1B6F
["\225\173\176"] = 230, -- U+1B70
["\225\173\177"] = 230, -- U+1B71
["\225\173\178"] = 230, -- U+1B72
["\225\173\179"] = 230, -- U+1B73
["\225\174\170"] = 9, -- U+1BAA
["\225\174\171"] = 9, -- U+1BAB
["\225\175\166"] = 7, -- U+1BE6
["\225\175\178"] = 9, -- U+1BF2
["\225\175\179"] = 9, -- U+1BF3
["\225\176\183"] = 7, -- U+1C37
["\225\179\144"] = 230, -- U+1CD0
["\225\179\145"] = 230, -- U+1CD1
["\225\179\146"] = 230, -- U+1CD2
["\225\179\148"] = 1, -- U+1CD4
["\225\179\149"] = 220, -- U+1CD5
["\225\179\150"] = 220, -- U+1CD6
["\225\179\151"] = 220, -- U+1CD7
["\225\179\152"] = 220, -- U+1CD8
["\225\179\153"] = 220, -- U+1CD9
["\225\179\154"] = 230, -- U+1CDA
["\225\179\155"] = 230, -- U+1CDB
["\225\179\156"] = 220, -- U+1CDC
["\225\179\157"] = 220, -- U+1CDD
["\225\179\158"] = 220, -- U+1CDE
["\225\179\159"] = 220, -- U+1CDF
["\225\179\160"] = 230, -- U+1CE0
["\225\179\162"] = 1, -- U+1CE2
["\225\179\163"] = 1, -- U+1CE3
["\225\179\164"] = 1, -- U+1CE4
["\225\179\165"] = 1, -- U+1CE5
["\225\179\166"] = 1, -- U+1CE6
["\225\179\167"] = 1, -- U+1CE7
["\225\179\168"] = 1, -- U+1CE8
["\225\179\173"] = 220, -- U+1CED
["\225\179\180"] = 230, -- U+1CF4
["\225\179\184"] = 230, -- U+1CF8
["\225\179\185"] = 230, -- U+1CF9
["\225\183\128"] = 230, -- U+1DC0
["\225\183\129"] = 230, -- U+1DC1
["\225\183\130"] = 220, -- U+1DC2
["\225\183\131"] = 230, -- U+1DC3
["\225\183\132"] = 230, -- U+1DC4
["\225\183\133"] = 230, -- U+1DC5
["\225\183\134"] = 230, -- U+1DC6
["\225\183\135"] = 230, -- U+1DC7
["\225\183\136"] = 230, -- U+1DC8
["\225\183\137"] = 230, -- U+1DC9
["\225\183\138"] = 220, -- U+1DCA
["\225\183\139"] = 230, -- U+1DCB
["\225\183\140"] = 230, -- U+1DCC
["\225\183\141"] = 234, -- U+1DCD
["\225\183\142"] = 214, -- U+1DCE
["\225\183\143"] = 220, -- U+1DCF
["\225\183\144"] = 202, -- U+1DD0
["\225\183\145"] = 230, -- U+1DD1
["\225\183\146"] = 230, -- U+1DD2
["\225\183\147"] = 230, -- U+1DD3
["\225\183\148"] = 230, -- U+1DD4
["\225\183\149"] = 230, -- U+1DD5
["\225\183\150"] = 230, -- U+1DD6
["\225\183\151"] = 230, -- U+1DD7
["\225\183\152"] = 230, -- U+1DD8
["\225\183\153"] = 230, -- U+1DD9
["\225\183\154"] = 230, -- U+1DDA
["\225\183\155"] = 230, -- U+1DDB
["\225\183\156"] = 230, -- U+1DDC
["\225\183\157"] = 230, -- U+1DDD
["\225\183\158"] = 230, -- U+1DDE
["\225\183\159"] = 230, -- U+1DDF
["\225\183\160"] = 230, -- U+1DE0
["\225\183\161"] = 230, -- U+1DE1
["\225\183\162"] = 230, -- U+1DE2
["\225\183\163"] = 230, -- U+1DE3
["\225\183\164"] = 230, -- U+1DE4
["\225\183\165"] = 230, -- U+1DE5
["\225\183\166"] = 230, -- U+1DE6
["\225\183\167"] = 230, -- U+1DE7
["\225\183\168"] = 230, -- U+1DE8
["\225\183\169"] = 230, -- U+1DE9
["\225\183\170"] = 230, -- U+1DEA
["\225\183\171"] = 230, -- U+1DEB
["\225\183\172"] = 230, -- U+1DEC
["\225\183\173"] = 230, -- U+1DED
["\225\183\174"] = 230, -- U+1DEE
["\225\183\175"] = 230, -- U+1DEF
["\225\183\176"] = 230, -- U+1DF0
["\225\183\177"] = 230, -- U+1DF1
["\225\183\178"] = 230, -- U+1DF2
["\225\183\179"] = 230, -- U+1DF3
["\225\183\180"] = 230, -- U+1DF4
["\225\183\181"] = 230, -- U+1DF5
["\225\183\182"] = 232, -- U+1DF6
["\225\183\183"] = 228, -- U+1DF7
["\225\183\184"] = 228, -- U+1DF8
["\225\183\185"] = 220, -- U+1DF9
["\225\183\186"] = 218, -- U+1DFA
["\225\183\187"] = 230, -- U+1DFB
["\225\183\188"] = 233, -- U+1DFC
["\225\183\189"] = 220, -- U+1DFD
["\225\183\190"] = 230, -- U+1DFE
["\225\183\191"] = 220, -- U+1DFF
["\226\131\144"] = 230, -- U+20D0
["\226\131\145"] = 230, -- U+20D1
["\226\131\146"] = 1, -- U+20D2
["\226\131\147"] = 1, -- U+20D3
["\226\131\148"] = 230, -- U+20D4
["\226\131\149"] = 230, -- U+20D5
["\226\131\150"] = 230, -- U+20D6
["\226\131\151"] = 230, -- U+20D7
["\226\131\152"] = 1, -- U+20D8
["\226\131\153"] = 1, -- U+20D9
["\226\131\154"] = 1, -- U+20DA
["\226\131\155"] = 230, -- U+20DB
["\226\131\156"] = 230, -- U+20DC
["\226\131\161"] = 230, -- U+20E1
["\226\131\165"] = 1, -- U+20E5
["\226\131\166"] = 1, -- U+20E6
["\226\131\167"] = 230, -- U+20E7
["\226\131\168"] = 220, -- U+20E8
["\226\131\169"] = 230, -- U+20E9
["\226\131\170"] = 1, -- U+20EA
["\226\131\171"] = 1, -- U+20EB
["\226\131\172"] = 220, -- U+20EC
["\226\131\173"] = 220, -- U+20ED
["\226\131\174"] = 220, -- U+20EE
["\226\131\175"] = 220, -- U+20EF
["\226\131\176"] = 230, -- U+20F0
["\226\179\175"] = 230, -- U+2CEF
["\226\179\176"] = 230, -- U+2CF0
["\226\179\177"] = 230, -- U+2CF1
["\226\181\191"] = 9, -- U+2D7F
["\226\183\160"] = 230, -- U+2DE0
["\226\183\161"] = 230, -- U+2DE1
["\226\183\162"] = 230, -- U+2DE2
["\226\183\163"] = 230, -- U+2DE3
["\226\183\164"] = 230, -- U+2DE4
["\226\183\165"] = 230, -- U+2DE5
["\226\183\166"] = 230, -- U+2DE6
["\226\183\167"] = 230, -- U+2DE7
["\226\183\168"] = 230, -- U+2DE8
["\226\183\169"] = 230, -- U+2DE9
["\226\183\170"] = 230, -- U+2DEA
["\226\183\171"] = 230, -- U+2DEB
["\226\183\172"] = 230, -- U+2DEC
["\226\183\173"] = 230, -- U+2DED
["\226\183\174"] = 230, -- U+2DEE
["\226\183\175"] = 230, -- U+2DEF
["\226\183\176"] = 230, -- U+2DF0
["\226\183\177"] = 230, -- U+2DF1
["\226\183\178"] = 230, -- U+2DF2
["\226\183\179"] = 230, -- U+2DF3
["\226\183\180"] = 230, -- U+2DF4
["\226\183\181"] = 230, -- U+2DF5
["\226\183\182"] = 230, -- U+2DF6
["\226\183\183"] = 230, -- U+2DF7
["\226\183\184"] = 230, -- U+2DF8
["\226\183\185"] = 230, -- U+2DF9
["\226\183\186"] = 230, -- U+2DFA
["\226\183\187"] = 230, -- U+2DFB
["\226\183\188"] = 230, -- U+2DFC
["\226\183\189"] = 230, -- U+2DFD
["\226\183\190"] = 230, -- U+2DFE
["\226\183\191"] = 230, -- U+2DFF
["\227\128\170"] = 218, -- U+302A
["\227\128\171"] = 228, -- U+302B
["\227\128\172"] = 232, -- U+302C
["\227\128\173"] = 222, -- U+302D
["\227\128\174"] = 224, -- U+302E
["\227\128\175"] = 224, -- U+302F
["\227\130\153"] = 8, -- U+3099
["\227\130\154"] = 8, -- U+309A
["\234\153\175"] = 230, -- U+A66F
["\234\153\180"] = 230, -- U+A674
["\234\153\181"] = 230, -- U+A675
["\234\153\182"] = 230, -- U+A676
["\234\153\183"] = 230, -- U+A677
["\234\153\184"] = 230, -- U+A678
["\234\153\185"] = 230, -- U+A679
["\234\153\186"] = 230, -- U+A67A
["\234\153\187"] = 230, -- U+A67B
["\234\153\188"] = 230, -- U+A67C
["\234\153\189"] = 230, -- U+A67D
["\234\154\158"] = 230, -- U+A69E
["\234\154\159"] = 230, -- U+A69F
["\234\155\176"] = 230, -- U+A6F0
["\234\155\177"] = 230, -- U+A6F1
["\234\160\134"] = 9, -- U+A806
["\234\160\172"] = 9, -- U+A82C
["\234\163\132"] = 9, -- U+A8C4
["\234\163\160"] = 230, -- U+A8E0
["\234\163\161"] = 230, -- U+A8E1
["\234\163\162"] = 230, -- U+A8E2
["\234\163\163"] = 230, -- U+A8E3
["\234\163\164"] = 230, -- U+A8E4
["\234\163\165"] = 230, -- U+A8E5
["\234\163\166"] = 230, -- U+A8E6
["\234\163\167"] = 230, -- U+A8E7
["\234\163\168"] = 230, -- U+A8E8
["\234\163\169"] = 230, -- U+A8E9
["\234\163\170"] = 230, -- U+A8EA
["\234\163\171"] = 230, -- U+A8EB
["\234\163\172"] = 230, -- U+A8EC
["\234\163\173"] = 230, -- U+A8ED
["\234\163\174"] = 230, -- U+A8EE
["\234\163\175"] = 230, -- U+A8EF
["\234\163\176"] = 230, -- U+A8F0
["\234\163\177"] = 230, -- U+A8F1
["\234\164\171"] = 220, -- U+A92B
["\234\164\172"] = 220, -- U+A92C
["\234\164\173"] = 220, -- U+A92D
["\234\165\147"] = 9, -- U+A953
["\234\166\179"] = 7, -- U+A9B3
["\234\167\128"] = 9, -- U+A9C0
["\234\170\176"] = 230, -- U+AAB0
["\234\170\178"] = 230, -- U+AAB2
["\234\170\179"] = 230, -- U+AAB3
["\234\170\180"] = 220, -- U+AAB4
["\234\170\183"] = 230, -- U+AAB7
["\234\170\184"] = 230, -- U+AAB8
["\234\170\190"] = 230, -- U+AABE
["\234\170\191"] = 230, -- U+AABF
["\234\171\129"] = 230, -- U+AAC1
["\234\171\182"] = 9, -- U+AAF6
["\234\175\173"] = 9, -- U+ABED
["\239\172\158"] = 26, -- U+FB1E
["\239\184\160"] = 230, -- U+FE20
["\239\184\161"] = 230, -- U+FE21
["\239\184\162"] = 230, -- U+FE22
["\239\184\163"] = 230, -- U+FE23
["\239\184\164"] = 230, -- U+FE24
["\239\184\165"] = 230, -- U+FE25
["\239\184\166"] = 230, -- U+FE26
["\239\184\167"] = 220, -- U+FE27
["\239\184\168"] = 220, -- U+FE28
["\239\184\169"] = 220, -- U+FE29
["\239\184\170"] = 220, -- U+FE2A
["\239\184\171"] = 220, -- U+FE2B
["\239\184\172"] = 220, -- U+FE2C
["\239\184\173"] = 220, -- U+FE2D
["\239\184\174"] = 230, -- U+FE2E
["\239\184\175"] = 230, -- U+FE2F
["\240\144\135\189"] = 220, -- U+101FD
["\240\144\139\160"] = 220, -- U+102E0
["\240\144\141\182"] = 230, -- U+10376
["\240\144\141\183"] = 230, -- U+10377
["\240\144\141\184"] = 230, -- U+10378
["\240\144\141\185"] = 230, -- U+10379
["\240\144\141\186"] = 230, -- U+1037A
["\240\144\168\141"] = 220, -- U+10A0D
["\240\144\168\143"] = 230, -- U+10A0F
["\240\144\168\184"] = 230, -- U+10A38
["\240\144\168\185"] = 1, -- U+10A39
["\240\144\168\186"] = 220, -- U+10A3A
["\240\144\168\191"] = 9, -- U+10A3F
["\240\144\171\165"] = 230, -- U+10AE5
["\240\144\171\166"] = 220, -- U+10AE6
["\240\144\180\164"] = 230, -- U+10D24
["\240\144\180\165"] = 230, -- U+10D25
["\240\144\180\166"] = 230, -- U+10D26
["\240\144\180\167"] = 230, -- U+10D27
["\240\144\181\169"] = 230, -- U+10D69
["\240\144\181\170"] = 230, -- U+10D6A
["\240\144\181\171"] = 230, -- U+10D6B
["\240\144\181\172"] = 230, -- U+10D6C
["\240\144\181\173"] = 230, -- U+10D6D
["\240\144\186\171"] = 230, -- U+10EAB
["\240\144\186\172"] = 230, -- U+10EAC
["\240\144\187\189"] = 220, -- U+10EFD
["\240\144\187\190"] = 220, -- U+10EFE
["\240\144\187\191"] = 220, -- U+10EFF
["\240\144\189\134"] = 220, -- U+10F46
["\240\144\189\135"] = 220, -- U+10F47
["\240\144\189\136"] = 230, -- U+10F48
["\240\144\189\137"] = 230, -- U+10F49
["\240\144\189\138"] = 230, -- U+10F4A
["\240\144\189\139"] = 220, -- U+10F4B
["\240\144\189\140"] = 230, -- U+10F4C
["\240\144\189\141"] = 220, -- U+10F4D
["\240\144\189\142"] = 220, -- U+10F4E
["\240\144\189\143"] = 220, -- U+10F4F
["\240\144\189\144"] = 220, -- U+10F50
["\240\144\190\130"] = 230, -- U+10F82
["\240\144\190\131"] = 220, -- U+10F83
["\240\144\190\132"] = 230, -- U+10F84
["\240\144\190\133"] = 220, -- U+10F85
["\240\145\129\134"] = 9, -- U+11046
["\240\145\129\176"] = 9, -- U+11070
["\240\145\129\191"] = 9, -- U+1107F
["\240\145\130\185"] = 9, -- U+110B9
["\240\145\130\186"] = 7, -- U+110BA
["\240\145\132\128"] = 230, -- U+11100
["\240\145\132\129"] = 230, -- U+11101
["\240\145\132\130"] = 230, -- U+11102
["\240\145\132\179"] = 9, -- U+11133
["\240\145\132\180"] = 9, -- U+11134
["\240\145\133\179"] = 7, -- U+11173
["\240\145\135\128"] = 9, -- U+111C0
["\240\145\135\138"] = 7, -- U+111CA
["\240\145\136\181"] = 9, -- U+11235
["\240\145\136\182"] = 7, -- U+11236
["\240\145\139\169"] = 7, -- U+112E9
["\240\145\139\170"] = 9, -- U+112EA
["\240\145\140\187"] = 7, -- U+1133B
["\240\145\140\188"] = 7, -- U+1133C
["\240\145\141\141"] = 9, -- U+1134D
["\240\145\141\166"] = 230, -- U+11366
["\240\145\141\167"] = 230, -- U+11367
["\240\145\141\168"] = 230, -- U+11368
["\240\145\141\169"] = 230, -- U+11369
["\240\145\141\170"] = 230, -- U+1136A
["\240\145\141\171"] = 230, -- U+1136B
["\240\145\141\172"] = 230, -- U+1136C
["\240\145\141\176"] = 230, -- U+11370
["\240\145\141\177"] = 230, -- U+11371
["\240\145\141\178"] = 230, -- U+11372
["\240\145\141\179"] = 230, -- U+11373
["\240\145\141\180"] = 230, -- U+11374
["\240\145\143\142"] = 9, -- U+113CE
["\240\145\143\143"] = 9, -- U+113CF
["\240\145\143\144"] = 9, -- U+113D0
["\240\145\145\130"] = 9, -- U+11442
["\240\145\145\134"] = 7, -- U+11446
["\240\145\145\158"] = 230, -- U+1145E
["\240\145\147\130"] = 9, -- U+114C2
["\240\145\147\131"] = 7, -- U+114C3
["\240\145\150\191"] = 9, -- U+115BF
["\240\145\151\128"] = 7, -- U+115C0
["\240\145\152\191"] = 9, -- U+1163F
["\240\145\154\182"] = 9, -- U+116B6
["\240\145\154\183"] = 7, -- U+116B7
["\240\145\156\171"] = 9, -- U+1172B
["\240\145\160\185"] = 9, -- U+11839
["\240\145\160\186"] = 7, -- U+1183A
["\240\145\164\189"] = 9, -- U+1193D
["\240\145\164\190"] = 9, -- U+1193E
["\240\145\165\131"] = 7, -- U+11943
["\240\145\167\160"] = 9, -- U+119E0
["\240\145\168\180"] = 9, -- U+11A34
["\240\145\169\135"] = 9, -- U+11A47
["\240\145\170\153"] = 9, -- U+11A99
["\240\145\176\191"] = 9, -- U+11C3F
["\240\145\181\130"] = 7, -- U+11D42
["\240\145\181\132"] = 9, -- U+11D44
["\240\145\181\133"] = 9, -- U+11D45
["\240\145\182\151"] = 9, -- U+11D97
["\240\145\189\129"] = 9, -- U+11F41
["\240\145\189\130"] = 9, -- U+11F42
["\240\150\132\175"] = 9, -- U+1612F
["\240\150\171\176"] = 1, -- U+16AF0
["\240\150\171\177"] = 1, -- U+16AF1
["\240\150\171\178"] = 1, -- U+16AF2
["\240\150\171\179"] = 1, -- U+16AF3
["\240\150\171\180"] = 1, -- U+16AF4
["\240\150\172\176"] = 230, -- U+16B30
["\240\150\172\177"] = 230, -- U+16B31
["\240\150\172\178"] = 230, -- U+16B32
["\240\150\172\179"] = 230, -- U+16B33
["\240\150\172\180"] = 230, -- U+16B34
["\240\150\172\181"] = 230, -- U+16B35
["\240\150\172\182"] = 230, -- U+16B36
["\240\150\191\176"] = 6, -- U+16FF0
["\240\150\191\177"] = 6, -- U+16FF1
["\240\155\178\158"] = 1, -- U+1BC9E
["\240\157\133\165"] = 216, -- U+1D165
["\240\157\133\166"] = 216, -- U+1D166
["\240\157\133\167"] = 1, -- U+1D167
["\240\157\133\168"] = 1, -- U+1D168
["\240\157\133\169"] = 1, -- U+1D169
["\240\157\133\173"] = 226, -- U+1D16D
["\240\157\133\174"] = 216, -- U+1D16E
["\240\157\133\175"] = 216, -- U+1D16F
["\240\157\133\176"] = 216, -- U+1D170
["\240\157\133\177"] = 216, -- U+1D171
["\240\157\133\178"] = 216, -- U+1D172
["\240\157\133\187"] = 220, -- U+1D17B
["\240\157\133\188"] = 220, -- U+1D17C
["\240\157\133\189"] = 220, -- U+1D17D
["\240\157\133\190"] = 220, -- U+1D17E
["\240\157\133\191"] = 220, -- U+1D17F
["\240\157\134\128"] = 220, -- U+1D180
["\240\157\134\129"] = 220, -- U+1D181
["\240\157\134\130"] = 220, -- U+1D182
["\240\157\134\133"] = 230, -- U+1D185
["\240\157\134\134"] = 230, -- U+1D186
["\240\157\134\135"] = 230, -- U+1D187
["\240\157\134\136"] = 230, -- U+1D188
["\240\157\134\137"] = 230, -- U+1D189
["\240\157\134\138"] = 220, -- U+1D18A
["\240\157\134\139"] = 220, -- U+1D18B
["\240\157\134\170"] = 230, -- U+1D1AA
["\240\157\134\171"] = 230, -- U+1D1AB
["\240\157\134\172"] = 230, -- U+1D1AC
["\240\157\134\173"] = 230, -- U+1D1AD
["\240\157\137\130"] = 230, -- U+1D242
["\240\157\137\131"] = 230, -- U+1D243
["\240\157\137\132"] = 230, -- U+1D244
["\240\158\128\128"] = 230, -- U+1E000
["\240\158\128\129"] = 230, -- U+1E001
["\240\158\128\130"] = 230, -- U+1E002
["\240\158\128\131"] = 230, -- U+1E003
["\240\158\128\132"] = 230, -- U+1E004
["\240\158\128\133"] = 230, -- U+1E005
["\240\158\128\134"] = 230, -- U+1E006
["\240\158\128\136"] = 230, -- U+1E008
["\240\158\128\137"] = 230, -- U+1E009
["\240\158\128\138"] = 230, -- U+1E00A
["\240\158\128\139"] = 230, -- U+1E00B
["\240\158\128\140"] = 230, -- U+1E00C
["\240\158\128\141"] = 230, -- U+1E00D
["\240\158\128\142"] = 230, -- U+1E00E
["\240\158\128\143"] = 230, -- U+1E00F
["\240\158\128\144"] = 230, -- U+1E010
["\240\158\128\145"] = 230, -- U+1E011
["\240\158\128\146"] = 230, -- U+1E012
["\240\158\128\147"] = 230, -- U+1E013
["\240\158\128\148"] = 230, -- U+1E014
["\240\158\128\149"] = 230, -- U+1E015
["\240\158\128\150"] = 230, -- U+1E016
["\240\158\128\151"] = 230, -- U+1E017
["\240\158\128\152"] = 230, -- U+1E018
["\240\158\128\155"] = 230, -- U+1E01B
["\240\158\128\156"] = 230, -- U+1E01C
["\240\158\128\157"] = 230, -- U+1E01D
["\240\158\128\158"] = 230, -- U+1E01E
["\240\158\128\159"] = 230, -- U+1E01F
["\240\158\128\160"] = 230, -- U+1E020
["\240\158\128\161"] = 230, -- U+1E021
["\240\158\128\163"] = 230, -- U+1E023
["\240\158\128\164"] = 230, -- U+1E024
["\240\158\128\166"] = 230, -- U+1E026
["\240\158\128\167"] = 230, -- U+1E027
["\240\158\128\168"] = 230, -- U+1E028
["\240\158\128\169"] = 230, -- U+1E029
["\240\158\128\170"] = 230, -- U+1E02A
["\240\158\130\143"] = 230, -- U+1E08F
["\240\158\132\176"] = 230, -- U+1E130
["\240\158\132\177"] = 230, -- U+1E131
["\240\158\132\178"] = 230, -- U+1E132
["\240\158\132\179"] = 230, -- U+1E133
["\240\158\132\180"] = 230, -- U+1E134
["\240\158\132\181"] = 230, -- U+1E135
["\240\158\132\182"] = 230, -- U+1E136
["\240\158\138\174"] = 230, -- U+1E2AE
["\240\158\139\172"] = 230, -- U+1E2EC
["\240\158\139\173"] = 230, -- U+1E2ED
["\240\158\139\174"] = 230, -- U+1E2EE
["\240\158\139\175"] = 230, -- U+1E2EF
["\240\158\147\172"] = 232, -- U+1E4EC
["\240\158\147\173"] = 232, -- U+1E4ED
["\240\158\147\174"] = 220, -- U+1E4EE
["\240\158\147\175"] = 230, -- U+1E4EF
["\240\158\151\174"] = 230, -- U+1E5EE
["\240\158\151\175"] = 220, -- U+1E5EF
["\240\158\163\144"] = 220, -- U+1E8D0
["\240\158\163\145"] = 220, -- U+1E8D1
["\240\158\163\146"] = 220, -- U+1E8D2
["\240\158\163\147"] = 220, -- U+1E8D3
["\240\158\163\148"] = 220, -- U+1E8D4
["\240\158\163\149"] = 220, -- U+1E8D5
["\240\158\163\150"] = 220, -- U+1E8D6
["\240\158\165\132"] = 230, -- U+1E944
["\240\158\165\133"] = 230, -- U+1E945
["\240\158\165\134"] = 230, -- U+1E946
["\240\158\165\135"] = 230, -- U+1E947
["\240\158\165\136"] = 230, -- U+1E948
["\240\158\165\137"] = 230, -- U+1E949
["\240\158\165\138"] = 7, -- U+1E94A
}
ium5zhpfi0yhd8u17lmimgbd73r48zt
Module:Unicode data/blocks
828
3659
43492
2026-05-13T23:00:59Z
Kambai Akau
8
Created page with "-- Generated by [[Module:Unicode data/build]] blocks_formatted (Unicode version 17.0) local blocks = { { 0x000000, 0x00007F, "Basic Latin" }, { 0x000080, 0x0000FF, "Latin-1 Supplement" }, { 0x000100, 0x00017F, "Latin Extended-A" }, { 0x000180, 0x00024F, "Latin Extended-B" }, { 0x000250, 0x0002AF, "..."
43492
Scribunto
text/plain
-- Generated by [[Module:Unicode data/build]] blocks_formatted (Unicode version 17.0)
local blocks = {
{ 0x000000, 0x00007F, "Basic Latin" },
{ 0x000080, 0x0000FF, "Latin-1 Supplement" },
{ 0x000100, 0x00017F, "Latin Extended-A" },
{ 0x000180, 0x00024F, "Latin Extended-B" },
{ 0x000250, 0x0002AF, "IPA Extensions" },
{ 0x0002B0, 0x0002FF, "Spacing Modifier Letters" },
{ 0x000300, 0x00036F, "Combining Diacritical Marks" },
{ 0x000370, 0x0003FF, "Greek and Coptic" },
{ 0x000400, 0x0004FF, "Cyrillic" },
{ 0x000500, 0x00052F, "Cyrillic Supplement" },
{ 0x000530, 0x00058F, "Armenian" },
{ 0x000590, 0x0005FF, "Hebrew" },
{ 0x000600, 0x0006FF, "Arabic" },
{ 0x000700, 0x00074F, "Syriac" },
{ 0x000750, 0x00077F, "Arabic Supplement" },
{ 0x000780, 0x0007BF, "Thaana" },
{ 0x0007C0, 0x0007FF, "NKo" },
{ 0x000800, 0x00083F, "Samaritan" },
{ 0x000840, 0x00085F, "Mandaic" },
{ 0x000860, 0x00086F, "Syriac Supplement" },
{ 0x000870, 0x00089F, "Arabic Extended-B" },
{ 0x0008A0, 0x0008FF, "Arabic Extended-A" },
{ 0x000900, 0x00097F, "Devanagari" },
{ 0x000980, 0x0009FF, "Bengali" },
{ 0x000A00, 0x000A7F, "Gurmukhi" },
{ 0x000A80, 0x000AFF, "Gujarati" },
{ 0x000B00, 0x000B7F, "Oriya" },
{ 0x000B80, 0x000BFF, "Tamil" },
{ 0x000C00, 0x000C7F, "Telugu" },
{ 0x000C80, 0x000CFF, "Kannada" },
{ 0x000D00, 0x000D7F, "Malayalam" },
{ 0x000D80, 0x000DFF, "Sinhala" },
{ 0x000E00, 0x000E7F, "Thai" },
{ 0x000E80, 0x000EFF, "Lao" },
{ 0x000F00, 0x000FFF, "Tibetan" },
{ 0x001000, 0x00109F, "Myanmar" },
{ 0x0010A0, 0x0010FF, "Georgian" },
{ 0x001100, 0x0011FF, "Hangul Jamo" },
{ 0x001200, 0x00137F, "Ethiopic" },
{ 0x001380, 0x00139F, "Ethiopic Supplement" },
{ 0x0013A0, 0x0013FF, "Cherokee" },
{ 0x001400, 0x00167F, "Unified Canadian Aboriginal Syllabics" },
{ 0x001680, 0x00169F, "Ogham" },
{ 0x0016A0, 0x0016FF, "Runic" },
{ 0x001700, 0x00171F, "Tagalog" },
{ 0x001720, 0x00173F, "Hanunoo" },
{ 0x001740, 0x00175F, "Buhid" },
{ 0x001760, 0x00177F, "Tagbanwa" },
{ 0x001780, 0x0017FF, "Khmer" },
{ 0x001800, 0x0018AF, "Mongolian" },
{ 0x0018B0, 0x0018FF, "Unified Canadian Aboriginal Syllabics Extended" },
{ 0x001900, 0x00194F, "Limbu" },
{ 0x001950, 0x00197F, "Tai Le" },
{ 0x001980, 0x0019DF, "New Tai Lue" },
{ 0x0019E0, 0x0019FF, "Khmer Symbols" },
{ 0x001A00, 0x001A1F, "Buginese" },
{ 0x001A20, 0x001AAF, "Tai Tham" },
{ 0x001AB0, 0x001AFF, "Combining Diacritical Marks Extended" },
{ 0x001B00, 0x001B7F, "Balinese" },
{ 0x001B80, 0x001BBF, "Sundanese" },
{ 0x001BC0, 0x001BFF, "Batak" },
{ 0x001C00, 0x001C4F, "Lepcha" },
{ 0x001C50, 0x001C7F, "Ol Chiki" },
{ 0x001C80, 0x001C8F, "Cyrillic Extended-C" },
{ 0x001C90, 0x001CBF, "Georgian Extended" },
{ 0x001CC0, 0x001CCF, "Sundanese Supplement" },
{ 0x001CD0, 0x001CFF, "Vedic Extensions" },
{ 0x001D00, 0x001D7F, "Phonetic Extensions" },
{ 0x001D80, 0x001DBF, "Phonetic Extensions Supplement" },
{ 0x001DC0, 0x001DFF, "Combining Diacritical Marks Supplement" },
{ 0x001E00, 0x001EFF, "Latin Extended Additional" },
{ 0x001F00, 0x001FFF, "Greek Extended" },
{ 0x002000, 0x00206F, "General Punctuation" },
{ 0x002070, 0x00209F, "Superscripts and Subscripts" },
{ 0x0020A0, 0x0020CF, "Currency Symbols" },
{ 0x0020D0, 0x0020FF, "Combining Diacritical Marks for Symbols" },
{ 0x002100, 0x00214F, "Letterlike Symbols" },
{ 0x002150, 0x00218F, "Number Forms" },
{ 0x002190, 0x0021FF, "Arrows" },
{ 0x002200, 0x0022FF, "Mathematical Operators" },
{ 0x002300, 0x0023FF, "Miscellaneous Technical" },
{ 0x002400, 0x00243F, "Control Pictures" },
{ 0x002440, 0x00245F, "Optical Character Recognition" },
{ 0x002460, 0x0024FF, "Enclosed Alphanumerics" },
{ 0x002500, 0x00257F, "Box Drawing" },
{ 0x002580, 0x00259F, "Block Elements" },
{ 0x0025A0, 0x0025FF, "Geometric Shapes" },
{ 0x002600, 0x0026FF, "Miscellaneous Symbols" },
{ 0x002700, 0x0027BF, "Dingbats" },
{ 0x0027C0, 0x0027EF, "Miscellaneous Mathematical Symbols-A" },
{ 0x0027F0, 0x0027FF, "Supplemental Arrows-A" },
{ 0x002800, 0x0028FF, "Braille Patterns" },
{ 0x002900, 0x00297F, "Supplemental Arrows-B" },
{ 0x002980, 0x0029FF, "Miscellaneous Mathematical Symbols-B" },
{ 0x002A00, 0x002AFF, "Supplemental Mathematical Operators" },
{ 0x002B00, 0x002BFF, "Miscellaneous Symbols and Arrows" },
{ 0x002C00, 0x002C5F, "Glagolitic" },
{ 0x002C60, 0x002C7F, "Latin Extended-C" },
{ 0x002C80, 0x002CFF, "Coptic" },
{ 0x002D00, 0x002D2F, "Georgian Supplement" },
{ 0x002D30, 0x002D7F, "Tifinagh" },
{ 0x002D80, 0x002DDF, "Ethiopic Extended" },
{ 0x002DE0, 0x002DFF, "Cyrillic Extended-A" },
{ 0x002E00, 0x002E7F, "Supplemental Punctuation" },
{ 0x002E80, 0x002EFF, "CJK Radicals Supplement" },
{ 0x002F00, 0x002FDF, "Kangxi Radicals" },
{ 0x002FF0, 0x002FFF, "Ideographic Description Characters" },
{ 0x003000, 0x00303F, "CJK Symbols and Punctuation" },
{ 0x003040, 0x00309F, "Hiragana" },
{ 0x0030A0, 0x0030FF, "Katakana" },
{ 0x003100, 0x00312F, "Bopomofo" },
{ 0x003130, 0x00318F, "Hangul Compatibility Jamo" },
{ 0x003190, 0x00319F, "Kanbun" },
{ 0x0031A0, 0x0031BF, "Bopomofo Extended" },
{ 0x0031C0, 0x0031EF, "CJK Strokes" },
{ 0x0031F0, 0x0031FF, "Katakana Phonetic Extensions" },
{ 0x003200, 0x0032FF, "Enclosed CJK Letters and Months" },
{ 0x003300, 0x0033FF, "CJK Compatibility" },
{ 0x003400, 0x004DBF, "CJK Unified Ideographs Extension A" },
{ 0x004DC0, 0x004DFF, "Yijing Hexagram Symbols" },
{ 0x004E00, 0x009FFF, "CJK Unified Ideographs" },
{ 0x00A000, 0x00A48F, "Yi Syllables" },
{ 0x00A490, 0x00A4CF, "Yi Radicals" },
{ 0x00A4D0, 0x00A4FF, "Lisu" },
{ 0x00A500, 0x00A63F, "Vai" },
{ 0x00A640, 0x00A69F, "Cyrillic Extended-B" },
{ 0x00A6A0, 0x00A6FF, "Bamum" },
{ 0x00A700, 0x00A71F, "Modifier Tone Letters" },
{ 0x00A720, 0x00A7FF, "Latin Extended-D" },
{ 0x00A800, 0x00A82F, "Syloti Nagri" },
{ 0x00A830, 0x00A83F, "Common Indic Number Forms" },
{ 0x00A840, 0x00A87F, "Phags-pa" },
{ 0x00A880, 0x00A8DF, "Saurashtra" },
{ 0x00A8E0, 0x00A8FF, "Devanagari Extended" },
{ 0x00A900, 0x00A92F, "Kayah Li" },
{ 0x00A930, 0x00A95F, "Rejang" },
{ 0x00A960, 0x00A97F, "Hangul Jamo Extended-A" },
{ 0x00A980, 0x00A9DF, "Javanese" },
{ 0x00A9E0, 0x00A9FF, "Myanmar Extended-B" },
{ 0x00AA00, 0x00AA5F, "Cham" },
{ 0x00AA60, 0x00AA7F, "Myanmar Extended-A" },
{ 0x00AA80, 0x00AADF, "Tai Viet" },
{ 0x00AAE0, 0x00AAFF, "Meetei Mayek Extensions" },
{ 0x00AB00, 0x00AB2F, "Ethiopic Extended-A" },
{ 0x00AB30, 0x00AB6F, "Latin Extended-E" },
{ 0x00AB70, 0x00ABBF, "Cherokee Supplement" },
{ 0x00ABC0, 0x00ABFF, "Meetei Mayek" },
{ 0x00AC00, 0x00D7AF, "Hangul Syllables" },
{ 0x00D7B0, 0x00D7FF, "Hangul Jamo Extended-B" },
{ 0x00D800, 0x00DB7F, "High Surrogates" },
{ 0x00DB80, 0x00DBFF, "High Private Use Surrogates" },
{ 0x00DC00, 0x00DFFF, "Low Surrogates" },
{ 0x00E000, 0x00F8FF, "Private Use Area" },
{ 0x00F900, 0x00FAFF, "CJK Compatibility Ideographs" },
{ 0x00FB00, 0x00FB4F, "Alphabetic Presentation Forms" },
{ 0x00FB50, 0x00FDFF, "Arabic Presentation Forms-A" },
{ 0x00FE00, 0x00FE0F, "Variation Selectors" },
{ 0x00FE10, 0x00FE1F, "Vertical Forms" },
{ 0x00FE20, 0x00FE2F, "Combining Half Marks" },
{ 0x00FE30, 0x00FE4F, "CJK Compatibility Forms" },
{ 0x00FE50, 0x00FE6F, "Small Form Variants" },
{ 0x00FE70, 0x00FEFF, "Arabic Presentation Forms-B" },
{ 0x00FF00, 0x00FFEF, "Halfwidth and Fullwidth Forms" },
{ 0x00FFF0, 0x00FFFF, "Specials" },
{ 0x010000, 0x01007F, "Linear B Syllabary" },
{ 0x010080, 0x0100FF, "Linear B Ideograms" },
{ 0x010100, 0x01013F, "Aegean Numbers" },
{ 0x010140, 0x01018F, "Ancient Greek Numbers" },
{ 0x010190, 0x0101CF, "Ancient Symbols" },
{ 0x0101D0, 0x0101FF, "Phaistos Disc" },
{ 0x010280, 0x01029F, "Lycian" },
{ 0x0102A0, 0x0102DF, "Carian" },
{ 0x0102E0, 0x0102FF, "Coptic Epact Numbers" },
{ 0x010300, 0x01032F, "Old Italic" },
{ 0x010330, 0x01034F, "Gothic" },
{ 0x010350, 0x01037F, "Old Permic" },
{ 0x010380, 0x01039F, "Ugaritic" },
{ 0x0103A0, 0x0103DF, "Old Persian" },
{ 0x010400, 0x01044F, "Deseret" },
{ 0x010450, 0x01047F, "Shavian" },
{ 0x010480, 0x0104AF, "Osmanya" },
{ 0x0104B0, 0x0104FF, "Osage" },
{ 0x010500, 0x01052F, "Elbasan" },
{ 0x010530, 0x01056F, "Caucasian Albanian" },
{ 0x010570, 0x0105BF, "Vithkuqi" },
{ 0x0105C0, 0x0105FF, "Todhri" },
{ 0x010600, 0x01077F, "Linear A" },
{ 0x010780, 0x0107BF, "Latin Extended-F" },
{ 0x010800, 0x01083F, "Cypriot Syllabary" },
{ 0x010840, 0x01085F, "Imperial Aramaic" },
{ 0x010860, 0x01087F, "Palmyrene" },
{ 0x010880, 0x0108AF, "Nabataean" },
{ 0x0108E0, 0x0108FF, "Hatran" },
{ 0x010900, 0x01091F, "Phoenician" },
{ 0x010920, 0x01093F, "Lydian" },
{ 0x010940, 0x01095F, "Sidetic" },
{ 0x010980, 0x01099F, "Meroitic Hieroglyphs" },
{ 0x0109A0, 0x0109FF, "Meroitic Cursive" },
{ 0x010A00, 0x010A5F, "Kharoshthi" },
{ 0x010A60, 0x010A7F, "Old South Arabian" },
{ 0x010A80, 0x010A9F, "Old North Arabian" },
{ 0x010AC0, 0x010AFF, "Manichaean" },
{ 0x010B00, 0x010B3F, "Avestan" },
{ 0x010B40, 0x010B5F, "Inscriptional Parthian" },
{ 0x010B60, 0x010B7F, "Inscriptional Pahlavi" },
{ 0x010B80, 0x010BAF, "Psalter Pahlavi" },
{ 0x010C00, 0x010C4F, "Old Turkic" },
{ 0x010C80, 0x010CFF, "Old Hungarian" },
{ 0x010D00, 0x010D3F, "Hanifi Rohingya" },
{ 0x010D40, 0x010D8F, "Garay" },
{ 0x010E60, 0x010E7F, "Rumi Numeral Symbols" },
{ 0x010E80, 0x010EBF, "Yezidi" },
{ 0x010EC0, 0x010EFF, "Arabic Extended-C" },
{ 0x010F00, 0x010F2F, "Old Sogdian" },
{ 0x010F30, 0x010F6F, "Sogdian" },
{ 0x010F70, 0x010FAF, "Old Uyghur" },
{ 0x010FB0, 0x010FDF, "Chorasmian" },
{ 0x010FE0, 0x010FFF, "Elymaic" },
{ 0x011000, 0x01107F, "Brahmi" },
{ 0x011080, 0x0110CF, "Kaithi" },
{ 0x0110D0, 0x0110FF, "Sora Sompeng" },
{ 0x011100, 0x01114F, "Chakma" },
{ 0x011150, 0x01117F, "Mahajani" },
{ 0x011180, 0x0111DF, "Sharada" },
{ 0x0111E0, 0x0111FF, "Sinhala Archaic Numbers" },
{ 0x011200, 0x01124F, "Khojki" },
{ 0x011280, 0x0112AF, "Multani" },
{ 0x0112B0, 0x0112FF, "Khudawadi" },
{ 0x011300, 0x01137F, "Grantha" },
{ 0x011380, 0x0113FF, "Tulu-Tigalari" },
{ 0x011400, 0x01147F, "Newa" },
{ 0x011480, 0x0114DF, "Tirhuta" },
{ 0x011580, 0x0115FF, "Siddham" },
{ 0x011600, 0x01165F, "Modi" },
{ 0x011660, 0x01167F, "Mongolian Supplement" },
{ 0x011680, 0x0116CF, "Takri" },
{ 0x0116D0, 0x0116FF, "Myanmar Extended-C" },
{ 0x011700, 0x01174F, "Ahom" },
{ 0x011800, 0x01184F, "Dogra" },
{ 0x0118A0, 0x0118FF, "Warang Citi" },
{ 0x011900, 0x01195F, "Dives Akuru" },
{ 0x0119A0, 0x0119FF, "Nandinagari" },
{ 0x011A00, 0x011A4F, "Zanabazar Square" },
{ 0x011A50, 0x011AAF, "Soyombo" },
{ 0x011AB0, 0x011ABF, "Unified Canadian Aboriginal Syllabics Extended-A" },
{ 0x011AC0, 0x011AFF, "Pau Cin Hau" },
{ 0x011B00, 0x011B5F, "Devanagari Extended-A" },
{ 0x011B60, 0x011B7F, "Sharada Supplement" },
{ 0x011BC0, 0x011BFF, "Sunuwar" },
{ 0x011C00, 0x011C6F, "Bhaiksuki" },
{ 0x011C70, 0x011CBF, "Marchen" },
{ 0x011D00, 0x011D5F, "Masaram Gondi" },
{ 0x011D60, 0x011DAF, "Gunjala Gondi" },
{ 0x011DB0, 0x011DEF, "Tolong Siki" },
{ 0x011EE0, 0x011EFF, "Makasar" },
{ 0x011F00, 0x011F5F, "Kawi" },
{ 0x011FB0, 0x011FBF, "Lisu Supplement" },
{ 0x011FC0, 0x011FFF, "Tamil Supplement" },
{ 0x012000, 0x0123FF, "Cuneiform" },
{ 0x012400, 0x01247F, "Cuneiform Numbers and Punctuation" },
{ 0x012480, 0x01254F, "Early Dynastic Cuneiform" },
{ 0x012F90, 0x012FFF, "Cypro-Minoan" },
{ 0x013000, 0x01342F, "Egyptian Hieroglyphs" },
{ 0x013430, 0x01345F, "Egyptian Hieroglyph Format Controls" },
{ 0x013460, 0x0143FF, "Egyptian Hieroglyphs Extended-A" },
{ 0x014400, 0x01467F, "Anatolian Hieroglyphs" },
{ 0x016100, 0x01613F, "Gurung Khema" },
{ 0x016800, 0x016A3F, "Bamum Supplement" },
{ 0x016A40, 0x016A6F, "Mro" },
{ 0x016A70, 0x016ACF, "Tangsa" },
{ 0x016AD0, 0x016AFF, "Bassa Vah" },
{ 0x016B00, 0x016B8F, "Pahawh Hmong" },
{ 0x016D40, 0x016D7F, "Kirat Rai" },
{ 0x016E40, 0x016E9F, "Medefaidrin" },
{ 0x016EA0, 0x016EDF, "Beria Erfe" },
{ 0x016F00, 0x016F9F, "Miao" },
{ 0x016FE0, 0x016FFF, "Ideographic Symbols and Punctuation" },
{ 0x017000, 0x0187FF, "Tangut" },
{ 0x018800, 0x018AFF, "Tangut Components" },
{ 0x018B00, 0x018CFF, "Khitan Small Script" },
{ 0x018D00, 0x018D7F, "Tangut Supplement" },
{ 0x018D80, 0x018DFF, "Tangut Components Supplement" },
{ 0x01AFF0, 0x01AFFF, "Kana Extended-B" },
{ 0x01B000, 0x01B0FF, "Kana Supplement" },
{ 0x01B100, 0x01B12F, "Kana Extended-A" },
{ 0x01B130, 0x01B16F, "Small Kana Extension" },
{ 0x01B170, 0x01B2FF, "Nushu" },
{ 0x01BC00, 0x01BC9F, "Duployan" },
{ 0x01BCA0, 0x01BCAF, "Shorthand Format Controls" },
{ 0x01CC00, 0x01CEBF, "Symbols for Legacy Computing Supplement" },
{ 0x01CEC0, 0x01CEFF, "Miscellaneous Symbols Supplement" },
{ 0x01CF00, 0x01CFCF, "Znamenny Musical Notation" },
{ 0x01D000, 0x01D0FF, "Byzantine Musical Symbols" },
{ 0x01D100, 0x01D1FF, "Musical Symbols" },
{ 0x01D200, 0x01D24F, "Ancient Greek Musical Notation" },
{ 0x01D2C0, 0x01D2DF, "Kaktovik Numerals" },
{ 0x01D2E0, 0x01D2FF, "Mayan Numerals" },
{ 0x01D300, 0x01D35F, "Tai Xuan Jing Symbols" },
{ 0x01D360, 0x01D37F, "Counting Rod Numerals" },
{ 0x01D400, 0x01D7FF, "Mathematical Alphanumeric Symbols" },
{ 0x01D800, 0x01DAAF, "Sutton SignWriting" },
{ 0x01DF00, 0x01DFFF, "Latin Extended-G" },
{ 0x01E000, 0x01E02F, "Glagolitic Supplement" },
{ 0x01E030, 0x01E08F, "Cyrillic Extended-D" },
{ 0x01E100, 0x01E14F, "Nyiakeng Puachue Hmong" },
{ 0x01E290, 0x01E2BF, "Toto" },
{ 0x01E2C0, 0x01E2FF, "Wancho" },
{ 0x01E4D0, 0x01E4FF, "Nag Mundari" },
{ 0x01E5D0, 0x01E5FF, "Ol Onal" },
{ 0x01E6C0, 0x01E6FF, "Tai Yo" },
{ 0x01E7E0, 0x01E7FF, "Ethiopic Extended-B" },
{ 0x01E800, 0x01E8DF, "Mende Kikakui" },
{ 0x01E900, 0x01E95F, "Adlam" },
{ 0x01EC70, 0x01ECBF, "Indic Siyaq Numbers" },
{ 0x01ED00, 0x01ED4F, "Ottoman Siyaq Numbers" },
{ 0x01EE00, 0x01EEFF, "Arabic Mathematical Alphabetic Symbols" },
{ 0x01F000, 0x01F02F, "Mahjong Tiles" },
{ 0x01F030, 0x01F09F, "Domino Tiles" },
{ 0x01F0A0, 0x01F0FF, "Playing Cards" },
{ 0x01F100, 0x01F1FF, "Enclosed Alphanumeric Supplement" },
{ 0x01F200, 0x01F2FF, "Enclosed Ideographic Supplement" },
{ 0x01F300, 0x01F5FF, "Miscellaneous Symbols and Pictographs" },
{ 0x01F600, 0x01F64F, "Emoticons" },
{ 0x01F650, 0x01F67F, "Ornamental Dingbats" },
{ 0x01F680, 0x01F6FF, "Transport and Map Symbols" },
{ 0x01F700, 0x01F77F, "Alchemical Symbols" },
{ 0x01F780, 0x01F7FF, "Geometric Shapes Extended" },
{ 0x01F800, 0x01F8FF, "Supplemental Arrows-C" },
{ 0x01F900, 0x01F9FF, "Supplemental Symbols and Pictographs" },
{ 0x01FA00, 0x01FA6F, "Chess Symbols" },
{ 0x01FA70, 0x01FAFF, "Symbols and Pictographs Extended-A" },
{ 0x01FB00, 0x01FBFF, "Symbols for Legacy Computing" },
{ 0x020000, 0x02A6DF, "CJK Unified Ideographs Extension B" },
{ 0x02A700, 0x02B73F, "CJK Unified Ideographs Extension C" },
{ 0x02B740, 0x02B81F, "CJK Unified Ideographs Extension D" },
{ 0x02B820, 0x02CEAF, "CJK Unified Ideographs Extension E" },
{ 0x02CEB0, 0x02EBEF, "CJK Unified Ideographs Extension F" },
{ 0x02EBF0, 0x02EE5F, "CJK Unified Ideographs Extension I" },
{ 0x02F800, 0x02FA1F, "CJK Compatibility Ideographs Supplement" },
{ 0x030000, 0x03134F, "CJK Unified Ideographs Extension G" },
{ 0x031350, 0x0323AF, "CJK Unified Ideographs Extension H" },
{ 0x0323B0, 0x03347F, "CJK Unified Ideographs Extension J" },
{ 0x0E0000, 0x0E007F, "Tags" },
{ 0x0E0100, 0x0E01EF, "Variation Selectors Supplement" },
{ 0x0F0000, 0x0FFFFF, "Supplementary Private Use Area-A" },
{ 0x100000, 0x10FFFF, "Supplementary Private Use Area-B" },
}
blocks.length = #blocks
return blocks
074u8xv91hil7kmfiy547ew45hlga1q
Module:Unicode data/category
828
3660
43493
2026-05-13T23:01:45Z
Kambai Akau
8
Created page with "-- Generated by [[Module:Unicode data/build]] category_formatted (Unicode version 17.0) local data = { singles = { [0x00020] = "Zs", [0x00024] = "Sc", [0x00028] = "Ps", [0x00029] = "Pe", [0x0002A] = "Po", [0x0002B] = "Sm", [0x0002C] = "Po", [0x0002D] = "Pd", [0x0005B] = "Ps", [0x0005C] = "Po", [0x0005D] = "Pe", [0x0005E] = "Sk", [0x0005F] = "Pc", [0x00060] = "Sk", [0x0007B] = "Ps", [0x0007C] = "Sm", [0x0007D] = "Pe", [0x0007E] = "Sm"..."
43493
Scribunto
text/plain
-- Generated by [[Module:Unicode data/build]] category_formatted (Unicode version 17.0)
local data = {
singles = {
[0x00020] = "Zs",
[0x00024] = "Sc",
[0x00028] = "Ps",
[0x00029] = "Pe",
[0x0002A] = "Po",
[0x0002B] = "Sm",
[0x0002C] = "Po",
[0x0002D] = "Pd",
[0x0005B] = "Ps",
[0x0005C] = "Po",
[0x0005D] = "Pe",
[0x0005E] = "Sk",
[0x0005F] = "Pc",
[0x00060] = "Sk",
[0x0007B] = "Ps",
[0x0007C] = "Sm",
[0x0007D] = "Pe",
[0x0007E] = "Sm",
[0x000A0] = "Zs",
[0x000A1] = "Po",
[0x000A6] = "So",
[0x000A7] = "Po",
[0x000A8] = "Sk",
[0x000A9] = "So",
[0x000AA] = "Lo",
[0x000AB] = "Pi",
[0x000AC] = "Sm",
[0x000AD] = "Cf",
[0x000AE] = "So",
[0x000AF] = "Sk",
[0x000B0] = "So",
[0x000B1] = "Sm",
[0x000B4] = "Sk",
[0x000B5] = "Ll",
[0x000B8] = "Sk",
[0x000B9] = "No",
[0x000BA] = "Lo",
[0x000BB] = "Pf",
[0x000BF] = "Po",
[0x000D7] = "Sm",
[0x000F7] = "Sm",
[0x00100] = "Lu",
[0x00101] = "Ll",
[0x00102] = "Lu",
[0x00103] = "Ll",
[0x00104] = "Lu",
[0x00105] = "Ll",
[0x00106] = "Lu",
[0x00107] = "Ll",
[0x00108] = "Lu",
[0x00109] = "Ll",
[0x0010A] = "Lu",
[0x0010B] = "Ll",
[0x0010C] = "Lu",
[0x0010D] = "Ll",
[0x0010E] = "Lu",
[0x0010F] = "Ll",
[0x00110] = "Lu",
[0x00111] = "Ll",
[0x00112] = "Lu",
[0x00113] = "Ll",
[0x00114] = "Lu",
[0x00115] = "Ll",
[0x00116] = "Lu",
[0x00117] = "Ll",
[0x00118] = "Lu",
[0x00119] = "Ll",
[0x0011A] = "Lu",
[0x0011B] = "Ll",
[0x0011C] = "Lu",
[0x0011D] = "Ll",
[0x0011E] = "Lu",
[0x0011F] = "Ll",
[0x00120] = "Lu",
[0x00121] = "Ll",
[0x00122] = "Lu",
[0x00123] = "Ll",
[0x00124] = "Lu",
[0x00125] = "Ll",
[0x00126] = "Lu",
[0x00127] = "Ll",
[0x00128] = "Lu",
[0x00129] = "Ll",
[0x0012A] = "Lu",
[0x0012B] = "Ll",
[0x0012C] = "Lu",
[0x0012D] = "Ll",
[0x0012E] = "Lu",
[0x0012F] = "Ll",
[0x00130] = "Lu",
[0x00131] = "Ll",
[0x00132] = "Lu",
[0x00133] = "Ll",
[0x00134] = "Lu",
[0x00135] = "Ll",
[0x00136] = "Lu",
[0x00139] = "Lu",
[0x0013A] = "Ll",
[0x0013B] = "Lu",
[0x0013C] = "Ll",
[0x0013D] = "Lu",
[0x0013E] = "Ll",
[0x0013F] = "Lu",
[0x00140] = "Ll",
[0x00141] = "Lu",
[0x00142] = "Ll",
[0x00143] = "Lu",
[0x00144] = "Ll",
[0x00145] = "Lu",
[0x00146] = "Ll",
[0x00147] = "Lu",
[0x0014A] = "Lu",
[0x0014B] = "Ll",
[0x0014C] = "Lu",
[0x0014D] = "Ll",
[0x0014E] = "Lu",
[0x0014F] = "Ll",
[0x00150] = "Lu",
[0x00151] = "Ll",
[0x00152] = "Lu",
[0x00153] = "Ll",
[0x00154] = "Lu",
[0x00155] = "Ll",
[0x00156] = "Lu",
[0x00157] = "Ll",
[0x00158] = "Lu",
[0x00159] = "Ll",
[0x0015A] = "Lu",
[0x0015B] = "Ll",
[0x0015C] = "Lu",
[0x0015D] = "Ll",
[0x0015E] = "Lu",
[0x0015F] = "Ll",
[0x00160] = "Lu",
[0x00161] = "Ll",
[0x00162] = "Lu",
[0x00163] = "Ll",
[0x00164] = "Lu",
[0x00165] = "Ll",
[0x00166] = "Lu",
[0x00167] = "Ll",
[0x00168] = "Lu",
[0x00169] = "Ll",
[0x0016A] = "Lu",
[0x0016B] = "Ll",
[0x0016C] = "Lu",
[0x0016D] = "Ll",
[0x0016E] = "Lu",
[0x0016F] = "Ll",
[0x00170] = "Lu",
[0x00171] = "Ll",
[0x00172] = "Lu",
[0x00173] = "Ll",
[0x00174] = "Lu",
[0x00175] = "Ll",
[0x00176] = "Lu",
[0x00177] = "Ll",
[0x0017A] = "Ll",
[0x0017B] = "Lu",
[0x0017C] = "Ll",
[0x0017D] = "Lu",
[0x00183] = "Ll",
[0x00184] = "Lu",
[0x00185] = "Ll",
[0x00188] = "Ll",
[0x00192] = "Ll",
[0x00195] = "Ll",
[0x0019E] = "Ll",
[0x001A1] = "Ll",
[0x001A2] = "Lu",
[0x001A3] = "Ll",
[0x001A4] = "Lu",
[0x001A5] = "Ll",
[0x001A8] = "Ll",
[0x001A9] = "Lu",
[0x001AC] = "Lu",
[0x001AD] = "Ll",
[0x001B0] = "Ll",
[0x001B4] = "Ll",
[0x001B5] = "Lu",
[0x001B6] = "Ll",
[0x001BB] = "Lo",
[0x001BC] = "Lu",
[0x001C4] = "Lu",
[0x001C5] = "Lt",
[0x001C6] = "Ll",
[0x001C7] = "Lu",
[0x001C8] = "Lt",
[0x001C9] = "Ll",
[0x001CA] = "Lu",
[0x001CB] = "Lt",
[0x001CC] = "Ll",
[0x001CD] = "Lu",
[0x001CE] = "Ll",
[0x001CF] = "Lu",
[0x001D0] = "Ll",
[0x001D1] = "Lu",
[0x001D2] = "Ll",
[0x001D3] = "Lu",
[0x001D4] = "Ll",
[0x001D5] = "Lu",
[0x001D6] = "Ll",
[0x001D7] = "Lu",
[0x001D8] = "Ll",
[0x001D9] = "Lu",
[0x001DA] = "Ll",
[0x001DB] = "Lu",
[0x001DE] = "Lu",
[0x001DF] = "Ll",
[0x001E0] = "Lu",
[0x001E1] = "Ll",
[0x001E2] = "Lu",
[0x001E3] = "Ll",
[0x001E4] = "Lu",
[0x001E5] = "Ll",
[0x001E6] = "Lu",
[0x001E7] = "Ll",
[0x001E8] = "Lu",
[0x001E9] = "Ll",
[0x001EA] = "Lu",
[0x001EB] = "Ll",
[0x001EC] = "Lu",
[0x001ED] = "Ll",
[0x001EE] = "Lu",
[0x001F1] = "Lu",
[0x001F2] = "Lt",
[0x001F3] = "Ll",
[0x001F4] = "Lu",
[0x001F5] = "Ll",
[0x001F9] = "Ll",
[0x001FA] = "Lu",
[0x001FB] = "Ll",
[0x001FC] = "Lu",
[0x001FD] = "Ll",
[0x001FE] = "Lu",
[0x001FF] = "Ll",
[0x00200] = "Lu",
[0x00201] = "Ll",
[0x00202] = "Lu",
[0x00203] = "Ll",
[0x00204] = "Lu",
[0x00205] = "Ll",
[0x00206] = "Lu",
[0x00207] = "Ll",
[0x00208] = "Lu",
[0x00209] = "Ll",
[0x0020A] = "Lu",
[0x0020B] = "Ll",
[0x0020C] = "Lu",
[0x0020D] = "Ll",
[0x0020E] = "Lu",
[0x0020F] = "Ll",
[0x00210] = "Lu",
[0x00211] = "Ll",
[0x00212] = "Lu",
[0x00213] = "Ll",
[0x00214] = "Lu",
[0x00215] = "Ll",
[0x00216] = "Lu",
[0x00217] = "Ll",
[0x00218] = "Lu",
[0x00219] = "Ll",
[0x0021A] = "Lu",
[0x0021B] = "Ll",
[0x0021C] = "Lu",
[0x0021D] = "Ll",
[0x0021E] = "Lu",
[0x0021F] = "Ll",
[0x00220] = "Lu",
[0x00221] = "Ll",
[0x00222] = "Lu",
[0x00223] = "Ll",
[0x00224] = "Lu",
[0x00225] = "Ll",
[0x00226] = "Lu",
[0x00227] = "Ll",
[0x00228] = "Lu",
[0x00229] = "Ll",
[0x0022A] = "Lu",
[0x0022B] = "Ll",
[0x0022C] = "Lu",
[0x0022D] = "Ll",
[0x0022E] = "Lu",
[0x0022F] = "Ll",
[0x00230] = "Lu",
[0x00231] = "Ll",
[0x00232] = "Lu",
[0x0023C] = "Ll",
[0x00241] = "Lu",
[0x00242] = "Ll",
[0x00247] = "Ll",
[0x00248] = "Lu",
[0x00249] = "Ll",
[0x0024A] = "Lu",
[0x0024B] = "Ll",
[0x0024C] = "Lu",
[0x0024D] = "Ll",
[0x0024E] = "Lu",
[0x002EC] = "Lm",
[0x002ED] = "Sk",
[0x002EE] = "Lm",
[0x00370] = "Lu",
[0x00371] = "Ll",
[0x00372] = "Lu",
[0x00373] = "Ll",
[0x00374] = "Lm",
[0x00375] = "Sk",
[0x00376] = "Lu",
[0x00377] = "Ll",
[0x0037A] = "Lm",
[0x0037E] = "Po",
[0x0037F] = "Lu",
[0x00386] = "Lu",
[0x00387] = "Po",
[0x0038B] = "Cn",
[0x0038C] = "Lu",
[0x0038D] = "Cn",
[0x00390] = "Ll",
[0x003A2] = "Cn",
[0x003CF] = "Lu",
[0x003D8] = "Lu",
[0x003D9] = "Ll",
[0x003DA] = "Lu",
[0x003DB] = "Ll",
[0x003DC] = "Lu",
[0x003DD] = "Ll",
[0x003DE] = "Lu",
[0x003DF] = "Ll",
[0x003E0] = "Lu",
[0x003E1] = "Ll",
[0x003E2] = "Lu",
[0x003E3] = "Ll",
[0x003E4] = "Lu",
[0x003E5] = "Ll",
[0x003E6] = "Lu",
[0x003E7] = "Ll",
[0x003E8] = "Lu",
[0x003E9] = "Ll",
[0x003EA] = "Lu",
[0x003EB] = "Ll",
[0x003EC] = "Lu",
[0x003ED] = "Ll",
[0x003EE] = "Lu",
[0x003F4] = "Lu",
[0x003F5] = "Ll",
[0x003F6] = "Sm",
[0x003F7] = "Lu",
[0x003F8] = "Ll",
[0x00460] = "Lu",
[0x00461] = "Ll",
[0x00462] = "Lu",
[0x00463] = "Ll",
[0x00464] = "Lu",
[0x00465] = "Ll",
[0x00466] = "Lu",
[0x00467] = "Ll",
[0x00468] = "Lu",
[0x00469] = "Ll",
[0x0046A] = "Lu",
[0x0046B] = "Ll",
[0x0046C] = "Lu",
[0x0046D] = "Ll",
[0x0046E] = "Lu",
[0x0046F] = "Ll",
[0x00470] = "Lu",
[0x00471] = "Ll",
[0x00472] = "Lu",
[0x00473] = "Ll",
[0x00474] = "Lu",
[0x00475] = "Ll",
[0x00476] = "Lu",
[0x00477] = "Ll",
[0x00478] = "Lu",
[0x00479] = "Ll",
[0x0047A] = "Lu",
[0x0047B] = "Ll",
[0x0047C] = "Lu",
[0x0047D] = "Ll",
[0x0047E] = "Lu",
[0x0047F] = "Ll",
[0x00480] = "Lu",
[0x00481] = "Ll",
[0x00482] = "So",
[0x0048A] = "Lu",
[0x0048B] = "Ll",
[0x0048C] = "Lu",
[0x0048D] = "Ll",
[0x0048E] = "Lu",
[0x0048F] = "Ll",
[0x00490] = "Lu",
[0x00491] = "Ll",
[0x00492] = "Lu",
[0x00493] = "Ll",
[0x00494] = "Lu",
[0x00495] = "Ll",
[0x00496] = "Lu",
[0x00497] = "Ll",
[0x00498] = "Lu",
[0x00499] = "Ll",
[0x0049A] = "Lu",
[0x0049B] = "Ll",
[0x0049C] = "Lu",
[0x0049D] = "Ll",
[0x0049E] = "Lu",
[0x0049F] = "Ll",
[0x004A0] = "Lu",
[0x004A1] = "Ll",
[0x004A2] = "Lu",
[0x004A3] = "Ll",
[0x004A4] = "Lu",
[0x004A5] = "Ll",
[0x004A6] = "Lu",
[0x004A7] = "Ll",
[0x004A8] = "Lu",
[0x004A9] = "Ll",
[0x004AA] = "Lu",
[0x004AB] = "Ll",
[0x004AC] = "Lu",
[0x004AD] = "Ll",
[0x004AE] = "Lu",
[0x004AF] = "Ll",
[0x004B0] = "Lu",
[0x004B1] = "Ll",
[0x004B2] = "Lu",
[0x004B3] = "Ll",
[0x004B4] = "Lu",
[0x004B5] = "Ll",
[0x004B6] = "Lu",
[0x004B7] = "Ll",
[0x004B8] = "Lu",
[0x004B9] = "Ll",
[0x004BA] = "Lu",
[0x004BB] = "Ll",
[0x004BC] = "Lu",
[0x004BD] = "Ll",
[0x004BE] = "Lu",
[0x004BF] = "Ll",
[0x004C2] = "Ll",
[0x004C3] = "Lu",
[0x004C4] = "Ll",
[0x004C5] = "Lu",
[0x004C6] = "Ll",
[0x004C7] = "Lu",
[0x004C8] = "Ll",
[0x004C9] = "Lu",
[0x004CA] = "Ll",
[0x004CB] = "Lu",
[0x004CC] = "Ll",
[0x004CD] = "Lu",
[0x004D0] = "Lu",
[0x004D1] = "Ll",
[0x004D2] = "Lu",
[0x004D3] = "Ll",
[0x004D4] = "Lu",
[0x004D5] = "Ll",
[0x004D6] = "Lu",
[0x004D7] = "Ll",
[0x004D8] = "Lu",
[0x004D9] = "Ll",
[0x004DA] = "Lu",
[0x004DB] = "Ll",
[0x004DC] = "Lu",
[0x004DD] = "Ll",
[0x004DE] = "Lu",
[0x004DF] = "Ll",
[0x004E0] = "Lu",
[0x004E1] = "Ll",
[0x004E2] = "Lu",
[0x004E3] = "Ll",
[0x004E4] = "Lu",
[0x004E5] = "Ll",
[0x004E6] = "Lu",
[0x004E7] = "Ll",
[0x004E8] = "Lu",
[0x004E9] = "Ll",
[0x004EA] = "Lu",
[0x004EB] = "Ll",
[0x004EC] = "Lu",
[0x004ED] = "Ll",
[0x004EE] = "Lu",
[0x004EF] = "Ll",
[0x004F0] = "Lu",
[0x004F1] = "Ll",
[0x004F2] = "Lu",
[0x004F3] = "Ll",
[0x004F4] = "Lu",
[0x004F5] = "Ll",
[0x004F6] = "Lu",
[0x004F7] = "Ll",
[0x004F8] = "Lu",
[0x004F9] = "Ll",
[0x004FA] = "Lu",
[0x004FB] = "Ll",
[0x004FC] = "Lu",
[0x004FD] = "Ll",
[0x004FE] = "Lu",
[0x004FF] = "Ll",
[0x00500] = "Lu",
[0x00501] = "Ll",
[0x00502] = "Lu",
[0x00503] = "Ll",
[0x00504] = "Lu",
[0x00505] = "Ll",
[0x00506] = "Lu",
[0x00507] = "Ll",
[0x00508] = "Lu",
[0x00509] = "Ll",
[0x0050A] = "Lu",
[0x0050B] = "Ll",
[0x0050C] = "Lu",
[0x0050D] = "Ll",
[0x0050E] = "Lu",
[0x0050F] = "Ll",
[0x00510] = "Lu",
[0x00511] = "Ll",
[0x00512] = "Lu",
[0x00513] = "Ll",
[0x00514] = "Lu",
[0x00515] = "Ll",
[0x00516] = "Lu",
[0x00517] = "Ll",
[0x00518] = "Lu",
[0x00519] = "Ll",
[0x0051A] = "Lu",
[0x0051B] = "Ll",
[0x0051C] = "Lu",
[0x0051D] = "Ll",
[0x0051E] = "Lu",
[0x0051F] = "Ll",
[0x00520] = "Lu",
[0x00521] = "Ll",
[0x00522] = "Lu",
[0x00523] = "Ll",
[0x00524] = "Lu",
[0x00525] = "Ll",
[0x00526] = "Lu",
[0x00527] = "Ll",
[0x00528] = "Lu",
[0x00529] = "Ll",
[0x0052A] = "Lu",
[0x0052B] = "Ll",
[0x0052C] = "Lu",
[0x0052D] = "Ll",
[0x0052E] = "Lu",
[0x0052F] = "Ll",
[0x00530] = "Cn",
[0x00559] = "Lm",
[0x00589] = "Po",
[0x0058A] = "Pd",
[0x0058F] = "Sc",
[0x00590] = "Cn",
[0x005BE] = "Pd",
[0x005BF] = "Mn",
[0x005C0] = "Po",
[0x005C3] = "Po",
[0x005C6] = "Po",
[0x005C7] = "Mn",
[0x0060B] = "Sc",
[0x0061B] = "Po",
[0x0061C] = "Cf",
[0x00640] = "Lm",
[0x00670] = "Mn",
[0x006D4] = "Po",
[0x006D5] = "Lo",
[0x006DD] = "Cf",
[0x006DE] = "So",
[0x006E9] = "So",
[0x006FF] = "Lo",
[0x0070E] = "Cn",
[0x0070F] = "Cf",
[0x00710] = "Lo",
[0x00711] = "Mn",
[0x007B1] = "Lo",
[0x007F6] = "So",
[0x007FA] = "Lm",
[0x007FD] = "Mn",
[0x0081A] = "Lm",
[0x00824] = "Lm",
[0x00828] = "Lm",
[0x0083F] = "Cn",
[0x0085E] = "Po",
[0x0085F] = "Cn",
[0x00888] = "Sk",
[0x008C9] = "Lm",
[0x008E2] = "Cf",
[0x00903] = "Mc",
[0x0093A] = "Mn",
[0x0093B] = "Mc",
[0x0093C] = "Mn",
[0x0093D] = "Lo",
[0x0094D] = "Mn",
[0x00950] = "Lo",
[0x00970] = "Po",
[0x00971] = "Lm",
[0x00981] = "Mn",
[0x00984] = "Cn",
[0x009A9] = "Cn",
[0x009B1] = "Cn",
[0x009B2] = "Lo",
[0x009BC] = "Mn",
[0x009BD] = "Lo",
[0x009CD] = "Mn",
[0x009CE] = "Lo",
[0x009D7] = "Mc",
[0x009DE] = "Cn",
[0x009FA] = "So",
[0x009FB] = "Sc",
[0x009FC] = "Lo",
[0x009FD] = "Po",
[0x009FE] = "Mn",
[0x00A03] = "Mc",
[0x00A04] = "Cn",
[0x00A29] = "Cn",
[0x00A31] = "Cn",
[0x00A34] = "Cn",
[0x00A37] = "Cn",
[0x00A3C] = "Mn",
[0x00A3D] = "Cn",
[0x00A51] = "Mn",
[0x00A5D] = "Cn",
[0x00A5E] = "Lo",
[0x00A75] = "Mn",
[0x00A76] = "Po",
[0x00A83] = "Mc",
[0x00A84] = "Cn",
[0x00A8E] = "Cn",
[0x00A92] = "Cn",
[0x00AA9] = "Cn",
[0x00AB1] = "Cn",
[0x00AB4] = "Cn",
[0x00ABC] = "Mn",
[0x00ABD] = "Lo",
[0x00AC6] = "Cn",
[0x00AC9] = "Mc",
[0x00ACA] = "Cn",
[0x00ACD] = "Mn",
[0x00AD0] = "Lo",
[0x00AF0] = "Po",
[0x00AF1] = "Sc",
[0x00AF9] = "Lo",
[0x00B00] = "Cn",
[0x00B01] = "Mn",
[0x00B04] = "Cn",
[0x00B29] = "Cn",
[0x00B31] = "Cn",
[0x00B34] = "Cn",
[0x00B3C] = "Mn",
[0x00B3D] = "Lo",
[0x00B3E] = "Mc",
[0x00B3F] = "Mn",
[0x00B40] = "Mc",
[0x00B4D] = "Mn",
[0x00B57] = "Mc",
[0x00B5E] = "Cn",
[0x00B70] = "So",
[0x00B71] = "Lo",
[0x00B82] = "Mn",
[0x00B83] = "Lo",
[0x00B84] = "Cn",
[0x00B91] = "Cn",
[0x00B9B] = "Cn",
[0x00B9C] = "Lo",
[0x00B9D] = "Cn",
[0x00BC0] = "Mn",
[0x00BC9] = "Cn",
[0x00BCD] = "Mn",
[0x00BD0] = "Lo",
[0x00BD7] = "Mc",
[0x00BF9] = "Sc",
[0x00BFA] = "So",
[0x00C00] = "Mn",
[0x00C04] = "Mn",
[0x00C0D] = "Cn",
[0x00C11] = "Cn",
[0x00C29] = "Cn",
[0x00C3C] = "Mn",
[0x00C3D] = "Lo",
[0x00C45] = "Cn",
[0x00C49] = "Cn",
[0x00C57] = "Cn",
[0x00C5B] = "Cn",
[0x00C77] = "Po",
[0x00C7F] = "So",
[0x00C80] = "Lo",
[0x00C81] = "Mn",
[0x00C84] = "Po",
[0x00C8D] = "Cn",
[0x00C91] = "Cn",
[0x00CA9] = "Cn",
[0x00CB4] = "Cn",
[0x00CBC] = "Mn",
[0x00CBD] = "Lo",
[0x00CBE] = "Mc",
[0x00CBF] = "Mn",
[0x00CC5] = "Cn",
[0x00CC6] = "Mn",
[0x00CC9] = "Cn",
[0x00CDF] = "Cn",
[0x00CF0] = "Cn",
[0x00CF3] = "Mc",
[0x00D0D] = "Cn",
[0x00D11] = "Cn",
[0x00D3D] = "Lo",
[0x00D45] = "Cn",
[0x00D49] = "Cn",
[0x00D4D] = "Mn",
[0x00D4E] = "Lo",
[0x00D4F] = "So",
[0x00D57] = "Mc",
[0x00D79] = "So",
[0x00D80] = "Cn",
[0x00D81] = "Mn",
[0x00D84] = "Cn",
[0x00DB2] = "Cn",
[0x00DBC] = "Cn",
[0x00DBD] = "Lo",
[0x00DCA] = "Mn",
[0x00DD5] = "Cn",
[0x00DD6] = "Mn",
[0x00DD7] = "Cn",
[0x00DF4] = "Po",
[0x00E31] = "Mn",
[0x00E3F] = "Sc",
[0x00E46] = "Lm",
[0x00E4F] = "Po",
[0x00E83] = "Cn",
[0x00E84] = "Lo",
[0x00E85] = "Cn",
[0x00E8B] = "Cn",
[0x00EA4] = "Cn",
[0x00EA5] = "Lo",
[0x00EA6] = "Cn",
[0x00EB1] = "Mn",
[0x00EBD] = "Lo",
[0x00EC5] = "Cn",
[0x00EC6] = "Lm",
[0x00EC7] = "Cn",
[0x00ECF] = "Cn",
[0x00F00] = "Lo",
[0x00F13] = "So",
[0x00F14] = "Po",
[0x00F34] = "So",
[0x00F35] = "Mn",
[0x00F36] = "So",
[0x00F37] = "Mn",
[0x00F38] = "So",
[0x00F39] = "Mn",
[0x00F3A] = "Ps",
[0x00F3B] = "Pe",
[0x00F3C] = "Ps",
[0x00F3D] = "Pe",
[0x00F48] = "Cn",
[0x00F7F] = "Mc",
[0x00F85] = "Po",
[0x00F98] = "Cn",
[0x00FBD] = "Cn",
[0x00FC6] = "Mn",
[0x00FCD] = "Cn",
[0x01031] = "Mc",
[0x01038] = "Mc",
[0x0103F] = "Lo",
[0x01061] = "Lo",
[0x01082] = "Mn",
[0x0108D] = "Mn",
[0x0108E] = "Lo",
[0x0108F] = "Mc",
[0x0109D] = "Mn",
[0x010C6] = "Cn",
[0x010C7] = "Lu",
[0x010CD] = "Lu",
[0x010FB] = "Po",
[0x010FC] = "Lm",
[0x01249] = "Cn",
[0x01257] = "Cn",
[0x01258] = "Lo",
[0x01259] = "Cn",
[0x01289] = "Cn",
[0x012B1] = "Cn",
[0x012BF] = "Cn",
[0x012C0] = "Lo",
[0x012C1] = "Cn",
[0x012D7] = "Cn",
[0x01311] = "Cn",
[0x01400] = "Pd",
[0x0166D] = "So",
[0x0166E] = "Po",
[0x01680] = "Zs",
[0x0169B] = "Ps",
[0x0169C] = "Pe",
[0x01715] = "Mc",
[0x01734] = "Mc",
[0x0176D] = "Cn",
[0x01771] = "Cn",
[0x017B6] = "Mc",
[0x017C6] = "Mn",
[0x017D7] = "Lm",
[0x017DB] = "Sc",
[0x017DC] = "Lo",
[0x017DD] = "Mn",
[0x01806] = "Pd",
[0x0180E] = "Cf",
[0x0180F] = "Mn",
[0x01843] = "Lm",
[0x018A9] = "Mn",
[0x018AA] = "Lo",
[0x0191F] = "Cn",
[0x01932] = "Mn",
[0x01940] = "So",
[0x019DA] = "No",
[0x01A1B] = "Mn",
[0x01A55] = "Mc",
[0x01A56] = "Mn",
[0x01A57] = "Mc",
[0x01A5F] = "Cn",
[0x01A60] = "Mn",
[0x01A61] = "Mc",
[0x01A62] = "Mn",
[0x01A7F] = "Mn",
[0x01AA7] = "Lm",
[0x01ABE] = "Me",
[0x01B04] = "Mc",
[0x01B34] = "Mn",
[0x01B35] = "Mc",
[0x01B3B] = "Mc",
[0x01B3C] = "Mn",
[0x01B42] = "Mn",
[0x01B4D] = "Cn",
[0x01B82] = "Mc",
[0x01BA1] = "Mc",
[0x01BAA] = "Mc",
[0x01BE6] = "Mn",
[0x01BE7] = "Mc",
[0x01BED] = "Mn",
[0x01BEE] = "Mc",
[0x01C89] = "Lu",
[0x01C8A] = "Ll",
[0x01CD3] = "Po",
[0x01CE1] = "Mc",
[0x01CED] = "Mn",
[0x01CF4] = "Mn",
[0x01CF7] = "Mc",
[0x01CFA] = "Lo",
[0x01D78] = "Lm",
[0x01E00] = "Lu",
[0x01E01] = "Ll",
[0x01E02] = "Lu",
[0x01E03] = "Ll",
[0x01E04] = "Lu",
[0x01E05] = "Ll",
[0x01E06] = "Lu",
[0x01E07] = "Ll",
[0x01E08] = "Lu",
[0x01E09] = "Ll",
[0x01E0A] = "Lu",
[0x01E0B] = "Ll",
[0x01E0C] = "Lu",
[0x01E0D] = "Ll",
[0x01E0E] = "Lu",
[0x01E0F] = "Ll",
[0x01E10] = "Lu",
[0x01E11] = "Ll",
[0x01E12] = "Lu",
[0x01E13] = "Ll",
[0x01E14] = "Lu",
[0x01E15] = "Ll",
[0x01E16] = "Lu",
[0x01E17] = "Ll",
[0x01E18] = "Lu",
[0x01E19] = "Ll",
[0x01E1A] = "Lu",
[0x01E1B] = "Ll",
[0x01E1C] = "Lu",
[0x01E1D] = "Ll",
[0x01E1E] = "Lu",
[0x01E1F] = "Ll",
[0x01E20] = "Lu",
[0x01E21] = "Ll",
[0x01E22] = "Lu",
[0x01E23] = "Ll",
[0x01E24] = "Lu",
[0x01E25] = "Ll",
[0x01E26] = "Lu",
[0x01E27] = "Ll",
[0x01E28] = "Lu",
[0x01E29] = "Ll",
[0x01E2A] = "Lu",
[0x01E2B] = "Ll",
[0x01E2C] = "Lu",
[0x01E2D] = "Ll",
[0x01E2E] = "Lu",
[0x01E2F] = "Ll",
[0x01E30] = "Lu",
[0x01E31] = "Ll",
[0x01E32] = "Lu",
[0x01E33] = "Ll",
[0x01E34] = "Lu",
[0x01E35] = "Ll",
[0x01E36] = "Lu",
[0x01E37] = "Ll",
[0x01E38] = "Lu",
[0x01E39] = "Ll",
[0x01E3A] = "Lu",
[0x01E3B] = "Ll",
[0x01E3C] = "Lu",
[0x01E3D] = "Ll",
[0x01E3E] = "Lu",
[0x01E3F] = "Ll",
[0x01E40] = "Lu",
[0x01E41] = "Ll",
[0x01E42] = "Lu",
[0x01E43] = "Ll",
[0x01E44] = "Lu",
[0x01E45] = "Ll",
[0x01E46] = "Lu",
[0x01E47] = "Ll",
[0x01E48] = "Lu",
[0x01E49] = "Ll",
[0x01E4A] = "Lu",
[0x01E4B] = "Ll",
[0x01E4C] = "Lu",
[0x01E4D] = "Ll",
[0x01E4E] = "Lu",
[0x01E4F] = "Ll",
[0x01E50] = "Lu",
[0x01E51] = "Ll",
[0x01E52] = "Lu",
[0x01E53] = "Ll",
[0x01E54] = "Lu",
[0x01E55] = "Ll",
[0x01E56] = "Lu",
[0x01E57] = "Ll",
[0x01E58] = "Lu",
[0x01E59] = "Ll",
[0x01E5A] = "Lu",
[0x01E5B] = "Ll",
[0x01E5C] = "Lu",
[0x01E5D] = "Ll",
[0x01E5E] = "Lu",
[0x01E5F] = "Ll",
[0x01E60] = "Lu",
[0x01E61] = "Ll",
[0x01E62] = "Lu",
[0x01E63] = "Ll",
[0x01E64] = "Lu",
[0x01E65] = "Ll",
[0x01E66] = "Lu",
[0x01E67] = "Ll",
[0x01E68] = "Lu",
[0x01E69] = "Ll",
[0x01E6A] = "Lu",
[0x01E6B] = "Ll",
[0x01E6C] = "Lu",
[0x01E6D] = "Ll",
[0x01E6E] = "Lu",
[0x01E6F] = "Ll",
[0x01E70] = "Lu",
[0x01E71] = "Ll",
[0x01E72] = "Lu",
[0x01E73] = "Ll",
[0x01E74] = "Lu",
[0x01E75] = "Ll",
[0x01E76] = "Lu",
[0x01E77] = "Ll",
[0x01E78] = "Lu",
[0x01E79] = "Ll",
[0x01E7A] = "Lu",
[0x01E7B] = "Ll",
[0x01E7C] = "Lu",
[0x01E7D] = "Ll",
[0x01E7E] = "Lu",
[0x01E7F] = "Ll",
[0x01E80] = "Lu",
[0x01E81] = "Ll",
[0x01E82] = "Lu",
[0x01E83] = "Ll",
[0x01E84] = "Lu",
[0x01E85] = "Ll",
[0x01E86] = "Lu",
[0x01E87] = "Ll",
[0x01E88] = "Lu",
[0x01E89] = "Ll",
[0x01E8A] = "Lu",
[0x01E8B] = "Ll",
[0x01E8C] = "Lu",
[0x01E8D] = "Ll",
[0x01E8E] = "Lu",
[0x01E8F] = "Ll",
[0x01E90] = "Lu",
[0x01E91] = "Ll",
[0x01E92] = "Lu",
[0x01E93] = "Ll",
[0x01E94] = "Lu",
[0x01E9E] = "Lu",
[0x01E9F] = "Ll",
[0x01EA0] = "Lu",
[0x01EA1] = "Ll",
[0x01EA2] = "Lu",
[0x01EA3] = "Ll",
[0x01EA4] = "Lu",
[0x01EA5] = "Ll",
[0x01EA6] = "Lu",
[0x01EA7] = "Ll",
[0x01EA8] = "Lu",
[0x01EA9] = "Ll",
[0x01EAA] = "Lu",
[0x01EAB] = "Ll",
[0x01EAC] = "Lu",
[0x01EAD] = "Ll",
[0x01EAE] = "Lu",
[0x01EAF] = "Ll",
[0x01EB0] = "Lu",
[0x01EB1] = "Ll",
[0x01EB2] = "Lu",
[0x01EB3] = "Ll",
[0x01EB4] = "Lu",
[0x01EB5] = "Ll",
[0x01EB6] = "Lu",
[0x01EB7] = "Ll",
[0x01EB8] = "Lu",
[0x01EB9] = "Ll",
[0x01EBA] = "Lu",
[0x01EBB] = "Ll",
[0x01EBC] = "Lu",
[0x01EBD] = "Ll",
[0x01EBE] = "Lu",
[0x01EBF] = "Ll",
[0x01EC0] = "Lu",
[0x01EC1] = "Ll",
[0x01EC2] = "Lu",
[0x01EC3] = "Ll",
[0x01EC4] = "Lu",
[0x01EC5] = "Ll",
[0x01EC6] = "Lu",
[0x01EC7] = "Ll",
[0x01EC8] = "Lu",
[0x01EC9] = "Ll",
[0x01ECA] = "Lu",
[0x01ECB] = "Ll",
[0x01ECC] = "Lu",
[0x01ECD] = "Ll",
[0x01ECE] = "Lu",
[0x01ECF] = "Ll",
[0x01ED0] = "Lu",
[0x01ED1] = "Ll",
[0x01ED2] = "Lu",
[0x01ED3] = "Ll",
[0x01ED4] = "Lu",
[0x01ED5] = "Ll",
[0x01ED6] = "Lu",
[0x01ED7] = "Ll",
[0x01ED8] = "Lu",
[0x01ED9] = "Ll",
[0x01EDA] = "Lu",
[0x01EDB] = "Ll",
[0x01EDC] = "Lu",
[0x01EDD] = "Ll",
[0x01EDE] = "Lu",
[0x01EDF] = "Ll",
[0x01EE0] = "Lu",
[0x01EE1] = "Ll",
[0x01EE2] = "Lu",
[0x01EE3] = "Ll",
[0x01EE4] = "Lu",
[0x01EE5] = "Ll",
[0x01EE6] = "Lu",
[0x01EE7] = "Ll",
[0x01EE8] = "Lu",
[0x01EE9] = "Ll",
[0x01EEA] = "Lu",
[0x01EEB] = "Ll",
[0x01EEC] = "Lu",
[0x01EED] = "Ll",
[0x01EEE] = "Lu",
[0x01EEF] = "Ll",
[0x01EF0] = "Lu",
[0x01EF1] = "Ll",
[0x01EF2] = "Lu",
[0x01EF3] = "Ll",
[0x01EF4] = "Lu",
[0x01EF5] = "Ll",
[0x01EF6] = "Lu",
[0x01EF7] = "Ll",
[0x01EF8] = "Lu",
[0x01EF9] = "Ll",
[0x01EFA] = "Lu",
[0x01EFB] = "Ll",
[0x01EFC] = "Lu",
[0x01EFD] = "Ll",
[0x01EFE] = "Lu",
[0x01F58] = "Cn",
[0x01F59] = "Lu",
[0x01F5A] = "Cn",
[0x01F5B] = "Lu",
[0x01F5C] = "Cn",
[0x01F5D] = "Lu",
[0x01F5E] = "Cn",
[0x01F5F] = "Lu",
[0x01FB5] = "Cn",
[0x01FBC] = "Lt",
[0x01FBD] = "Sk",
[0x01FBE] = "Ll",
[0x01FC5] = "Cn",
[0x01FCC] = "Lt",
[0x01FDC] = "Cn",
[0x01FF5] = "Cn",
[0x01FFC] = "Lt",
[0x01FFF] = "Cn",
[0x02018] = "Pi",
[0x02019] = "Pf",
[0x0201A] = "Ps",
[0x0201D] = "Pf",
[0x0201E] = "Ps",
[0x0201F] = "Pi",
[0x02028] = "Zl",
[0x02029] = "Zp",
[0x0202F] = "Zs",
[0x02039] = "Pi",
[0x0203A] = "Pf",
[0x02044] = "Sm",
[0x02045] = "Ps",
[0x02046] = "Pe",
[0x02052] = "Sm",
[0x02053] = "Po",
[0x02054] = "Pc",
[0x0205F] = "Zs",
[0x02065] = "Cn",
[0x02070] = "No",
[0x02071] = "Lm",
[0x0207D] = "Ps",
[0x0207E] = "Pe",
[0x0207F] = "Lm",
[0x0208D] = "Ps",
[0x0208E] = "Pe",
[0x0208F] = "Cn",
[0x020E1] = "Mn",
[0x02102] = "Lu",
[0x02107] = "Lu",
[0x0210A] = "Ll",
[0x02113] = "Ll",
[0x02114] = "So",
[0x02115] = "Lu",
[0x02118] = "Sm",
[0x02124] = "Lu",
[0x02125] = "So",
[0x02126] = "Lu",
[0x02127] = "So",
[0x02128] = "Lu",
[0x02129] = "So",
[0x0212E] = "So",
[0x0212F] = "Ll",
[0x02134] = "Ll",
[0x02139] = "Ll",
[0x02145] = "Lu",
[0x0214A] = "So",
[0x0214B] = "Sm",
[0x0214E] = "Ll",
[0x0214F] = "So",
[0x02183] = "Lu",
[0x02184] = "Ll",
[0x02189] = "No",
[0x021A0] = "Sm",
[0x021A3] = "Sm",
[0x021A6] = "Sm",
[0x021AE] = "Sm",
[0x021D2] = "Sm",
[0x021D3] = "So",
[0x021D4] = "Sm",
[0x02308] = "Ps",
[0x02309] = "Pe",
[0x0230A] = "Ps",
[0x0230B] = "Pe",
[0x02329] = "Ps",
[0x0232A] = "Pe",
[0x0237C] = "Sm",
[0x025B7] = "Sm",
[0x025C1] = "Sm",
[0x0266F] = "Sm",
[0x02768] = "Ps",
[0x02769] = "Pe",
[0x0276A] = "Ps",
[0x0276B] = "Pe",
[0x0276C] = "Ps",
[0x0276D] = "Pe",
[0x0276E] = "Ps",
[0x0276F] = "Pe",
[0x02770] = "Ps",
[0x02771] = "Pe",
[0x02772] = "Ps",
[0x02773] = "Pe",
[0x02774] = "Ps",
[0x02775] = "Pe",
[0x027C5] = "Ps",
[0x027C6] = "Pe",
[0x027E6] = "Ps",
[0x027E7] = "Pe",
[0x027E8] = "Ps",
[0x027E9] = "Pe",
[0x027EA] = "Ps",
[0x027EB] = "Pe",
[0x027EC] = "Ps",
[0x027ED] = "Pe",
[0x027EE] = "Ps",
[0x027EF] = "Pe",
[0x02983] = "Ps",
[0x02984] = "Pe",
[0x02985] = "Ps",
[0x02986] = "Pe",
[0x02987] = "Ps",
[0x02988] = "Pe",
[0x02989] = "Ps",
[0x0298A] = "Pe",
[0x0298B] = "Ps",
[0x0298C] = "Pe",
[0x0298D] = "Ps",
[0x0298E] = "Pe",
[0x0298F] = "Ps",
[0x02990] = "Pe",
[0x02991] = "Ps",
[0x02992] = "Pe",
[0x02993] = "Ps",
[0x02994] = "Pe",
[0x02995] = "Ps",
[0x02996] = "Pe",
[0x02997] = "Ps",
[0x02998] = "Pe",
[0x029D8] = "Ps",
[0x029D9] = "Pe",
[0x029DA] = "Ps",
[0x029DB] = "Pe",
[0x029FC] = "Ps",
[0x029FD] = "Pe",
[0x02C60] = "Lu",
[0x02C61] = "Ll",
[0x02C67] = "Lu",
[0x02C68] = "Ll",
[0x02C69] = "Lu",
[0x02C6A] = "Ll",
[0x02C6B] = "Lu",
[0x02C6C] = "Ll",
[0x02C71] = "Ll",
[0x02C72] = "Lu",
[0x02C75] = "Lu",
[0x02C81] = "Ll",
[0x02C82] = "Lu",
[0x02C83] = "Ll",
[0x02C84] = "Lu",
[0x02C85] = "Ll",
[0x02C86] = "Lu",
[0x02C87] = "Ll",
[0x02C88] = "Lu",
[0x02C89] = "Ll",
[0x02C8A] = "Lu",
[0x02C8B] = "Ll",
[0x02C8C] = "Lu",
[0x02C8D] = "Ll",
[0x02C8E] = "Lu",
[0x02C8F] = "Ll",
[0x02C90] = "Lu",
[0x02C91] = "Ll",
[0x02C92] = "Lu",
[0x02C93] = "Ll",
[0x02C94] = "Lu",
[0x02C95] = "Ll",
[0x02C96] = "Lu",
[0x02C97] = "Ll",
[0x02C98] = "Lu",
[0x02C99] = "Ll",
[0x02C9A] = "Lu",
[0x02C9B] = "Ll",
[0x02C9C] = "Lu",
[0x02C9D] = "Ll",
[0x02C9E] = "Lu",
[0x02C9F] = "Ll",
[0x02CA0] = "Lu",
[0x02CA1] = "Ll",
[0x02CA2] = "Lu",
[0x02CA3] = "Ll",
[0x02CA4] = "Lu",
[0x02CA5] = "Ll",
[0x02CA6] = "Lu",
[0x02CA7] = "Ll",
[0x02CA8] = "Lu",
[0x02CA9] = "Ll",
[0x02CAA] = "Lu",
[0x02CAB] = "Ll",
[0x02CAC] = "Lu",
[0x02CAD] = "Ll",
[0x02CAE] = "Lu",
[0x02CAF] = "Ll",
[0x02CB0] = "Lu",
[0x02CB1] = "Ll",
[0x02CB2] = "Lu",
[0x02CB3] = "Ll",
[0x02CB4] = "Lu",
[0x02CB5] = "Ll",
[0x02CB6] = "Lu",
[0x02CB7] = "Ll",
[0x02CB8] = "Lu",
[0x02CB9] = "Ll",
[0x02CBA] = "Lu",
[0x02CBB] = "Ll",
[0x02CBC] = "Lu",
[0x02CBD] = "Ll",
[0x02CBE] = "Lu",
[0x02CBF] = "Ll",
[0x02CC0] = "Lu",
[0x02CC1] = "Ll",
[0x02CC2] = "Lu",
[0x02CC3] = "Ll",
[0x02CC4] = "Lu",
[0x02CC5] = "Ll",
[0x02CC6] = "Lu",
[0x02CC7] = "Ll",
[0x02CC8] = "Lu",
[0x02CC9] = "Ll",
[0x02CCA] = "Lu",
[0x02CCB] = "Ll",
[0x02CCC] = "Lu",
[0x02CCD] = "Ll",
[0x02CCE] = "Lu",
[0x02CCF] = "Ll",
[0x02CD0] = "Lu",
[0x02CD1] = "Ll",
[0x02CD2] = "Lu",
[0x02CD3] = "Ll",
[0x02CD4] = "Lu",
[0x02CD5] = "Ll",
[0x02CD6] = "Lu",
[0x02CD7] = "Ll",
[0x02CD8] = "Lu",
[0x02CD9] = "Ll",
[0x02CDA] = "Lu",
[0x02CDB] = "Ll",
[0x02CDC] = "Lu",
[0x02CDD] = "Ll",
[0x02CDE] = "Lu",
[0x02CDF] = "Ll",
[0x02CE0] = "Lu",
[0x02CE1] = "Ll",
[0x02CE2] = "Lu",
[0x02CEB] = "Lu",
[0x02CEC] = "Ll",
[0x02CED] = "Lu",
[0x02CEE] = "Ll",
[0x02CF2] = "Lu",
[0x02CF3] = "Ll",
[0x02CFD] = "No",
[0x02D26] = "Cn",
[0x02D27] = "Ll",
[0x02D2D] = "Ll",
[0x02D6F] = "Lm",
[0x02D70] = "Po",
[0x02D7F] = "Mn",
[0x02DA7] = "Cn",
[0x02DAF] = "Cn",
[0x02DB7] = "Cn",
[0x02DBF] = "Cn",
[0x02DC7] = "Cn",
[0x02DCF] = "Cn",
[0x02DD7] = "Cn",
[0x02DDF] = "Cn",
[0x02E02] = "Pi",
[0x02E03] = "Pf",
[0x02E04] = "Pi",
[0x02E05] = "Pf",
[0x02E09] = "Pi",
[0x02E0A] = "Pf",
[0x02E0B] = "Po",
[0x02E0C] = "Pi",
[0x02E0D] = "Pf",
[0x02E17] = "Pd",
[0x02E1A] = "Pd",
[0x02E1B] = "Po",
[0x02E1C] = "Pi",
[0x02E1D] = "Pf",
[0x02E20] = "Pi",
[0x02E21] = "Pf",
[0x02E22] = "Ps",
[0x02E23] = "Pe",
[0x02E24] = "Ps",
[0x02E25] = "Pe",
[0x02E26] = "Ps",
[0x02E27] = "Pe",
[0x02E28] = "Ps",
[0x02E29] = "Pe",
[0x02E2F] = "Lm",
[0x02E40] = "Pd",
[0x02E41] = "Po",
[0x02E42] = "Ps",
[0x02E55] = "Ps",
[0x02E56] = "Pe",
[0x02E57] = "Ps",
[0x02E58] = "Pe",
[0x02E59] = "Ps",
[0x02E5A] = "Pe",
[0x02E5B] = "Ps",
[0x02E5C] = "Pe",
[0x02E5D] = "Pd",
[0x02E9A] = "Cn",
[0x03000] = "Zs",
[0x03004] = "So",
[0x03005] = "Lm",
[0x03006] = "Lo",
[0x03007] = "Nl",
[0x03008] = "Ps",
[0x03009] = "Pe",
[0x0300A] = "Ps",
[0x0300B] = "Pe",
[0x0300C] = "Ps",
[0x0300D] = "Pe",
[0x0300E] = "Ps",
[0x0300F] = "Pe",
[0x03010] = "Ps",
[0x03011] = "Pe",
[0x03014] = "Ps",
[0x03015] = "Pe",
[0x03016] = "Ps",
[0x03017] = "Pe",
[0x03018] = "Ps",
[0x03019] = "Pe",
[0x0301A] = "Ps",
[0x0301B] = "Pe",
[0x0301C] = "Pd",
[0x0301D] = "Ps",
[0x03020] = "So",
[0x03030] = "Pd",
[0x0303B] = "Lm",
[0x0303C] = "Lo",
[0x0303D] = "Po",
[0x03040] = "Cn",
[0x0309F] = "Lo",
[0x030A0] = "Pd",
[0x030FB] = "Po",
[0x030FF] = "Lo",
[0x03130] = "Cn",
[0x0318F] = "Cn",
[0x031EF] = "So",
[0x0321F] = "Cn",
[0x03250] = "So",
[0x0A015] = "Lm",
[0x0A60C] = "Lm",
[0x0A640] = "Lu",
[0x0A641] = "Ll",
[0x0A642] = "Lu",
[0x0A643] = "Ll",
[0x0A644] = "Lu",
[0x0A645] = "Ll",
[0x0A646] = "Lu",
[0x0A647] = "Ll",
[0x0A648] = "Lu",
[0x0A649] = "Ll",
[0x0A64A] = "Lu",
[0x0A64B] = "Ll",
[0x0A64C] = "Lu",
[0x0A64D] = "Ll",
[0x0A64E] = "Lu",
[0x0A64F] = "Ll",
[0x0A650] = "Lu",
[0x0A651] = "Ll",
[0x0A652] = "Lu",
[0x0A653] = "Ll",
[0x0A654] = "Lu",
[0x0A655] = "Ll",
[0x0A656] = "Lu",
[0x0A657] = "Ll",
[0x0A658] = "Lu",
[0x0A659] = "Ll",
[0x0A65A] = "Lu",
[0x0A65B] = "Ll",
[0x0A65C] = "Lu",
[0x0A65D] = "Ll",
[0x0A65E] = "Lu",
[0x0A65F] = "Ll",
[0x0A660] = "Lu",
[0x0A661] = "Ll",
[0x0A662] = "Lu",
[0x0A663] = "Ll",
[0x0A664] = "Lu",
[0x0A665] = "Ll",
[0x0A666] = "Lu",
[0x0A667] = "Ll",
[0x0A668] = "Lu",
[0x0A669] = "Ll",
[0x0A66A] = "Lu",
[0x0A66B] = "Ll",
[0x0A66C] = "Lu",
[0x0A66D] = "Ll",
[0x0A66E] = "Lo",
[0x0A66F] = "Mn",
[0x0A673] = "Po",
[0x0A67E] = "Po",
[0x0A67F] = "Lm",
[0x0A680] = "Lu",
[0x0A681] = "Ll",
[0x0A682] = "Lu",
[0x0A683] = "Ll",
[0x0A684] = "Lu",
[0x0A685] = "Ll",
[0x0A686] = "Lu",
[0x0A687] = "Ll",
[0x0A688] = "Lu",
[0x0A689] = "Ll",
[0x0A68A] = "Lu",
[0x0A68B] = "Ll",
[0x0A68C] = "Lu",
[0x0A68D] = "Ll",
[0x0A68E] = "Lu",
[0x0A68F] = "Ll",
[0x0A690] = "Lu",
[0x0A691] = "Ll",
[0x0A692] = "Lu",
[0x0A693] = "Ll",
[0x0A694] = "Lu",
[0x0A695] = "Ll",
[0x0A696] = "Lu",
[0x0A697] = "Ll",
[0x0A698] = "Lu",
[0x0A699] = "Ll",
[0x0A69A] = "Lu",
[0x0A69B] = "Ll",
[0x0A722] = "Lu",
[0x0A723] = "Ll",
[0x0A724] = "Lu",
[0x0A725] = "Ll",
[0x0A726] = "Lu",
[0x0A727] = "Ll",
[0x0A728] = "Lu",
[0x0A729] = "Ll",
[0x0A72A] = "Lu",
[0x0A72B] = "Ll",
[0x0A72C] = "Lu",
[0x0A72D] = "Ll",
[0x0A72E] = "Lu",
[0x0A732] = "Lu",
[0x0A733] = "Ll",
[0x0A734] = "Lu",
[0x0A735] = "Ll",
[0x0A736] = "Lu",
[0x0A737] = "Ll",
[0x0A738] = "Lu",
[0x0A739] = "Ll",
[0x0A73A] = "Lu",
[0x0A73B] = "Ll",
[0x0A73C] = "Lu",
[0x0A73D] = "Ll",
[0x0A73E] = "Lu",
[0x0A73F] = "Ll",
[0x0A740] = "Lu",
[0x0A741] = "Ll",
[0x0A742] = "Lu",
[0x0A743] = "Ll",
[0x0A744] = "Lu",
[0x0A745] = "Ll",
[0x0A746] = "Lu",
[0x0A747] = "Ll",
[0x0A748] = "Lu",
[0x0A749] = "Ll",
[0x0A74A] = "Lu",
[0x0A74B] = "Ll",
[0x0A74C] = "Lu",
[0x0A74D] = "Ll",
[0x0A74E] = "Lu",
[0x0A74F] = "Ll",
[0x0A750] = "Lu",
[0x0A751] = "Ll",
[0x0A752] = "Lu",
[0x0A753] = "Ll",
[0x0A754] = "Lu",
[0x0A755] = "Ll",
[0x0A756] = "Lu",
[0x0A757] = "Ll",
[0x0A758] = "Lu",
[0x0A759] = "Ll",
[0x0A75A] = "Lu",
[0x0A75B] = "Ll",
[0x0A75C] = "Lu",
[0x0A75D] = "Ll",
[0x0A75E] = "Lu",
[0x0A75F] = "Ll",
[0x0A760] = "Lu",
[0x0A761] = "Ll",
[0x0A762] = "Lu",
[0x0A763] = "Ll",
[0x0A764] = "Lu",
[0x0A765] = "Ll",
[0x0A766] = "Lu",
[0x0A767] = "Ll",
[0x0A768] = "Lu",
[0x0A769] = "Ll",
[0x0A76A] = "Lu",
[0x0A76B] = "Ll",
[0x0A76C] = "Lu",
[0x0A76D] = "Ll",
[0x0A76E] = "Lu",
[0x0A76F] = "Ll",
[0x0A770] = "Lm",
[0x0A779] = "Lu",
[0x0A77A] = "Ll",
[0x0A77B] = "Lu",
[0x0A77C] = "Ll",
[0x0A77F] = "Ll",
[0x0A780] = "Lu",
[0x0A781] = "Ll",
[0x0A782] = "Lu",
[0x0A783] = "Ll",
[0x0A784] = "Lu",
[0x0A785] = "Ll",
[0x0A786] = "Lu",
[0x0A787] = "Ll",
[0x0A788] = "Lm",
[0x0A78B] = "Lu",
[0x0A78C] = "Ll",
[0x0A78D] = "Lu",
[0x0A78E] = "Ll",
[0x0A78F] = "Lo",
[0x0A790] = "Lu",
[0x0A791] = "Ll",
[0x0A792] = "Lu",
[0x0A796] = "Lu",
[0x0A797] = "Ll",
[0x0A798] = "Lu",
[0x0A799] = "Ll",
[0x0A79A] = "Lu",
[0x0A79B] = "Ll",
[0x0A79C] = "Lu",
[0x0A79D] = "Ll",
[0x0A79E] = "Lu",
[0x0A79F] = "Ll",
[0x0A7A0] = "Lu",
[0x0A7A1] = "Ll",
[0x0A7A2] = "Lu",
[0x0A7A3] = "Ll",
[0x0A7A4] = "Lu",
[0x0A7A5] = "Ll",
[0x0A7A6] = "Lu",
[0x0A7A7] = "Ll",
[0x0A7A8] = "Lu",
[0x0A7A9] = "Ll",
[0x0A7AF] = "Ll",
[0x0A7B5] = "Ll",
[0x0A7B6] = "Lu",
[0x0A7B7] = "Ll",
[0x0A7B8] = "Lu",
[0x0A7B9] = "Ll",
[0x0A7BA] = "Lu",
[0x0A7BB] = "Ll",
[0x0A7BC] = "Lu",
[0x0A7BD] = "Ll",
[0x0A7BE] = "Lu",
[0x0A7BF] = "Ll",
[0x0A7C0] = "Lu",
[0x0A7C1] = "Ll",
[0x0A7C2] = "Lu",
[0x0A7C3] = "Ll",
[0x0A7C8] = "Ll",
[0x0A7C9] = "Lu",
[0x0A7CA] = "Ll",
[0x0A7CD] = "Ll",
[0x0A7CE] = "Lu",
[0x0A7CF] = "Ll",
[0x0A7D0] = "Lu",
[0x0A7D1] = "Ll",
[0x0A7D2] = "Lu",
[0x0A7D3] = "Ll",
[0x0A7D4] = "Lu",
[0x0A7D5] = "Ll",
[0x0A7D6] = "Lu",
[0x0A7D7] = "Ll",
[0x0A7D8] = "Lu",
[0x0A7D9] = "Ll",
[0x0A7DA] = "Lu",
[0x0A7DB] = "Ll",
[0x0A7DC] = "Lu",
[0x0A7F5] = "Lu",
[0x0A7F6] = "Ll",
[0x0A7F7] = "Lo",
[0x0A7FA] = "Ll",
[0x0A802] = "Mn",
[0x0A806] = "Mn",
[0x0A80B] = "Mn",
[0x0A827] = "Mc",
[0x0A82C] = "Mn",
[0x0A838] = "Sc",
[0x0A839] = "So",
[0x0A8FB] = "Lo",
[0x0A8FC] = "Po",
[0x0A8FF] = "Mn",
[0x0A95F] = "Po",
[0x0A983] = "Mc",
[0x0A9B3] = "Mn",
[0x0A9CE] = "Cn",
[0x0A9CF] = "Lm",
[0x0A9E5] = "Mn",
[0x0A9E6] = "Lm",
[0x0A9FF] = "Cn",
[0x0AA43] = "Mn",
[0x0AA4C] = "Mn",
[0x0AA4D] = "Mc",
[0x0AA70] = "Lm",
[0x0AA7A] = "Lo",
[0x0AA7B] = "Mc",
[0x0AA7C] = "Mn",
[0x0AA7D] = "Mc",
[0x0AAB0] = "Mn",
[0x0AAB1] = "Lo",
[0x0AAC0] = "Lo",
[0x0AAC1] = "Mn",
[0x0AAC2] = "Lo",
[0x0AADD] = "Lm",
[0x0AAEB] = "Mc",
[0x0AAF2] = "Lo",
[0x0AAF5] = "Mc",
[0x0AAF6] = "Mn",
[0x0AB27] = "Cn",
[0x0AB2F] = "Cn",
[0x0AB5B] = "Sk",
[0x0AB69] = "Lm",
[0x0ABE5] = "Mn",
[0x0ABE8] = "Mn",
[0x0ABEB] = "Po",
[0x0ABEC] = "Mc",
[0x0ABED] = "Mn",
[0x0FB1D] = "Lo",
[0x0FB1E] = "Mn",
[0x0FB29] = "Sm",
[0x0FB37] = "Cn",
[0x0FB3D] = "Cn",
[0x0FB3E] = "Lo",
[0x0FB3F] = "Cn",
[0x0FB42] = "Cn",
[0x0FB45] = "Cn",
[0x0FD3E] = "Pe",
[0x0FD3F] = "Ps",
[0x0FDFC] = "Sc",
[0x0FE17] = "Ps",
[0x0FE18] = "Pe",
[0x0FE19] = "Po",
[0x0FE30] = "Po",
[0x0FE35] = "Ps",
[0x0FE36] = "Pe",
[0x0FE37] = "Ps",
[0x0FE38] = "Pe",
[0x0FE39] = "Ps",
[0x0FE3A] = "Pe",
[0x0FE3B] = "Ps",
[0x0FE3C] = "Pe",
[0x0FE3D] = "Ps",
[0x0FE3E] = "Pe",
[0x0FE3F] = "Ps",
[0x0FE40] = "Pe",
[0x0FE41] = "Ps",
[0x0FE42] = "Pe",
[0x0FE43] = "Ps",
[0x0FE44] = "Pe",
[0x0FE47] = "Ps",
[0x0FE48] = "Pe",
[0x0FE53] = "Cn",
[0x0FE58] = "Pd",
[0x0FE59] = "Ps",
[0x0FE5A] = "Pe",
[0x0FE5B] = "Ps",
[0x0FE5C] = "Pe",
[0x0FE5D] = "Ps",
[0x0FE5E] = "Pe",
[0x0FE62] = "Sm",
[0x0FE63] = "Pd",
[0x0FE67] = "Cn",
[0x0FE68] = "Po",
[0x0FE69] = "Sc",
[0x0FE75] = "Cn",
[0x0FEFF] = "Cf",
[0x0FF00] = "Cn",
[0x0FF04] = "Sc",
[0x0FF08] = "Ps",
[0x0FF09] = "Pe",
[0x0FF0A] = "Po",
[0x0FF0B] = "Sm",
[0x0FF0C] = "Po",
[0x0FF0D] = "Pd",
[0x0FF3B] = "Ps",
[0x0FF3C] = "Po",
[0x0FF3D] = "Pe",
[0x0FF3E] = "Sk",
[0x0FF3F] = "Pc",
[0x0FF40] = "Sk",
[0x0FF5B] = "Ps",
[0x0FF5C] = "Sm",
[0x0FF5D] = "Pe",
[0x0FF5E] = "Sm",
[0x0FF5F] = "Ps",
[0x0FF60] = "Pe",
[0x0FF61] = "Po",
[0x0FF62] = "Ps",
[0x0FF63] = "Pe",
[0x0FF70] = "Lm",
[0x0FFE2] = "Sm",
[0x0FFE3] = "Sk",
[0x0FFE4] = "So",
[0x0FFE7] = "Cn",
[0x0FFE8] = "So",
[0x1000C] = "Cn",
[0x10027] = "Cn",
[0x1003B] = "Cn",
[0x1003E] = "Cn",
[0x1018F] = "Cn",
[0x101A0] = "So",
[0x101FD] = "Mn",
[0x102E0] = "Mn",
[0x10341] = "Nl",
[0x1034A] = "Nl",
[0x1039E] = "Cn",
[0x1039F] = "Po",
[0x103D0] = "Po",
[0x1056F] = "Po",
[0x1057B] = "Cn",
[0x1058B] = "Cn",
[0x10593] = "Cn",
[0x10596] = "Cn",
[0x105A2] = "Cn",
[0x105B2] = "Cn",
[0x105BA] = "Cn",
[0x10786] = "Cn",
[0x107B1] = "Cn",
[0x10808] = "Lo",
[0x10809] = "Cn",
[0x10836] = "Cn",
[0x1083C] = "Lo",
[0x10856] = "Cn",
[0x10857] = "Po",
[0x108F3] = "Cn",
[0x1091F] = "Po",
[0x1093F] = "Po",
[0x10A00] = "Lo",
[0x10A04] = "Cn",
[0x10A14] = "Cn",
[0x10A18] = "Cn",
[0x10A3F] = "Mn",
[0x10A7F] = "Po",
[0x10AC8] = "So",
[0x10D4E] = "Lm",
[0x10D4F] = "Lo",
[0x10D6E] = "Pd",
[0x10D6F] = "Lm",
[0x10E7F] = "Cn",
[0x10EAA] = "Cn",
[0x10EAD] = "Pd",
[0x10EC5] = "Lm",
[0x10ED0] = "Po",
[0x10F27] = "Lo",
[0x11000] = "Mc",
[0x11001] = "Mn",
[0x11002] = "Mc",
[0x11070] = "Mn",
[0x11075] = "Lo",
[0x11082] = "Mc",
[0x110BD] = "Cf",
[0x110C2] = "Mn",
[0x110CD] = "Cf",
[0x1112C] = "Mc",
[0x11135] = "Cn",
[0x11144] = "Lo",
[0x11147] = "Lo",
[0x11173] = "Mn",
[0x11176] = "Lo",
[0x11182] = "Mc",
[0x111CD] = "Po",
[0x111CE] = "Mc",
[0x111CF] = "Mn",
[0x111DA] = "Lo",
[0x111DB] = "Po",
[0x111DC] = "Lo",
[0x111E0] = "Cn",
[0x11212] = "Cn",
[0x11234] = "Mn",
[0x11235] = "Mc",
[0x1123E] = "Mn",
[0x11241] = "Mn",
[0x11287] = "Cn",
[0x11288] = "Lo",
[0x11289] = "Cn",
[0x1128E] = "Cn",
[0x1129E] = "Cn",
[0x112A9] = "Po",
[0x112DF] = "Mn",
[0x11304] = "Cn",
[0x11329] = "Cn",
[0x11331] = "Cn",
[0x11334] = "Cn",
[0x1133A] = "Cn",
[0x1133D] = "Lo",
[0x11340] = "Mn",
[0x11350] = "Lo",
[0x11357] = "Mc",
[0x1138A] = "Cn",
[0x1138B] = "Lo",
[0x1138E] = "Lo",
[0x1138F] = "Cn",
[0x113B6] = "Cn",
[0x113B7] = "Lo",
[0x113C1] = "Cn",
[0x113C2] = "Mc",
[0x113C5] = "Mc",
[0x113C6] = "Cn",
[0x113CB] = "Cn",
[0x113CE] = "Mn",
[0x113CF] = "Mc",
[0x113D0] = "Mn",
[0x113D1] = "Lo",
[0x113D2] = "Mn",
[0x113D3] = "Lo",
[0x113D6] = "Cn",
[0x11445] = "Mc",
[0x11446] = "Mn",
[0x1145C] = "Cn",
[0x1145D] = "Po",
[0x1145E] = "Mn",
[0x114B9] = "Mc",
[0x114BA] = "Mn",
[0x114C1] = "Mc",
[0x114C6] = "Po",
[0x114C7] = "Lo",
[0x115BE] = "Mc",
[0x1163D] = "Mn",
[0x1163E] = "Mc",
[0x11644] = "Lo",
[0x116AB] = "Mn",
[0x116AC] = "Mc",
[0x116AD] = "Mn",
[0x116B6] = "Mc",
[0x116B7] = "Mn",
[0x116B8] = "Lo",
[0x116B9] = "Po",
[0x1171D] = "Mn",
[0x1171E] = "Mc",
[0x1171F] = "Mn",
[0x11726] = "Mc",
[0x1173F] = "So",
[0x11838] = "Mc",
[0x1183B] = "Po",
[0x11909] = "Lo",
[0x11914] = "Cn",
[0x11917] = "Cn",
[0x11936] = "Cn",
[0x1193D] = "Mc",
[0x1193E] = "Mn",
[0x1193F] = "Lo",
[0x11940] = "Mc",
[0x11941] = "Lo",
[0x11942] = "Mc",
[0x11943] = "Mn",
[0x119E0] = "Mn",
[0x119E1] = "Lo",
[0x119E2] = "Po",
[0x119E3] = "Lo",
[0x119E4] = "Mc",
[0x11A00] = "Lo",
[0x11A39] = "Mc",
[0x11A3A] = "Lo",
[0x11A47] = "Mn",
[0x11A50] = "Lo",
[0x11A97] = "Mc",
[0x11A9D] = "Lo",
[0x11B60] = "Mn",
[0x11B61] = "Mc",
[0x11B65] = "Mc",
[0x11B66] = "Mn",
[0x11B67] = "Mc",
[0x11BE1] = "Po",
[0x11C09] = "Cn",
[0x11C2F] = "Mc",
[0x11C37] = "Cn",
[0x11C3E] = "Mc",
[0x11C3F] = "Mn",
[0x11C40] = "Lo",
[0x11CA8] = "Cn",
[0x11CA9] = "Mc",
[0x11CB1] = "Mc",
[0x11CB4] = "Mc",
[0x11D07] = "Cn",
[0x11D0A] = "Cn",
[0x11D3A] = "Mn",
[0x11D3B] = "Cn",
[0x11D3E] = "Cn",
[0x11D46] = "Lo",
[0x11D47] = "Mn",
[0x11D66] = "Cn",
[0x11D69] = "Cn",
[0x11D8F] = "Cn",
[0x11D92] = "Cn",
[0x11D95] = "Mn",
[0x11D96] = "Mc",
[0x11D97] = "Mn",
[0x11D98] = "Lo",
[0x11DD9] = "Lm",
[0x11F02] = "Lo",
[0x11F03] = "Mc",
[0x11F11] = "Cn",
[0x11F40] = "Mn",
[0x11F41] = "Mc",
[0x11F42] = "Mn",
[0x11F5A] = "Mn",
[0x11FB0] = "Lo",
[0x11FFF] = "Po",
[0x1246F] = "Cn",
[0x13440] = "Mn",
[0x16A5F] = "Cn",
[0x16ABF] = "Cn",
[0x16AF5] = "Po",
[0x16B44] = "Po",
[0x16B45] = "So",
[0x16B5A] = "Cn",
[0x16B62] = "Cn",
[0x16F4F] = "Mn",
[0x16F50] = "Lo",
[0x16FE2] = "Po",
[0x16FE3] = "Lm",
[0x16FE4] = "Mn",
[0x1AFF4] = "Cn",
[0x1AFFC] = "Cn",
[0x1AFFF] = "Cn",
[0x1B132] = "Lo",
[0x1B155] = "Lo",
[0x1BC9C] = "So",
[0x1BC9F] = "Po",
[0x1CEF0] = "Sm",
[0x1D245] = "So",
[0x1D455] = "Cn",
[0x1D49C] = "Lu",
[0x1D49D] = "Cn",
[0x1D4A2] = "Lu",
[0x1D4AD] = "Cn",
[0x1D4BA] = "Cn",
[0x1D4BB] = "Ll",
[0x1D4BC] = "Cn",
[0x1D4C4] = "Cn",
[0x1D506] = "Cn",
[0x1D515] = "Cn",
[0x1D51D] = "Cn",
[0x1D53A] = "Cn",
[0x1D53F] = "Cn",
[0x1D545] = "Cn",
[0x1D546] = "Lu",
[0x1D551] = "Cn",
[0x1D6C1] = "Sm",
[0x1D6DB] = "Sm",
[0x1D6FB] = "Sm",
[0x1D715] = "Sm",
[0x1D735] = "Sm",
[0x1D74F] = "Sm",
[0x1D76F] = "Sm",
[0x1D789] = "Sm",
[0x1D7A9] = "Sm",
[0x1D7C3] = "Sm",
[0x1D7CA] = "Lu",
[0x1D7CB] = "Ll",
[0x1DA75] = "Mn",
[0x1DA84] = "Mn",
[0x1DAA0] = "Cn",
[0x1DF0A] = "Lo",
[0x1E007] = "Cn",
[0x1E022] = "Cn",
[0x1E025] = "Cn",
[0x1E08F] = "Mn",
[0x1E14E] = "Lo",
[0x1E14F] = "So",
[0x1E2AE] = "Mn",
[0x1E2FF] = "Sc",
[0x1E4EB] = "Lm",
[0x1E5F0] = "Lo",
[0x1E5FF] = "Po",
[0x1E6DF] = "Cn",
[0x1E6E3] = "Mn",
[0x1E6E6] = "Mn",
[0x1E6F5] = "Mn",
[0x1E6FE] = "Lo",
[0x1E6FF] = "Lm",
[0x1E7E7] = "Cn",
[0x1E7EC] = "Cn",
[0x1E7EF] = "Cn",
[0x1E7FF] = "Cn",
[0x1E94B] = "Lm",
[0x1ECAC] = "So",
[0x1ECB0] = "Sc",
[0x1ED2E] = "So",
[0x1EE04] = "Cn",
[0x1EE20] = "Cn",
[0x1EE23] = "Cn",
[0x1EE24] = "Lo",
[0x1EE27] = "Lo",
[0x1EE28] = "Cn",
[0x1EE33] = "Cn",
[0x1EE38] = "Cn",
[0x1EE39] = "Lo",
[0x1EE3A] = "Cn",
[0x1EE3B] = "Lo",
[0x1EE42] = "Lo",
[0x1EE47] = "Lo",
[0x1EE48] = "Cn",
[0x1EE49] = "Lo",
[0x1EE4A] = "Cn",
[0x1EE4B] = "Lo",
[0x1EE4C] = "Cn",
[0x1EE50] = "Cn",
[0x1EE53] = "Cn",
[0x1EE54] = "Lo",
[0x1EE57] = "Lo",
[0x1EE58] = "Cn",
[0x1EE59] = "Lo",
[0x1EE5A] = "Cn",
[0x1EE5B] = "Lo",
[0x1EE5C] = "Cn",
[0x1EE5D] = "Lo",
[0x1EE5E] = "Cn",
[0x1EE5F] = "Lo",
[0x1EE60] = "Cn",
[0x1EE63] = "Cn",
[0x1EE64] = "Lo",
[0x1EE6B] = "Cn",
[0x1EE73] = "Cn",
[0x1EE78] = "Cn",
[0x1EE7D] = "Cn",
[0x1EE7E] = "Lo",
[0x1EE7F] = "Cn",
[0x1EE8A] = "Cn",
[0x1EEA4] = "Cn",
[0x1EEAA] = "Cn",
[0x1F0C0] = "Cn",
[0x1F0D0] = "Cn",
[0x1F7F0] = "So",
[0x1FAC7] = "Cn",
[0x1FAC8] = "So",
[0x1FB93] = "Cn",
[0x1FBFA] = "So",
[0xE0001] = "Cf",
},
ranges = {
{ 0x000000, 0x00001F, "Cc" },
{ 0x000021, 0x000023, "Po" },
{ 0x000025, 0x000027, "Po" },
{ 0x00002E, 0x00002F, "Po" },
{ 0x000030, 0x000039, "Nd" },
{ 0x00003A, 0x00003B, "Po" },
{ 0x00003C, 0x00003E, "Sm" },
{ 0x00003F, 0x000040, "Po" },
{ 0x000041, 0x00005A, "Lu" },
{ 0x000061, 0x00007A, "Ll" },
{ 0x00007F, 0x00009F, "Cc" },
{ 0x0000A2, 0x0000A5, "Sc" },
{ 0x0000B2, 0x0000B3, "No" },
{ 0x0000B6, 0x0000B7, "Po" },
{ 0x0000BC, 0x0000BE, "No" },
{ 0x0000C0, 0x0000D6, "Lu" },
{ 0x0000D8, 0x0000DE, "Lu" },
{ 0x0000DF, 0x0000F6, "Ll" },
{ 0x0000F8, 0x0000FF, "Ll" },
{ 0x000137, 0x000138, "Ll" },
{ 0x000148, 0x000149, "Ll" },
{ 0x000178, 0x000179, "Lu" },
{ 0x00017E, 0x000180, "Ll" },
{ 0x000181, 0x000182, "Lu" },
{ 0x000186, 0x000187, "Lu" },
{ 0x000189, 0x00018B, "Lu" },
{ 0x00018C, 0x00018D, "Ll" },
{ 0x00018E, 0x000191, "Lu" },
{ 0x000193, 0x000194, "Lu" },
{ 0x000196, 0x000198, "Lu" },
{ 0x000199, 0x00019B, "Ll" },
{ 0x00019C, 0x00019D, "Lu" },
{ 0x00019F, 0x0001A0, "Lu" },
{ 0x0001A6, 0x0001A7, "Lu" },
{ 0x0001AA, 0x0001AB, "Ll" },
{ 0x0001AE, 0x0001AF, "Lu" },
{ 0x0001B1, 0x0001B3, "Lu" },
{ 0x0001B7, 0x0001B8, "Lu" },
{ 0x0001B9, 0x0001BA, "Ll" },
{ 0x0001BD, 0x0001BF, "Ll" },
{ 0x0001C0, 0x0001C3, "Lo" },
{ 0x0001DC, 0x0001DD, "Ll" },
{ 0x0001EF, 0x0001F0, "Ll" },
{ 0x0001F6, 0x0001F8, "Lu" },
{ 0x000233, 0x000239, "Ll" },
{ 0x00023A, 0x00023B, "Lu" },
{ 0x00023D, 0x00023E, "Lu" },
{ 0x00023F, 0x000240, "Ll" },
{ 0x000243, 0x000246, "Lu" },
{ 0x00024F, 0x000293, "Ll" },
{ 0x000294, 0x000295, "Lo" },
{ 0x000296, 0x0002AF, "Ll" },
{ 0x0002B0, 0x0002C1, "Lm" },
{ 0x0002C2, 0x0002C5, "Sk" },
{ 0x0002C6, 0x0002D1, "Lm" },
{ 0x0002D2, 0x0002DF, "Sk" },
{ 0x0002E0, 0x0002E4, "Lm" },
{ 0x0002E5, 0x0002EB, "Sk" },
{ 0x0002EF, 0x0002FF, "Sk" },
{ 0x000300, 0x00036F, "Mn" },
{ 0x000378, 0x000379, "Cn" },
{ 0x00037B, 0x00037D, "Ll" },
{ 0x000380, 0x000383, "Cn" },
{ 0x000384, 0x000385, "Sk" },
{ 0x000388, 0x00038A, "Lu" },
{ 0x00038E, 0x00038F, "Lu" },
{ 0x000391, 0x0003A1, "Lu" },
{ 0x0003A3, 0x0003AB, "Lu" },
{ 0x0003AC, 0x0003CE, "Ll" },
{ 0x0003D0, 0x0003D1, "Ll" },
{ 0x0003D2, 0x0003D4, "Lu" },
{ 0x0003D5, 0x0003D7, "Ll" },
{ 0x0003EF, 0x0003F3, "Ll" },
{ 0x0003F9, 0x0003FA, "Lu" },
{ 0x0003FB, 0x0003FC, "Ll" },
{ 0x0003FD, 0x00042F, "Lu" },
{ 0x000430, 0x00045F, "Ll" },
{ 0x000483, 0x000487, "Mn" },
{ 0x000488, 0x000489, "Me" },
{ 0x0004C0, 0x0004C1, "Lu" },
{ 0x0004CE, 0x0004CF, "Ll" },
{ 0x000531, 0x000556, "Lu" },
{ 0x000557, 0x000558, "Cn" },
{ 0x00055A, 0x00055F, "Po" },
{ 0x000560, 0x000588, "Ll" },
{ 0x00058B, 0x00058C, "Cn" },
{ 0x00058D, 0x00058E, "So" },
{ 0x000591, 0x0005BD, "Mn" },
{ 0x0005C1, 0x0005C2, "Mn" },
{ 0x0005C4, 0x0005C5, "Mn" },
{ 0x0005C8, 0x0005CF, "Cn" },
{ 0x0005D0, 0x0005EA, "Lo" },
{ 0x0005EB, 0x0005EE, "Cn" },
{ 0x0005EF, 0x0005F2, "Lo" },
{ 0x0005F3, 0x0005F4, "Po" },
{ 0x0005F5, 0x0005FF, "Cn" },
{ 0x000600, 0x000605, "Cf" },
{ 0x000606, 0x000608, "Sm" },
{ 0x000609, 0x00060A, "Po" },
{ 0x00060C, 0x00060D, "Po" },
{ 0x00060E, 0x00060F, "So" },
{ 0x000610, 0x00061A, "Mn" },
{ 0x00061D, 0x00061F, "Po" },
{ 0x000620, 0x00063F, "Lo" },
{ 0x000641, 0x00064A, "Lo" },
{ 0x00064B, 0x00065F, "Mn" },
{ 0x000660, 0x000669, "Nd" },
{ 0x00066A, 0x00066D, "Po" },
{ 0x00066E, 0x00066F, "Lo" },
{ 0x000671, 0x0006D3, "Lo" },
{ 0x0006D6, 0x0006DC, "Mn" },
{ 0x0006DF, 0x0006E4, "Mn" },
{ 0x0006E5, 0x0006E6, "Lm" },
{ 0x0006E7, 0x0006E8, "Mn" },
{ 0x0006EA, 0x0006ED, "Mn" },
{ 0x0006EE, 0x0006EF, "Lo" },
{ 0x0006F0, 0x0006F9, "Nd" },
{ 0x0006FA, 0x0006FC, "Lo" },
{ 0x0006FD, 0x0006FE, "So" },
{ 0x000700, 0x00070D, "Po" },
{ 0x000712, 0x00072F, "Lo" },
{ 0x000730, 0x00074A, "Mn" },
{ 0x00074B, 0x00074C, "Cn" },
{ 0x00074D, 0x0007A5, "Lo" },
{ 0x0007A6, 0x0007B0, "Mn" },
{ 0x0007B2, 0x0007BF, "Cn" },
{ 0x0007C0, 0x0007C9, "Nd" },
{ 0x0007CA, 0x0007EA, "Lo" },
{ 0x0007EB, 0x0007F3, "Mn" },
{ 0x0007F4, 0x0007F5, "Lm" },
{ 0x0007F7, 0x0007F9, "Po" },
{ 0x0007FB, 0x0007FC, "Cn" },
{ 0x0007FE, 0x0007FF, "Sc" },
{ 0x000800, 0x000815, "Lo" },
{ 0x000816, 0x000819, "Mn" },
{ 0x00081B, 0x000823, "Mn" },
{ 0x000825, 0x000827, "Mn" },
{ 0x000829, 0x00082D, "Mn" },
{ 0x00082E, 0x00082F, "Cn" },
{ 0x000830, 0x00083E, "Po" },
{ 0x000840, 0x000858, "Lo" },
{ 0x000859, 0x00085B, "Mn" },
{ 0x00085C, 0x00085D, "Cn" },
{ 0x000860, 0x00086A, "Lo" },
{ 0x00086B, 0x00086F, "Cn" },
{ 0x000870, 0x000887, "Lo" },
{ 0x000889, 0x00088F, "Lo" },
{ 0x000890, 0x000891, "Cf" },
{ 0x000892, 0x000896, "Cn" },
{ 0x000897, 0x00089F, "Mn" },
{ 0x0008A0, 0x0008C8, "Lo" },
{ 0x0008CA, 0x0008E1, "Mn" },
{ 0x0008E3, 0x000902, "Mn" },
{ 0x000904, 0x000939, "Lo" },
{ 0x00093E, 0x000940, "Mc" },
{ 0x000941, 0x000948, "Mn" },
{ 0x000949, 0x00094C, "Mc" },
{ 0x00094E, 0x00094F, "Mc" },
{ 0x000951, 0x000957, "Mn" },
{ 0x000958, 0x000961, "Lo" },
{ 0x000962, 0x000963, "Mn" },
{ 0x000964, 0x000965, "Po" },
{ 0x000966, 0x00096F, "Nd" },
{ 0x000972, 0x000980, "Lo" },
{ 0x000982, 0x000983, "Mc" },
{ 0x000985, 0x00098C, "Lo" },
{ 0x00098D, 0x00098E, "Cn" },
{ 0x00098F, 0x000990, "Lo" },
{ 0x000991, 0x000992, "Cn" },
{ 0x000993, 0x0009A8, "Lo" },
{ 0x0009AA, 0x0009B0, "Lo" },
{ 0x0009B3, 0x0009B5, "Cn" },
{ 0x0009B6, 0x0009B9, "Lo" },
{ 0x0009BA, 0x0009BB, "Cn" },
{ 0x0009BE, 0x0009C0, "Mc" },
{ 0x0009C1, 0x0009C4, "Mn" },
{ 0x0009C5, 0x0009C6, "Cn" },
{ 0x0009C7, 0x0009C8, "Mc" },
{ 0x0009C9, 0x0009CA, "Cn" },
{ 0x0009CB, 0x0009CC, "Mc" },
{ 0x0009CF, 0x0009D6, "Cn" },
{ 0x0009D8, 0x0009DB, "Cn" },
{ 0x0009DC, 0x0009DD, "Lo" },
{ 0x0009DF, 0x0009E1, "Lo" },
{ 0x0009E2, 0x0009E3, "Mn" },
{ 0x0009E4, 0x0009E5, "Cn" },
{ 0x0009E6, 0x0009EF, "Nd" },
{ 0x0009F0, 0x0009F1, "Lo" },
{ 0x0009F2, 0x0009F3, "Sc" },
{ 0x0009F4, 0x0009F9, "No" },
{ 0x0009FF, 0x000A00, "Cn" },
{ 0x000A01, 0x000A02, "Mn" },
{ 0x000A05, 0x000A0A, "Lo" },
{ 0x000A0B, 0x000A0E, "Cn" },
{ 0x000A0F, 0x000A10, "Lo" },
{ 0x000A11, 0x000A12, "Cn" },
{ 0x000A13, 0x000A28, "Lo" },
{ 0x000A2A, 0x000A30, "Lo" },
{ 0x000A32, 0x000A33, "Lo" },
{ 0x000A35, 0x000A36, "Lo" },
{ 0x000A38, 0x000A39, "Lo" },
{ 0x000A3A, 0x000A3B, "Cn" },
{ 0x000A3E, 0x000A40, "Mc" },
{ 0x000A41, 0x000A42, "Mn" },
{ 0x000A43, 0x000A46, "Cn" },
{ 0x000A47, 0x000A48, "Mn" },
{ 0x000A49, 0x000A4A, "Cn" },
{ 0x000A4B, 0x000A4D, "Mn" },
{ 0x000A4E, 0x000A50, "Cn" },
{ 0x000A52, 0x000A58, "Cn" },
{ 0x000A59, 0x000A5C, "Lo" },
{ 0x000A5F, 0x000A65, "Cn" },
{ 0x000A66, 0x000A6F, "Nd" },
{ 0x000A70, 0x000A71, "Mn" },
{ 0x000A72, 0x000A74, "Lo" },
{ 0x000A77, 0x000A80, "Cn" },
{ 0x000A81, 0x000A82, "Mn" },
{ 0x000A85, 0x000A8D, "Lo" },
{ 0x000A8F, 0x000A91, "Lo" },
{ 0x000A93, 0x000AA8, "Lo" },
{ 0x000AAA, 0x000AB0, "Lo" },
{ 0x000AB2, 0x000AB3, "Lo" },
{ 0x000AB5, 0x000AB9, "Lo" },
{ 0x000ABA, 0x000ABB, "Cn" },
{ 0x000ABE, 0x000AC0, "Mc" },
{ 0x000AC1, 0x000AC5, "Mn" },
{ 0x000AC7, 0x000AC8, "Mn" },
{ 0x000ACB, 0x000ACC, "Mc" },
{ 0x000ACE, 0x000ACF, "Cn" },
{ 0x000AD1, 0x000ADF, "Cn" },
{ 0x000AE0, 0x000AE1, "Lo" },
{ 0x000AE2, 0x000AE3, "Mn" },
{ 0x000AE4, 0x000AE5, "Cn" },
{ 0x000AE6, 0x000AEF, "Nd" },
{ 0x000AF2, 0x000AF8, "Cn" },
{ 0x000AFA, 0x000AFF, "Mn" },
{ 0x000B02, 0x000B03, "Mc" },
{ 0x000B05, 0x000B0C, "Lo" },
{ 0x000B0D, 0x000B0E, "Cn" },
{ 0x000B0F, 0x000B10, "Lo" },
{ 0x000B11, 0x000B12, "Cn" },
{ 0x000B13, 0x000B28, "Lo" },
{ 0x000B2A, 0x000B30, "Lo" },
{ 0x000B32, 0x000B33, "Lo" },
{ 0x000B35, 0x000B39, "Lo" },
{ 0x000B3A, 0x000B3B, "Cn" },
{ 0x000B41, 0x000B44, "Mn" },
{ 0x000B45, 0x000B46, "Cn" },
{ 0x000B47, 0x000B48, "Mc" },
{ 0x000B49, 0x000B4A, "Cn" },
{ 0x000B4B, 0x000B4C, "Mc" },
{ 0x000B4E, 0x000B54, "Cn" },
{ 0x000B55, 0x000B56, "Mn" },
{ 0x000B58, 0x000B5B, "Cn" },
{ 0x000B5C, 0x000B5D, "Lo" },
{ 0x000B5F, 0x000B61, "Lo" },
{ 0x000B62, 0x000B63, "Mn" },
{ 0x000B64, 0x000B65, "Cn" },
{ 0x000B66, 0x000B6F, "Nd" },
{ 0x000B72, 0x000B77, "No" },
{ 0x000B78, 0x000B81, "Cn" },
{ 0x000B85, 0x000B8A, "Lo" },
{ 0x000B8B, 0x000B8D, "Cn" },
{ 0x000B8E, 0x000B90, "Lo" },
{ 0x000B92, 0x000B95, "Lo" },
{ 0x000B96, 0x000B98, "Cn" },
{ 0x000B99, 0x000B9A, "Lo" },
{ 0x000B9E, 0x000B9F, "Lo" },
{ 0x000BA0, 0x000BA2, "Cn" },
{ 0x000BA3, 0x000BA4, "Lo" },
{ 0x000BA5, 0x000BA7, "Cn" },
{ 0x000BA8, 0x000BAA, "Lo" },
{ 0x000BAB, 0x000BAD, "Cn" },
{ 0x000BAE, 0x000BB9, "Lo" },
{ 0x000BBA, 0x000BBD, "Cn" },
{ 0x000BBE, 0x000BBF, "Mc" },
{ 0x000BC1, 0x000BC2, "Mc" },
{ 0x000BC3, 0x000BC5, "Cn" },
{ 0x000BC6, 0x000BC8, "Mc" },
{ 0x000BCA, 0x000BCC, "Mc" },
{ 0x000BCE, 0x000BCF, "Cn" },
{ 0x000BD1, 0x000BD6, "Cn" },
{ 0x000BD8, 0x000BE5, "Cn" },
{ 0x000BE6, 0x000BEF, "Nd" },
{ 0x000BF0, 0x000BF2, "No" },
{ 0x000BF3, 0x000BF8, "So" },
{ 0x000BFB, 0x000BFF, "Cn" },
{ 0x000C01, 0x000C03, "Mc" },
{ 0x000C05, 0x000C0C, "Lo" },
{ 0x000C0E, 0x000C10, "Lo" },
{ 0x000C12, 0x000C28, "Lo" },
{ 0x000C2A, 0x000C39, "Lo" },
{ 0x000C3A, 0x000C3B, "Cn" },
{ 0x000C3E, 0x000C40, "Mn" },
{ 0x000C41, 0x000C44, "Mc" },
{ 0x000C46, 0x000C48, "Mn" },
{ 0x000C4A, 0x000C4D, "Mn" },
{ 0x000C4E, 0x000C54, "Cn" },
{ 0x000C55, 0x000C56, "Mn" },
{ 0x000C58, 0x000C5A, "Lo" },
{ 0x000C5C, 0x000C5D, "Lo" },
{ 0x000C5E, 0x000C5F, "Cn" },
{ 0x000C60, 0x000C61, "Lo" },
{ 0x000C62, 0x000C63, "Mn" },
{ 0x000C64, 0x000C65, "Cn" },
{ 0x000C66, 0x000C6F, "Nd" },
{ 0x000C70, 0x000C76, "Cn" },
{ 0x000C78, 0x000C7E, "No" },
{ 0x000C82, 0x000C83, "Mc" },
{ 0x000C85, 0x000C8C, "Lo" },
{ 0x000C8E, 0x000C90, "Lo" },
{ 0x000C92, 0x000CA8, "Lo" },
{ 0x000CAA, 0x000CB3, "Lo" },
{ 0x000CB5, 0x000CB9, "Lo" },
{ 0x000CBA, 0x000CBB, "Cn" },
{ 0x000CC0, 0x000CC4, "Mc" },
{ 0x000CC7, 0x000CC8, "Mc" },
{ 0x000CCA, 0x000CCB, "Mc" },
{ 0x000CCC, 0x000CCD, "Mn" },
{ 0x000CCE, 0x000CD4, "Cn" },
{ 0x000CD5, 0x000CD6, "Mc" },
{ 0x000CD7, 0x000CDB, "Cn" },
{ 0x000CDC, 0x000CDE, "Lo" },
{ 0x000CE0, 0x000CE1, "Lo" },
{ 0x000CE2, 0x000CE3, "Mn" },
{ 0x000CE4, 0x000CE5, "Cn" },
{ 0x000CE6, 0x000CEF, "Nd" },
{ 0x000CF1, 0x000CF2, "Lo" },
{ 0x000CF4, 0x000CFF, "Cn" },
{ 0x000D00, 0x000D01, "Mn" },
{ 0x000D02, 0x000D03, "Mc" },
{ 0x000D04, 0x000D0C, "Lo" },
{ 0x000D0E, 0x000D10, "Lo" },
{ 0x000D12, 0x000D3A, "Lo" },
{ 0x000D3B, 0x000D3C, "Mn" },
{ 0x000D3E, 0x000D40, "Mc" },
{ 0x000D41, 0x000D44, "Mn" },
{ 0x000D46, 0x000D48, "Mc" },
{ 0x000D4A, 0x000D4C, "Mc" },
{ 0x000D50, 0x000D53, "Cn" },
{ 0x000D54, 0x000D56, "Lo" },
{ 0x000D58, 0x000D5E, "No" },
{ 0x000D5F, 0x000D61, "Lo" },
{ 0x000D62, 0x000D63, "Mn" },
{ 0x000D64, 0x000D65, "Cn" },
{ 0x000D66, 0x000D6F, "Nd" },
{ 0x000D70, 0x000D78, "No" },
{ 0x000D7A, 0x000D7F, "Lo" },
{ 0x000D82, 0x000D83, "Mc" },
{ 0x000D85, 0x000D96, "Lo" },
{ 0x000D97, 0x000D99, "Cn" },
{ 0x000D9A, 0x000DB1, "Lo" },
{ 0x000DB3, 0x000DBB, "Lo" },
{ 0x000DBE, 0x000DBF, "Cn" },
{ 0x000DC0, 0x000DC6, "Lo" },
{ 0x000DC7, 0x000DC9, "Cn" },
{ 0x000DCB, 0x000DCE, "Cn" },
{ 0x000DCF, 0x000DD1, "Mc" },
{ 0x000DD2, 0x000DD4, "Mn" },
{ 0x000DD8, 0x000DDF, "Mc" },
{ 0x000DE0, 0x000DE5, "Cn" },
{ 0x000DE6, 0x000DEF, "Nd" },
{ 0x000DF0, 0x000DF1, "Cn" },
{ 0x000DF2, 0x000DF3, "Mc" },
{ 0x000DF5, 0x000E00, "Cn" },
{ 0x000E01, 0x000E30, "Lo" },
{ 0x000E32, 0x000E33, "Lo" },
{ 0x000E34, 0x000E3A, "Mn" },
{ 0x000E3B, 0x000E3E, "Cn" },
{ 0x000E40, 0x000E45, "Lo" },
{ 0x000E47, 0x000E4E, "Mn" },
{ 0x000E50, 0x000E59, "Nd" },
{ 0x000E5A, 0x000E5B, "Po" },
{ 0x000E5C, 0x000E80, "Cn" },
{ 0x000E81, 0x000E82, "Lo" },
{ 0x000E86, 0x000E8A, "Lo" },
{ 0x000E8C, 0x000EA3, "Lo" },
{ 0x000EA7, 0x000EB0, "Lo" },
{ 0x000EB2, 0x000EB3, "Lo" },
{ 0x000EB4, 0x000EBC, "Mn" },
{ 0x000EBE, 0x000EBF, "Cn" },
{ 0x000EC0, 0x000EC4, "Lo" },
{ 0x000EC8, 0x000ECE, "Mn" },
{ 0x000ED0, 0x000ED9, "Nd" },
{ 0x000EDA, 0x000EDB, "Cn" },
{ 0x000EDC, 0x000EDF, "Lo" },
{ 0x000EE0, 0x000EFF, "Cn" },
{ 0x000F01, 0x000F03, "So" },
{ 0x000F04, 0x000F12, "Po" },
{ 0x000F15, 0x000F17, "So" },
{ 0x000F18, 0x000F19, "Mn" },
{ 0x000F1A, 0x000F1F, "So" },
{ 0x000F20, 0x000F29, "Nd" },
{ 0x000F2A, 0x000F33, "No" },
{ 0x000F3E, 0x000F3F, "Mc" },
{ 0x000F40, 0x000F47, "Lo" },
{ 0x000F49, 0x000F6C, "Lo" },
{ 0x000F6D, 0x000F70, "Cn" },
{ 0x000F71, 0x000F7E, "Mn" },
{ 0x000F80, 0x000F84, "Mn" },
{ 0x000F86, 0x000F87, "Mn" },
{ 0x000F88, 0x000F8C, "Lo" },
{ 0x000F8D, 0x000F97, "Mn" },
{ 0x000F99, 0x000FBC, "Mn" },
{ 0x000FBE, 0x000FC5, "So" },
{ 0x000FC7, 0x000FCC, "So" },
{ 0x000FCE, 0x000FCF, "So" },
{ 0x000FD0, 0x000FD4, "Po" },
{ 0x000FD5, 0x000FD8, "So" },
{ 0x000FD9, 0x000FDA, "Po" },
{ 0x000FDB, 0x000FFF, "Cn" },
{ 0x001000, 0x00102A, "Lo" },
{ 0x00102B, 0x00102C, "Mc" },
{ 0x00102D, 0x001030, "Mn" },
{ 0x001032, 0x001037, "Mn" },
{ 0x001039, 0x00103A, "Mn" },
{ 0x00103B, 0x00103C, "Mc" },
{ 0x00103D, 0x00103E, "Mn" },
{ 0x001040, 0x001049, "Nd" },
{ 0x00104A, 0x00104F, "Po" },
{ 0x001050, 0x001055, "Lo" },
{ 0x001056, 0x001057, "Mc" },
{ 0x001058, 0x001059, "Mn" },
{ 0x00105A, 0x00105D, "Lo" },
{ 0x00105E, 0x001060, "Mn" },
{ 0x001062, 0x001064, "Mc" },
{ 0x001065, 0x001066, "Lo" },
{ 0x001067, 0x00106D, "Mc" },
{ 0x00106E, 0x001070, "Lo" },
{ 0x001071, 0x001074, "Mn" },
{ 0x001075, 0x001081, "Lo" },
{ 0x001083, 0x001084, "Mc" },
{ 0x001085, 0x001086, "Mn" },
{ 0x001087, 0x00108C, "Mc" },
{ 0x001090, 0x001099, "Nd" },
{ 0x00109A, 0x00109C, "Mc" },
{ 0x00109E, 0x00109F, "So" },
{ 0x0010A0, 0x0010C5, "Lu" },
{ 0x0010C8, 0x0010CC, "Cn" },
{ 0x0010CE, 0x0010CF, "Cn" },
{ 0x0010D0, 0x0010FA, "Ll" },
{ 0x0010FD, 0x0010FF, "Ll" },
{ 0x001100, 0x001248, "Lo" },
{ 0x00124A, 0x00124D, "Lo" },
{ 0x00124E, 0x00124F, "Cn" },
{ 0x001250, 0x001256, "Lo" },
{ 0x00125A, 0x00125D, "Lo" },
{ 0x00125E, 0x00125F, "Cn" },
{ 0x001260, 0x001288, "Lo" },
{ 0x00128A, 0x00128D, "Lo" },
{ 0x00128E, 0x00128F, "Cn" },
{ 0x001290, 0x0012B0, "Lo" },
{ 0x0012B2, 0x0012B5, "Lo" },
{ 0x0012B6, 0x0012B7, "Cn" },
{ 0x0012B8, 0x0012BE, "Lo" },
{ 0x0012C2, 0x0012C5, "Lo" },
{ 0x0012C6, 0x0012C7, "Cn" },
{ 0x0012C8, 0x0012D6, "Lo" },
{ 0x0012D8, 0x001310, "Lo" },
{ 0x001312, 0x001315, "Lo" },
{ 0x001316, 0x001317, "Cn" },
{ 0x001318, 0x00135A, "Lo" },
{ 0x00135B, 0x00135C, "Cn" },
{ 0x00135D, 0x00135F, "Mn" },
{ 0x001360, 0x001368, "Po" },
{ 0x001369, 0x00137C, "No" },
{ 0x00137D, 0x00137F, "Cn" },
{ 0x001380, 0x00138F, "Lo" },
{ 0x001390, 0x001399, "So" },
{ 0x00139A, 0x00139F, "Cn" },
{ 0x0013A0, 0x0013F5, "Lu" },
{ 0x0013F6, 0x0013F7, "Cn" },
{ 0x0013F8, 0x0013FD, "Ll" },
{ 0x0013FE, 0x0013FF, "Cn" },
{ 0x001401, 0x00166C, "Lo" },
{ 0x00166F, 0x00167F, "Lo" },
{ 0x001681, 0x00169A, "Lo" },
{ 0x00169D, 0x00169F, "Cn" },
{ 0x0016A0, 0x0016EA, "Lo" },
{ 0x0016EB, 0x0016ED, "Po" },
{ 0x0016EE, 0x0016F0, "Nl" },
{ 0x0016F1, 0x0016F8, "Lo" },
{ 0x0016F9, 0x0016FF, "Cn" },
{ 0x001700, 0x001711, "Lo" },
{ 0x001712, 0x001714, "Mn" },
{ 0x001716, 0x00171E, "Cn" },
{ 0x00171F, 0x001731, "Lo" },
{ 0x001732, 0x001733, "Mn" },
{ 0x001735, 0x001736, "Po" },
{ 0x001737, 0x00173F, "Cn" },
{ 0x001740, 0x001751, "Lo" },
{ 0x001752, 0x001753, "Mn" },
{ 0x001754, 0x00175F, "Cn" },
{ 0x001760, 0x00176C, "Lo" },
{ 0x00176E, 0x001770, "Lo" },
{ 0x001772, 0x001773, "Mn" },
{ 0x001774, 0x00177F, "Cn" },
{ 0x001780, 0x0017B3, "Lo" },
{ 0x0017B4, 0x0017B5, "Mn" },
{ 0x0017B7, 0x0017BD, "Mn" },
{ 0x0017BE, 0x0017C5, "Mc" },
{ 0x0017C7, 0x0017C8, "Mc" },
{ 0x0017C9, 0x0017D3, "Mn" },
{ 0x0017D4, 0x0017D6, "Po" },
{ 0x0017D8, 0x0017DA, "Po" },
{ 0x0017DE, 0x0017DF, "Cn" },
{ 0x0017E0, 0x0017E9, "Nd" },
{ 0x0017EA, 0x0017EF, "Cn" },
{ 0x0017F0, 0x0017F9, "No" },
{ 0x0017FA, 0x0017FF, "Cn" },
{ 0x001800, 0x001805, "Po" },
{ 0x001807, 0x00180A, "Po" },
{ 0x00180B, 0x00180D, "Mn" },
{ 0x001810, 0x001819, "Nd" },
{ 0x00181A, 0x00181F, "Cn" },
{ 0x001820, 0x001842, "Lo" },
{ 0x001844, 0x001878, "Lo" },
{ 0x001879, 0x00187F, "Cn" },
{ 0x001880, 0x001884, "Lo" },
{ 0x001885, 0x001886, "Mn" },
{ 0x001887, 0x0018A8, "Lo" },
{ 0x0018AB, 0x0018AF, "Cn" },
{ 0x0018B0, 0x0018F5, "Lo" },
{ 0x0018F6, 0x0018FF, "Cn" },
{ 0x001900, 0x00191E, "Lo" },
{ 0x001920, 0x001922, "Mn" },
{ 0x001923, 0x001926, "Mc" },
{ 0x001927, 0x001928, "Mn" },
{ 0x001929, 0x00192B, "Mc" },
{ 0x00192C, 0x00192F, "Cn" },
{ 0x001930, 0x001931, "Mc" },
{ 0x001933, 0x001938, "Mc" },
{ 0x001939, 0x00193B, "Mn" },
{ 0x00193C, 0x00193F, "Cn" },
{ 0x001941, 0x001943, "Cn" },
{ 0x001944, 0x001945, "Po" },
{ 0x001946, 0x00194F, "Nd" },
{ 0x001950, 0x00196D, "Lo" },
{ 0x00196E, 0x00196F, "Cn" },
{ 0x001970, 0x001974, "Lo" },
{ 0x001975, 0x00197F, "Cn" },
{ 0x001980, 0x0019AB, "Lo" },
{ 0x0019AC, 0x0019AF, "Cn" },
{ 0x0019B0, 0x0019C9, "Lo" },
{ 0x0019CA, 0x0019CF, "Cn" },
{ 0x0019D0, 0x0019D9, "Nd" },
{ 0x0019DB, 0x0019DD, "Cn" },
{ 0x0019DE, 0x0019FF, "So" },
{ 0x001A00, 0x001A16, "Lo" },
{ 0x001A17, 0x001A18, "Mn" },
{ 0x001A19, 0x001A1A, "Mc" },
{ 0x001A1C, 0x001A1D, "Cn" },
{ 0x001A1E, 0x001A1F, "Po" },
{ 0x001A20, 0x001A54, "Lo" },
{ 0x001A58, 0x001A5E, "Mn" },
{ 0x001A63, 0x001A64, "Mc" },
{ 0x001A65, 0x001A6C, "Mn" },
{ 0x001A6D, 0x001A72, "Mc" },
{ 0x001A73, 0x001A7C, "Mn" },
{ 0x001A7D, 0x001A7E, "Cn" },
{ 0x001A80, 0x001A89, "Nd" },
{ 0x001A8A, 0x001A8F, "Cn" },
{ 0x001A90, 0x001A99, "Nd" },
{ 0x001A9A, 0x001A9F, "Cn" },
{ 0x001AA0, 0x001AA6, "Po" },
{ 0x001AA8, 0x001AAD, "Po" },
{ 0x001AAE, 0x001AAF, "Cn" },
{ 0x001AB0, 0x001ABD, "Mn" },
{ 0x001ABF, 0x001ADD, "Mn" },
{ 0x001ADE, 0x001ADF, "Cn" },
{ 0x001AE0, 0x001AEB, "Mn" },
{ 0x001AEC, 0x001AFF, "Cn" },
{ 0x001B00, 0x001B03, "Mn" },
{ 0x001B05, 0x001B33, "Lo" },
{ 0x001B36, 0x001B3A, "Mn" },
{ 0x001B3D, 0x001B41, "Mc" },
{ 0x001B43, 0x001B44, "Mc" },
{ 0x001B45, 0x001B4C, "Lo" },
{ 0x001B4E, 0x001B4F, "Po" },
{ 0x001B50, 0x001B59, "Nd" },
{ 0x001B5A, 0x001B60, "Po" },
{ 0x001B61, 0x001B6A, "So" },
{ 0x001B6B, 0x001B73, "Mn" },
{ 0x001B74, 0x001B7C, "So" },
{ 0x001B7D, 0x001B7F, "Po" },
{ 0x001B80, 0x001B81, "Mn" },
{ 0x001B83, 0x001BA0, "Lo" },
{ 0x001BA2, 0x001BA5, "Mn" },
{ 0x001BA6, 0x001BA7, "Mc" },
{ 0x001BA8, 0x001BA9, "Mn" },
{ 0x001BAB, 0x001BAD, "Mn" },
{ 0x001BAE, 0x001BAF, "Lo" },
{ 0x001BB0, 0x001BB9, "Nd" },
{ 0x001BBA, 0x001BE5, "Lo" },
{ 0x001BE8, 0x001BE9, "Mn" },
{ 0x001BEA, 0x001BEC, "Mc" },
{ 0x001BEF, 0x001BF1, "Mn" },
{ 0x001BF2, 0x001BF3, "Mc" },
{ 0x001BF4, 0x001BFB, "Cn" },
{ 0x001BFC, 0x001BFF, "Po" },
{ 0x001C00, 0x001C23, "Lo" },
{ 0x001C24, 0x001C2B, "Mc" },
{ 0x001C2C, 0x001C33, "Mn" },
{ 0x001C34, 0x001C35, "Mc" },
{ 0x001C36, 0x001C37, "Mn" },
{ 0x001C38, 0x001C3A, "Cn" },
{ 0x001C3B, 0x001C3F, "Po" },
{ 0x001C40, 0x001C49, "Nd" },
{ 0x001C4A, 0x001C4C, "Cn" },
{ 0x001C4D, 0x001C4F, "Lo" },
{ 0x001C50, 0x001C59, "Nd" },
{ 0x001C5A, 0x001C77, "Lo" },
{ 0x001C78, 0x001C7D, "Lm" },
{ 0x001C7E, 0x001C7F, "Po" },
{ 0x001C80, 0x001C88, "Ll" },
{ 0x001C8B, 0x001C8F, "Cn" },
{ 0x001C90, 0x001CBA, "Lu" },
{ 0x001CBB, 0x001CBC, "Cn" },
{ 0x001CBD, 0x001CBF, "Lu" },
{ 0x001CC0, 0x001CC7, "Po" },
{ 0x001CC8, 0x001CCF, "Cn" },
{ 0x001CD0, 0x001CD2, "Mn" },
{ 0x001CD4, 0x001CE0, "Mn" },
{ 0x001CE2, 0x001CE8, "Mn" },
{ 0x001CE9, 0x001CEC, "Lo" },
{ 0x001CEE, 0x001CF3, "Lo" },
{ 0x001CF5, 0x001CF6, "Lo" },
{ 0x001CF8, 0x001CF9, "Mn" },
{ 0x001CFB, 0x001CFF, "Cn" },
{ 0x001D00, 0x001D2B, "Ll" },
{ 0x001D2C, 0x001D6A, "Lm" },
{ 0x001D6B, 0x001D77, "Ll" },
{ 0x001D79, 0x001D9A, "Ll" },
{ 0x001D9B, 0x001DBF, "Lm" },
{ 0x001DC0, 0x001DFF, "Mn" },
{ 0x001E95, 0x001E9D, "Ll" },
{ 0x001EFF, 0x001F07, "Ll" },
{ 0x001F08, 0x001F0F, "Lu" },
{ 0x001F10, 0x001F15, "Ll" },
{ 0x001F16, 0x001F17, "Cn" },
{ 0x001F18, 0x001F1D, "Lu" },
{ 0x001F1E, 0x001F1F, "Cn" },
{ 0x001F20, 0x001F27, "Ll" },
{ 0x001F28, 0x001F2F, "Lu" },
{ 0x001F30, 0x001F37, "Ll" },
{ 0x001F38, 0x001F3F, "Lu" },
{ 0x001F40, 0x001F45, "Ll" },
{ 0x001F46, 0x001F47, "Cn" },
{ 0x001F48, 0x001F4D, "Lu" },
{ 0x001F4E, 0x001F4F, "Cn" },
{ 0x001F50, 0x001F57, "Ll" },
{ 0x001F60, 0x001F67, "Ll" },
{ 0x001F68, 0x001F6F, "Lu" },
{ 0x001F70, 0x001F7D, "Ll" },
{ 0x001F7E, 0x001F7F, "Cn" },
{ 0x001F80, 0x001F87, "Ll" },
{ 0x001F88, 0x001F8F, "Lt" },
{ 0x001F90, 0x001F97, "Ll" },
{ 0x001F98, 0x001F9F, "Lt" },
{ 0x001FA0, 0x001FA7, "Ll" },
{ 0x001FA8, 0x001FAF, "Lt" },
{ 0x001FB0, 0x001FB4, "Ll" },
{ 0x001FB6, 0x001FB7, "Ll" },
{ 0x001FB8, 0x001FBB, "Lu" },
{ 0x001FBF, 0x001FC1, "Sk" },
{ 0x001FC2, 0x001FC4, "Ll" },
{ 0x001FC6, 0x001FC7, "Ll" },
{ 0x001FC8, 0x001FCB, "Lu" },
{ 0x001FCD, 0x001FCF, "Sk" },
{ 0x001FD0, 0x001FD3, "Ll" },
{ 0x001FD4, 0x001FD5, "Cn" },
{ 0x001FD6, 0x001FD7, "Ll" },
{ 0x001FD8, 0x001FDB, "Lu" },
{ 0x001FDD, 0x001FDF, "Sk" },
{ 0x001FE0, 0x001FE7, "Ll" },
{ 0x001FE8, 0x001FEC, "Lu" },
{ 0x001FED, 0x001FEF, "Sk" },
{ 0x001FF0, 0x001FF1, "Cn" },
{ 0x001FF2, 0x001FF4, "Ll" },
{ 0x001FF6, 0x001FF7, "Ll" },
{ 0x001FF8, 0x001FFB, "Lu" },
{ 0x001FFD, 0x001FFE, "Sk" },
{ 0x002000, 0x00200A, "Zs" },
{ 0x00200B, 0x00200F, "Cf" },
{ 0x002010, 0x002015, "Pd" },
{ 0x002016, 0x002017, "Po" },
{ 0x00201B, 0x00201C, "Pi" },
{ 0x002020, 0x002027, "Po" },
{ 0x00202A, 0x00202E, "Cf" },
{ 0x002030, 0x002038, "Po" },
{ 0x00203B, 0x00203E, "Po" },
{ 0x00203F, 0x002040, "Pc" },
{ 0x002041, 0x002043, "Po" },
{ 0x002047, 0x002051, "Po" },
{ 0x002055, 0x00205E, "Po" },
{ 0x002060, 0x002064, "Cf" },
{ 0x002066, 0x00206F, "Cf" },
{ 0x002072, 0x002073, "Cn" },
{ 0x002074, 0x002079, "No" },
{ 0x00207A, 0x00207C, "Sm" },
{ 0x002080, 0x002089, "No" },
{ 0x00208A, 0x00208C, "Sm" },
{ 0x002090, 0x00209C, "Lm" },
{ 0x00209D, 0x00209F, "Cn" },
{ 0x0020A0, 0x0020C1, "Sc" },
{ 0x0020C2, 0x0020CF, "Cn" },
{ 0x0020D0, 0x0020DC, "Mn" },
{ 0x0020DD, 0x0020E0, "Me" },
{ 0x0020E2, 0x0020E4, "Me" },
{ 0x0020E5, 0x0020F0, "Mn" },
{ 0x0020F1, 0x0020FF, "Cn" },
{ 0x002100, 0x002101, "So" },
{ 0x002103, 0x002106, "So" },
{ 0x002108, 0x002109, "So" },
{ 0x00210B, 0x00210D, "Lu" },
{ 0x00210E, 0x00210F, "Ll" },
{ 0x002110, 0x002112, "Lu" },
{ 0x002116, 0x002117, "So" },
{ 0x002119, 0x00211D, "Lu" },
{ 0x00211E, 0x002123, "So" },
{ 0x00212A, 0x00212D, "Lu" },
{ 0x002130, 0x002133, "Lu" },
{ 0x002135, 0x002138, "Lo" },
{ 0x00213A, 0x00213B, "So" },
{ 0x00213C, 0x00213D, "Ll" },
{ 0x00213E, 0x00213F, "Lu" },
{ 0x002140, 0x002144, "Sm" },
{ 0x002146, 0x002149, "Ll" },
{ 0x00214C, 0x00214D, "So" },
{ 0x002150, 0x00215F, "No" },
{ 0x002160, 0x002182, "Nl" },
{ 0x002185, 0x002188, "Nl" },
{ 0x00218A, 0x00218B, "So" },
{ 0x00218C, 0x00218F, "Cn" },
{ 0x002190, 0x002194, "Sm" },
{ 0x002195, 0x002199, "So" },
{ 0x00219A, 0x00219B, "Sm" },
{ 0x00219C, 0x00219F, "So" },
{ 0x0021A1, 0x0021A2, "So" },
{ 0x0021A4, 0x0021A5, "So" },
{ 0x0021A7, 0x0021AD, "So" },
{ 0x0021AF, 0x0021CD, "So" },
{ 0x0021CE, 0x0021CF, "Sm" },
{ 0x0021D0, 0x0021D1, "So" },
{ 0x0021D5, 0x0021F3, "So" },
{ 0x0021F4, 0x0022FF, "Sm" },
{ 0x002300, 0x002307, "So" },
{ 0x00230C, 0x00231F, "So" },
{ 0x002320, 0x002321, "Sm" },
{ 0x002322, 0x002328, "So" },
{ 0x00232B, 0x00237B, "So" },
{ 0x00237D, 0x00239A, "So" },
{ 0x00239B, 0x0023B3, "Sm" },
{ 0x0023B4, 0x0023DB, "So" },
{ 0x0023DC, 0x0023E1, "Sm" },
{ 0x0023E2, 0x002429, "So" },
{ 0x00242A, 0x00243F, "Cn" },
{ 0x002440, 0x00244A, "So" },
{ 0x00244B, 0x00245F, "Cn" },
{ 0x002460, 0x00249B, "No" },
{ 0x00249C, 0x0024E9, "So" },
{ 0x0024EA, 0x0024FF, "No" },
{ 0x002500, 0x0025B6, "So" },
{ 0x0025B8, 0x0025C0, "So" },
{ 0x0025C2, 0x0025F7, "So" },
{ 0x0025F8, 0x0025FF, "Sm" },
{ 0x002600, 0x00266E, "So" },
{ 0x002670, 0x002767, "So" },
{ 0x002776, 0x002793, "No" },
{ 0x002794, 0x0027BF, "So" },
{ 0x0027C0, 0x0027C4, "Sm" },
{ 0x0027C7, 0x0027E5, "Sm" },
{ 0x0027F0, 0x0027FF, "Sm" },
{ 0x002800, 0x0028FF, "So" },
{ 0x002900, 0x002982, "Sm" },
{ 0x002999, 0x0029D7, "Sm" },
{ 0x0029DC, 0x0029FB, "Sm" },
{ 0x0029FE, 0x002AFF, "Sm" },
{ 0x002B00, 0x002B2F, "So" },
{ 0x002B30, 0x002B44, "Sm" },
{ 0x002B45, 0x002B46, "So" },
{ 0x002B47, 0x002B4C, "Sm" },
{ 0x002B4D, 0x002B73, "So" },
{ 0x002B74, 0x002B75, "Cn" },
{ 0x002B76, 0x002BFF, "So" },
{ 0x002C00, 0x002C2F, "Lu" },
{ 0x002C30, 0x002C5F, "Ll" },
{ 0x002C62, 0x002C64, "Lu" },
{ 0x002C65, 0x002C66, "Ll" },
{ 0x002C6D, 0x002C70, "Lu" },
{ 0x002C73, 0x002C74, "Ll" },
{ 0x002C76, 0x002C7B, "Ll" },
{ 0x002C7C, 0x002C7D, "Lm" },
{ 0x002C7E, 0x002C80, "Lu" },
{ 0x002CE3, 0x002CE4, "Ll" },
{ 0x002CE5, 0x002CEA, "So" },
{ 0x002CEF, 0x002CF1, "Mn" },
{ 0x002CF4, 0x002CF8, "Cn" },
{ 0x002CF9, 0x002CFC, "Po" },
{ 0x002CFE, 0x002CFF, "Po" },
{ 0x002D00, 0x002D25, "Ll" },
{ 0x002D28, 0x002D2C, "Cn" },
{ 0x002D2E, 0x002D2F, "Cn" },
{ 0x002D30, 0x002D67, "Lo" },
{ 0x002D68, 0x002D6E, "Cn" },
{ 0x002D71, 0x002D7E, "Cn" },
{ 0x002D80, 0x002D96, "Lo" },
{ 0x002D97, 0x002D9F, "Cn" },
{ 0x002DA0, 0x002DA6, "Lo" },
{ 0x002DA8, 0x002DAE, "Lo" },
{ 0x002DB0, 0x002DB6, "Lo" },
{ 0x002DB8, 0x002DBE, "Lo" },
{ 0x002DC0, 0x002DC6, "Lo" },
{ 0x002DC8, 0x002DCE, "Lo" },
{ 0x002DD0, 0x002DD6, "Lo" },
{ 0x002DD8, 0x002DDE, "Lo" },
{ 0x002DE0, 0x002DFF, "Mn" },
{ 0x002E00, 0x002E01, "Po" },
{ 0x002E06, 0x002E08, "Po" },
{ 0x002E0E, 0x002E16, "Po" },
{ 0x002E18, 0x002E19, "Po" },
{ 0x002E1E, 0x002E1F, "Po" },
{ 0x002E2A, 0x002E2E, "Po" },
{ 0x002E30, 0x002E39, "Po" },
{ 0x002E3A, 0x002E3B, "Pd" },
{ 0x002E3C, 0x002E3F, "Po" },
{ 0x002E43, 0x002E4F, "Po" },
{ 0x002E50, 0x002E51, "So" },
{ 0x002E52, 0x002E54, "Po" },
{ 0x002E5E, 0x002E7F, "Cn" },
{ 0x002E80, 0x002E99, "So" },
{ 0x002E9B, 0x002EF3, "So" },
{ 0x002EF4, 0x002EFF, "Cn" },
{ 0x002F00, 0x002FD5, "So" },
{ 0x002FD6, 0x002FEF, "Cn" },
{ 0x002FF0, 0x002FFF, "So" },
{ 0x003001, 0x003003, "Po" },
{ 0x003012, 0x003013, "So" },
{ 0x00301E, 0x00301F, "Pe" },
{ 0x003021, 0x003029, "Nl" },
{ 0x00302A, 0x00302D, "Mn" },
{ 0x00302E, 0x00302F, "Mc" },
{ 0x003031, 0x003035, "Lm" },
{ 0x003036, 0x003037, "So" },
{ 0x003038, 0x00303A, "Nl" },
{ 0x00303E, 0x00303F, "So" },
{ 0x003041, 0x003096, "Lo" },
{ 0x003097, 0x003098, "Cn" },
{ 0x003099, 0x00309A, "Mn" },
{ 0x00309B, 0x00309C, "Sk" },
{ 0x00309D, 0x00309E, "Lm" },
{ 0x0030A1, 0x0030FA, "Lo" },
{ 0x0030FC, 0x0030FE, "Lm" },
{ 0x003100, 0x003104, "Cn" },
{ 0x003105, 0x00312F, "Lo" },
{ 0x003131, 0x00318E, "Lo" },
{ 0x003190, 0x003191, "So" },
{ 0x003192, 0x003195, "No" },
{ 0x003196, 0x00319F, "So" },
{ 0x0031A0, 0x0031BF, "Lo" },
{ 0x0031C0, 0x0031E5, "So" },
{ 0x0031E6, 0x0031EE, "Cn" },
{ 0x0031F0, 0x0031FF, "Lo" },
{ 0x003200, 0x00321E, "So" },
{ 0x003220, 0x003229, "No" },
{ 0x00322A, 0x003247, "So" },
{ 0x003248, 0x00324F, "No" },
{ 0x003251, 0x00325F, "No" },
{ 0x003260, 0x00327F, "So" },
{ 0x003280, 0x003289, "No" },
{ 0x00328A, 0x0032B0, "So" },
{ 0x0032B1, 0x0032BF, "No" },
{ 0x0032C0, 0x0033FF, "So" },
{ 0x003400, 0x004DBF, "Lo" },
{ 0x004DC0, 0x004DFF, "So" },
{ 0x004E00, 0x00A014, "Lo" },
{ 0x00A016, 0x00A48C, "Lo" },
{ 0x00A48D, 0x00A48F, "Cn" },
{ 0x00A490, 0x00A4C6, "So" },
{ 0x00A4C7, 0x00A4CF, "Cn" },
{ 0x00A4D0, 0x00A4F7, "Lo" },
{ 0x00A4F8, 0x00A4FD, "Lm" },
{ 0x00A4FE, 0x00A4FF, "Po" },
{ 0x00A500, 0x00A60B, "Lo" },
{ 0x00A60D, 0x00A60F, "Po" },
{ 0x00A610, 0x00A61F, "Lo" },
{ 0x00A620, 0x00A629, "Nd" },
{ 0x00A62A, 0x00A62B, "Lo" },
{ 0x00A62C, 0x00A63F, "Cn" },
{ 0x00A670, 0x00A672, "Me" },
{ 0x00A674, 0x00A67D, "Mn" },
{ 0x00A69C, 0x00A69D, "Lm" },
{ 0x00A69E, 0x00A69F, "Mn" },
{ 0x00A6A0, 0x00A6E5, "Lo" },
{ 0x00A6E6, 0x00A6EF, "Nl" },
{ 0x00A6F0, 0x00A6F1, "Mn" },
{ 0x00A6F2, 0x00A6F7, "Po" },
{ 0x00A6F8, 0x00A6FF, "Cn" },
{ 0x00A700, 0x00A716, "Sk" },
{ 0x00A717, 0x00A71F, "Lm" },
{ 0x00A720, 0x00A721, "Sk" },
{ 0x00A72F, 0x00A731, "Ll" },
{ 0x00A771, 0x00A778, "Ll" },
{ 0x00A77D, 0x00A77E, "Lu" },
{ 0x00A789, 0x00A78A, "Sk" },
{ 0x00A793, 0x00A795, "Ll" },
{ 0x00A7AA, 0x00A7AE, "Lu" },
{ 0x00A7B0, 0x00A7B4, "Lu" },
{ 0x00A7C4, 0x00A7C7, "Lu" },
{ 0x00A7CB, 0x00A7CC, "Lu" },
{ 0x00A7DD, 0x00A7F0, "Cn" },
{ 0x00A7F1, 0x00A7F4, "Lm" },
{ 0x00A7F8, 0x00A7F9, "Lm" },
{ 0x00A7FB, 0x00A801, "Lo" },
{ 0x00A803, 0x00A805, "Lo" },
{ 0x00A807, 0x00A80A, "Lo" },
{ 0x00A80C, 0x00A822, "Lo" },
{ 0x00A823, 0x00A824, "Mc" },
{ 0x00A825, 0x00A826, "Mn" },
{ 0x00A828, 0x00A82B, "So" },
{ 0x00A82D, 0x00A82F, "Cn" },
{ 0x00A830, 0x00A835, "No" },
{ 0x00A836, 0x00A837, "So" },
{ 0x00A83A, 0x00A83F, "Cn" },
{ 0x00A840, 0x00A873, "Lo" },
{ 0x00A874, 0x00A877, "Po" },
{ 0x00A878, 0x00A87F, "Cn" },
{ 0x00A880, 0x00A881, "Mc" },
{ 0x00A882, 0x00A8B3, "Lo" },
{ 0x00A8B4, 0x00A8C3, "Mc" },
{ 0x00A8C4, 0x00A8C5, "Mn" },
{ 0x00A8C6, 0x00A8CD, "Cn" },
{ 0x00A8CE, 0x00A8CF, "Po" },
{ 0x00A8D0, 0x00A8D9, "Nd" },
{ 0x00A8DA, 0x00A8DF, "Cn" },
{ 0x00A8E0, 0x00A8F1, "Mn" },
{ 0x00A8F2, 0x00A8F7, "Lo" },
{ 0x00A8F8, 0x00A8FA, "Po" },
{ 0x00A8FD, 0x00A8FE, "Lo" },
{ 0x00A900, 0x00A909, "Nd" },
{ 0x00A90A, 0x00A925, "Lo" },
{ 0x00A926, 0x00A92D, "Mn" },
{ 0x00A92E, 0x00A92F, "Po" },
{ 0x00A930, 0x00A946, "Lo" },
{ 0x00A947, 0x00A951, "Mn" },
{ 0x00A952, 0x00A953, "Mc" },
{ 0x00A954, 0x00A95E, "Cn" },
{ 0x00A960, 0x00A97C, "Lo" },
{ 0x00A97D, 0x00A97F, "Cn" },
{ 0x00A980, 0x00A982, "Mn" },
{ 0x00A984, 0x00A9B2, "Lo" },
{ 0x00A9B4, 0x00A9B5, "Mc" },
{ 0x00A9B6, 0x00A9B9, "Mn" },
{ 0x00A9BA, 0x00A9BB, "Mc" },
{ 0x00A9BC, 0x00A9BD, "Mn" },
{ 0x00A9BE, 0x00A9C0, "Mc" },
{ 0x00A9C1, 0x00A9CD, "Po" },
{ 0x00A9D0, 0x00A9D9, "Nd" },
{ 0x00A9DA, 0x00A9DD, "Cn" },
{ 0x00A9DE, 0x00A9DF, "Po" },
{ 0x00A9E0, 0x00A9E4, "Lo" },
{ 0x00A9E7, 0x00A9EF, "Lo" },
{ 0x00A9F0, 0x00A9F9, "Nd" },
{ 0x00A9FA, 0x00A9FE, "Lo" },
{ 0x00AA00, 0x00AA28, "Lo" },
{ 0x00AA29, 0x00AA2E, "Mn" },
{ 0x00AA2F, 0x00AA30, "Mc" },
{ 0x00AA31, 0x00AA32, "Mn" },
{ 0x00AA33, 0x00AA34, "Mc" },
{ 0x00AA35, 0x00AA36, "Mn" },
{ 0x00AA37, 0x00AA3F, "Cn" },
{ 0x00AA40, 0x00AA42, "Lo" },
{ 0x00AA44, 0x00AA4B, "Lo" },
{ 0x00AA4E, 0x00AA4F, "Cn" },
{ 0x00AA50, 0x00AA59, "Nd" },
{ 0x00AA5A, 0x00AA5B, "Cn" },
{ 0x00AA5C, 0x00AA5F, "Po" },
{ 0x00AA60, 0x00AA6F, "Lo" },
{ 0x00AA71, 0x00AA76, "Lo" },
{ 0x00AA77, 0x00AA79, "So" },
{ 0x00AA7E, 0x00AAAF, "Lo" },
{ 0x00AAB2, 0x00AAB4, "Mn" },
{ 0x00AAB5, 0x00AAB6, "Lo" },
{ 0x00AAB7, 0x00AAB8, "Mn" },
{ 0x00AAB9, 0x00AABD, "Lo" },
{ 0x00AABE, 0x00AABF, "Mn" },
{ 0x00AAC3, 0x00AADA, "Cn" },
{ 0x00AADB, 0x00AADC, "Lo" },
{ 0x00AADE, 0x00AADF, "Po" },
{ 0x00AAE0, 0x00AAEA, "Lo" },
{ 0x00AAEC, 0x00AAED, "Mn" },
{ 0x00AAEE, 0x00AAEF, "Mc" },
{ 0x00AAF0, 0x00AAF1, "Po" },
{ 0x00AAF3, 0x00AAF4, "Lm" },
{ 0x00AAF7, 0x00AB00, "Cn" },
{ 0x00AB01, 0x00AB06, "Lo" },
{ 0x00AB07, 0x00AB08, "Cn" },
{ 0x00AB09, 0x00AB0E, "Lo" },
{ 0x00AB0F, 0x00AB10, "Cn" },
{ 0x00AB11, 0x00AB16, "Lo" },
{ 0x00AB17, 0x00AB1F, "Cn" },
{ 0x00AB20, 0x00AB26, "Lo" },
{ 0x00AB28, 0x00AB2E, "Lo" },
{ 0x00AB30, 0x00AB5A, "Ll" },
{ 0x00AB5C, 0x00AB5F, "Lm" },
{ 0x00AB60, 0x00AB68, "Ll" },
{ 0x00AB6A, 0x00AB6B, "Sk" },
{ 0x00AB6C, 0x00AB6F, "Cn" },
{ 0x00AB70, 0x00ABBF, "Ll" },
{ 0x00ABC0, 0x00ABE2, "Lo" },
{ 0x00ABE3, 0x00ABE4, "Mc" },
{ 0x00ABE6, 0x00ABE7, "Mc" },
{ 0x00ABE9, 0x00ABEA, "Mc" },
{ 0x00ABEE, 0x00ABEF, "Cn" },
{ 0x00ABF0, 0x00ABF9, "Nd" },
{ 0x00ABFA, 0x00ABFF, "Cn" },
{ 0x00AC00, 0x00D7A3, "Lo" },
{ 0x00D7A4, 0x00D7AF, "Cn" },
{ 0x00D7B0, 0x00D7C6, "Lo" },
{ 0x00D7C7, 0x00D7CA, "Cn" },
{ 0x00D7CB, 0x00D7FB, "Lo" },
{ 0x00D7FC, 0x00D7FF, "Cn" },
{ 0x00D800, 0x00DFFF, "Cs" },
{ 0x00E000, 0x00F8FF, "Co" },
{ 0x00F900, 0x00FA6D, "Lo" },
{ 0x00FA6E, 0x00FA6F, "Cn" },
{ 0x00FA70, 0x00FAD9, "Lo" },
{ 0x00FADA, 0x00FAFF, "Cn" },
{ 0x00FB00, 0x00FB06, "Ll" },
{ 0x00FB07, 0x00FB12, "Cn" },
{ 0x00FB13, 0x00FB17, "Ll" },
{ 0x00FB18, 0x00FB1C, "Cn" },
{ 0x00FB1F, 0x00FB28, "Lo" },
{ 0x00FB2A, 0x00FB36, "Lo" },
{ 0x00FB38, 0x00FB3C, "Lo" },
{ 0x00FB40, 0x00FB41, "Lo" },
{ 0x00FB43, 0x00FB44, "Lo" },
{ 0x00FB46, 0x00FBB1, "Lo" },
{ 0x00FBB2, 0x00FBC2, "Sk" },
{ 0x00FBC3, 0x00FBD2, "So" },
{ 0x00FBD3, 0x00FD3D, "Lo" },
{ 0x00FD40, 0x00FD4F, "So" },
{ 0x00FD50, 0x00FD8F, "Lo" },
{ 0x00FD90, 0x00FD91, "So" },
{ 0x00FD92, 0x00FDC7, "Lo" },
{ 0x00FDC8, 0x00FDCF, "So" },
{ 0x00FDD0, 0x00FDEF, "Cn" },
{ 0x00FDF0, 0x00FDFB, "Lo" },
{ 0x00FDFD, 0x00FDFF, "So" },
{ 0x00FE00, 0x00FE0F, "Mn" },
{ 0x00FE10, 0x00FE16, "Po" },
{ 0x00FE1A, 0x00FE1F, "Cn" },
{ 0x00FE20, 0x00FE2F, "Mn" },
{ 0x00FE31, 0x00FE32, "Pd" },
{ 0x00FE33, 0x00FE34, "Pc" },
{ 0x00FE45, 0x00FE46, "Po" },
{ 0x00FE49, 0x00FE4C, "Po" },
{ 0x00FE4D, 0x00FE4F, "Pc" },
{ 0x00FE50, 0x00FE52, "Po" },
{ 0x00FE54, 0x00FE57, "Po" },
{ 0x00FE5F, 0x00FE61, "Po" },
{ 0x00FE64, 0x00FE66, "Sm" },
{ 0x00FE6A, 0x00FE6B, "Po" },
{ 0x00FE6C, 0x00FE6F, "Cn" },
{ 0x00FE70, 0x00FE74, "Lo" },
{ 0x00FE76, 0x00FEFC, "Lo" },
{ 0x00FEFD, 0x00FEFE, "Cn" },
{ 0x00FF01, 0x00FF03, "Po" },
{ 0x00FF05, 0x00FF07, "Po" },
{ 0x00FF0E, 0x00FF0F, "Po" },
{ 0x00FF10, 0x00FF19, "Nd" },
{ 0x00FF1A, 0x00FF1B, "Po" },
{ 0x00FF1C, 0x00FF1E, "Sm" },
{ 0x00FF1F, 0x00FF20, "Po" },
{ 0x00FF21, 0x00FF3A, "Lu" },
{ 0x00FF41, 0x00FF5A, "Ll" },
{ 0x00FF64, 0x00FF65, "Po" },
{ 0x00FF66, 0x00FF6F, "Lo" },
{ 0x00FF71, 0x00FF9D, "Lo" },
{ 0x00FF9E, 0x00FF9F, "Lm" },
{ 0x00FFA0, 0x00FFBE, "Lo" },
{ 0x00FFBF, 0x00FFC1, "Cn" },
{ 0x00FFC2, 0x00FFC7, "Lo" },
{ 0x00FFC8, 0x00FFC9, "Cn" },
{ 0x00FFCA, 0x00FFCF, "Lo" },
{ 0x00FFD0, 0x00FFD1, "Cn" },
{ 0x00FFD2, 0x00FFD7, "Lo" },
{ 0x00FFD8, 0x00FFD9, "Cn" },
{ 0x00FFDA, 0x00FFDC, "Lo" },
{ 0x00FFDD, 0x00FFDF, "Cn" },
{ 0x00FFE0, 0x00FFE1, "Sc" },
{ 0x00FFE5, 0x00FFE6, "Sc" },
{ 0x00FFE9, 0x00FFEC, "Sm" },
{ 0x00FFED, 0x00FFEE, "So" },
{ 0x00FFEF, 0x00FFF8, "Cn" },
{ 0x00FFF9, 0x00FFFB, "Cf" },
{ 0x00FFFC, 0x00FFFD, "So" },
{ 0x00FFFE, 0x00FFFF, "Cn" },
{ 0x010000, 0x01000B, "Lo" },
{ 0x01000D, 0x010026, "Lo" },
{ 0x010028, 0x01003A, "Lo" },
{ 0x01003C, 0x01003D, "Lo" },
{ 0x01003F, 0x01004D, "Lo" },
{ 0x01004E, 0x01004F, "Cn" },
{ 0x010050, 0x01005D, "Lo" },
{ 0x01005E, 0x01007F, "Cn" },
{ 0x010080, 0x0100FA, "Lo" },
{ 0x0100FB, 0x0100FF, "Cn" },
{ 0x010100, 0x010102, "Po" },
{ 0x010103, 0x010106, "Cn" },
{ 0x010107, 0x010133, "No" },
{ 0x010134, 0x010136, "Cn" },
{ 0x010137, 0x01013F, "So" },
{ 0x010140, 0x010174, "Nl" },
{ 0x010175, 0x010178, "No" },
{ 0x010179, 0x010189, "So" },
{ 0x01018A, 0x01018B, "No" },
{ 0x01018C, 0x01018E, "So" },
{ 0x010190, 0x01019C, "So" },
{ 0x01019D, 0x01019F, "Cn" },
{ 0x0101A1, 0x0101CF, "Cn" },
{ 0x0101D0, 0x0101FC, "So" },
{ 0x0101FE, 0x01027F, "Cn" },
{ 0x010280, 0x01029C, "Lo" },
{ 0x01029D, 0x01029F, "Cn" },
{ 0x0102A0, 0x0102D0, "Lo" },
{ 0x0102D1, 0x0102DF, "Cn" },
{ 0x0102E1, 0x0102FB, "No" },
{ 0x0102FC, 0x0102FF, "Cn" },
{ 0x010300, 0x01031F, "Lo" },
{ 0x010320, 0x010323, "No" },
{ 0x010324, 0x01032C, "Cn" },
{ 0x01032D, 0x010340, "Lo" },
{ 0x010342, 0x010349, "Lo" },
{ 0x01034B, 0x01034F, "Cn" },
{ 0x010350, 0x010375, "Lo" },
{ 0x010376, 0x01037A, "Mn" },
{ 0x01037B, 0x01037F, "Cn" },
{ 0x010380, 0x01039D, "Lo" },
{ 0x0103A0, 0x0103C3, "Lo" },
{ 0x0103C4, 0x0103C7, "Cn" },
{ 0x0103C8, 0x0103CF, "Lo" },
{ 0x0103D1, 0x0103D5, "Nl" },
{ 0x0103D6, 0x0103FF, "Cn" },
{ 0x010400, 0x010427, "Lu" },
{ 0x010428, 0x01044F, "Ll" },
{ 0x010450, 0x01049D, "Lo" },
{ 0x01049E, 0x01049F, "Cn" },
{ 0x0104A0, 0x0104A9, "Nd" },
{ 0x0104AA, 0x0104AF, "Cn" },
{ 0x0104B0, 0x0104D3, "Lu" },
{ 0x0104D4, 0x0104D7, "Cn" },
{ 0x0104D8, 0x0104FB, "Ll" },
{ 0x0104FC, 0x0104FF, "Cn" },
{ 0x010500, 0x010527, "Lo" },
{ 0x010528, 0x01052F, "Cn" },
{ 0x010530, 0x010563, "Lo" },
{ 0x010564, 0x01056E, "Cn" },
{ 0x010570, 0x01057A, "Lu" },
{ 0x01057C, 0x01058A, "Lu" },
{ 0x01058C, 0x010592, "Lu" },
{ 0x010594, 0x010595, "Lu" },
{ 0x010597, 0x0105A1, "Ll" },
{ 0x0105A3, 0x0105B1, "Ll" },
{ 0x0105B3, 0x0105B9, "Ll" },
{ 0x0105BB, 0x0105BC, "Ll" },
{ 0x0105BD, 0x0105BF, "Cn" },
{ 0x0105C0, 0x0105F3, "Lo" },
{ 0x0105F4, 0x0105FF, "Cn" },
{ 0x010600, 0x010736, "Lo" },
{ 0x010737, 0x01073F, "Cn" },
{ 0x010740, 0x010755, "Lo" },
{ 0x010756, 0x01075F, "Cn" },
{ 0x010760, 0x010767, "Lo" },
{ 0x010768, 0x01077F, "Cn" },
{ 0x010780, 0x010785, "Lm" },
{ 0x010787, 0x0107B0, "Lm" },
{ 0x0107B2, 0x0107BA, "Lm" },
{ 0x0107BB, 0x0107FF, "Cn" },
{ 0x010800, 0x010805, "Lo" },
{ 0x010806, 0x010807, "Cn" },
{ 0x01080A, 0x010835, "Lo" },
{ 0x010837, 0x010838, "Lo" },
{ 0x010839, 0x01083B, "Cn" },
{ 0x01083D, 0x01083E, "Cn" },
{ 0x01083F, 0x010855, "Lo" },
{ 0x010858, 0x01085F, "No" },
{ 0x010860, 0x010876, "Lo" },
{ 0x010877, 0x010878, "So" },
{ 0x010879, 0x01087F, "No" },
{ 0x010880, 0x01089E, "Lo" },
{ 0x01089F, 0x0108A6, "Cn" },
{ 0x0108A7, 0x0108AF, "No" },
{ 0x0108B0, 0x0108DF, "Cn" },
{ 0x0108E0, 0x0108F2, "Lo" },
{ 0x0108F4, 0x0108F5, "Lo" },
{ 0x0108F6, 0x0108FA, "Cn" },
{ 0x0108FB, 0x0108FF, "No" },
{ 0x010900, 0x010915, "Lo" },
{ 0x010916, 0x01091B, "No" },
{ 0x01091C, 0x01091E, "Cn" },
{ 0x010920, 0x010939, "Lo" },
{ 0x01093A, 0x01093E, "Cn" },
{ 0x010940, 0x010959, "Lo" },
{ 0x01095A, 0x01097F, "Cn" },
{ 0x010980, 0x0109B7, "Lo" },
{ 0x0109B8, 0x0109BB, "Cn" },
{ 0x0109BC, 0x0109BD, "No" },
{ 0x0109BE, 0x0109BF, "Lo" },
{ 0x0109C0, 0x0109CF, "No" },
{ 0x0109D0, 0x0109D1, "Cn" },
{ 0x0109D2, 0x0109FF, "No" },
{ 0x010A01, 0x010A03, "Mn" },
{ 0x010A05, 0x010A06, "Mn" },
{ 0x010A07, 0x010A0B, "Cn" },
{ 0x010A0C, 0x010A0F, "Mn" },
{ 0x010A10, 0x010A13, "Lo" },
{ 0x010A15, 0x010A17, "Lo" },
{ 0x010A19, 0x010A35, "Lo" },
{ 0x010A36, 0x010A37, "Cn" },
{ 0x010A38, 0x010A3A, "Mn" },
{ 0x010A3B, 0x010A3E, "Cn" },
{ 0x010A40, 0x010A48, "No" },
{ 0x010A49, 0x010A4F, "Cn" },
{ 0x010A50, 0x010A58, "Po" },
{ 0x010A59, 0x010A5F, "Cn" },
{ 0x010A60, 0x010A7C, "Lo" },
{ 0x010A7D, 0x010A7E, "No" },
{ 0x010A80, 0x010A9C, "Lo" },
{ 0x010A9D, 0x010A9F, "No" },
{ 0x010AA0, 0x010ABF, "Cn" },
{ 0x010AC0, 0x010AC7, "Lo" },
{ 0x010AC9, 0x010AE4, "Lo" },
{ 0x010AE5, 0x010AE6, "Mn" },
{ 0x010AE7, 0x010AEA, "Cn" },
{ 0x010AEB, 0x010AEF, "No" },
{ 0x010AF0, 0x010AF6, "Po" },
{ 0x010AF7, 0x010AFF, "Cn" },
{ 0x010B00, 0x010B35, "Lo" },
{ 0x010B36, 0x010B38, "Cn" },
{ 0x010B39, 0x010B3F, "Po" },
{ 0x010B40, 0x010B55, "Lo" },
{ 0x010B56, 0x010B57, "Cn" },
{ 0x010B58, 0x010B5F, "No" },
{ 0x010B60, 0x010B72, "Lo" },
{ 0x010B73, 0x010B77, "Cn" },
{ 0x010B78, 0x010B7F, "No" },
{ 0x010B80, 0x010B91, "Lo" },
{ 0x010B92, 0x010B98, "Cn" },
{ 0x010B99, 0x010B9C, "Po" },
{ 0x010B9D, 0x010BA8, "Cn" },
{ 0x010BA9, 0x010BAF, "No" },
{ 0x010BB0, 0x010BFF, "Cn" },
{ 0x010C00, 0x010C48, "Lo" },
{ 0x010C49, 0x010C7F, "Cn" },
{ 0x010C80, 0x010CB2, "Lu" },
{ 0x010CB3, 0x010CBF, "Cn" },
{ 0x010CC0, 0x010CF2, "Ll" },
{ 0x010CF3, 0x010CF9, "Cn" },
{ 0x010CFA, 0x010CFF, "No" },
{ 0x010D00, 0x010D23, "Lo" },
{ 0x010D24, 0x010D27, "Mn" },
{ 0x010D28, 0x010D2F, "Cn" },
{ 0x010D30, 0x010D39, "Nd" },
{ 0x010D3A, 0x010D3F, "Cn" },
{ 0x010D40, 0x010D49, "Nd" },
{ 0x010D4A, 0x010D4D, "Lo" },
{ 0x010D50, 0x010D65, "Lu" },
{ 0x010D66, 0x010D68, "Cn" },
{ 0x010D69, 0x010D6D, "Mn" },
{ 0x010D70, 0x010D85, "Ll" },
{ 0x010D86, 0x010D8D, "Cn" },
{ 0x010D8E, 0x010D8F, "Sm" },
{ 0x010D90, 0x010E5F, "Cn" },
{ 0x010E60, 0x010E7E, "No" },
{ 0x010E80, 0x010EA9, "Lo" },
{ 0x010EAB, 0x010EAC, "Mn" },
{ 0x010EAE, 0x010EAF, "Cn" },
{ 0x010EB0, 0x010EB1, "Lo" },
{ 0x010EB2, 0x010EC1, "Cn" },
{ 0x010EC2, 0x010EC4, "Lo" },
{ 0x010EC6, 0x010EC7, "Lo" },
{ 0x010EC8, 0x010ECF, "Cn" },
{ 0x010ED1, 0x010ED8, "So" },
{ 0x010ED9, 0x010EF9, "Cn" },
{ 0x010EFA, 0x010EFF, "Mn" },
{ 0x010F00, 0x010F1C, "Lo" },
{ 0x010F1D, 0x010F26, "No" },
{ 0x010F28, 0x010F2F, "Cn" },
{ 0x010F30, 0x010F45, "Lo" },
{ 0x010F46, 0x010F50, "Mn" },
{ 0x010F51, 0x010F54, "No" },
{ 0x010F55, 0x010F59, "Po" },
{ 0x010F5A, 0x010F6F, "Cn" },
{ 0x010F70, 0x010F81, "Lo" },
{ 0x010F82, 0x010F85, "Mn" },
{ 0x010F86, 0x010F89, "Po" },
{ 0x010F8A, 0x010FAF, "Cn" },
{ 0x010FB0, 0x010FC4, "Lo" },
{ 0x010FC5, 0x010FCB, "No" },
{ 0x010FCC, 0x010FDF, "Cn" },
{ 0x010FE0, 0x010FF6, "Lo" },
{ 0x010FF7, 0x010FFF, "Cn" },
{ 0x011003, 0x011037, "Lo" },
{ 0x011038, 0x011046, "Mn" },
{ 0x011047, 0x01104D, "Po" },
{ 0x01104E, 0x011051, "Cn" },
{ 0x011052, 0x011065, "No" },
{ 0x011066, 0x01106F, "Nd" },
{ 0x011071, 0x011072, "Lo" },
{ 0x011073, 0x011074, "Mn" },
{ 0x011076, 0x01107E, "Cn" },
{ 0x01107F, 0x011081, "Mn" },
{ 0x011083, 0x0110AF, "Lo" },
{ 0x0110B0, 0x0110B2, "Mc" },
{ 0x0110B3, 0x0110B6, "Mn" },
{ 0x0110B7, 0x0110B8, "Mc" },
{ 0x0110B9, 0x0110BA, "Mn" },
{ 0x0110BB, 0x0110BC, "Po" },
{ 0x0110BE, 0x0110C1, "Po" },
{ 0x0110C3, 0x0110CC, "Cn" },
{ 0x0110CE, 0x0110CF, "Cn" },
{ 0x0110D0, 0x0110E8, "Lo" },
{ 0x0110E9, 0x0110EF, "Cn" },
{ 0x0110F0, 0x0110F9, "Nd" },
{ 0x0110FA, 0x0110FF, "Cn" },
{ 0x011100, 0x011102, "Mn" },
{ 0x011103, 0x011126, "Lo" },
{ 0x011127, 0x01112B, "Mn" },
{ 0x01112D, 0x011134, "Mn" },
{ 0x011136, 0x01113F, "Nd" },
{ 0x011140, 0x011143, "Po" },
{ 0x011145, 0x011146, "Mc" },
{ 0x011148, 0x01114F, "Cn" },
{ 0x011150, 0x011172, "Lo" },
{ 0x011174, 0x011175, "Po" },
{ 0x011177, 0x01117F, "Cn" },
{ 0x011180, 0x011181, "Mn" },
{ 0x011183, 0x0111B2, "Lo" },
{ 0x0111B3, 0x0111B5, "Mc" },
{ 0x0111B6, 0x0111BE, "Mn" },
{ 0x0111BF, 0x0111C0, "Mc" },
{ 0x0111C1, 0x0111C4, "Lo" },
{ 0x0111C5, 0x0111C8, "Po" },
{ 0x0111C9, 0x0111CC, "Mn" },
{ 0x0111D0, 0x0111D9, "Nd" },
{ 0x0111DD, 0x0111DF, "Po" },
{ 0x0111E1, 0x0111F4, "No" },
{ 0x0111F5, 0x0111FF, "Cn" },
{ 0x011200, 0x011211, "Lo" },
{ 0x011213, 0x01122B, "Lo" },
{ 0x01122C, 0x01122E, "Mc" },
{ 0x01122F, 0x011231, "Mn" },
{ 0x011232, 0x011233, "Mc" },
{ 0x011236, 0x011237, "Mn" },
{ 0x011238, 0x01123D, "Po" },
{ 0x01123F, 0x011240, "Lo" },
{ 0x011242, 0x01127F, "Cn" },
{ 0x011280, 0x011286, "Lo" },
{ 0x01128A, 0x01128D, "Lo" },
{ 0x01128F, 0x01129D, "Lo" },
{ 0x01129F, 0x0112A8, "Lo" },
{ 0x0112AA, 0x0112AF, "Cn" },
{ 0x0112B0, 0x0112DE, "Lo" },
{ 0x0112E0, 0x0112E2, "Mc" },
{ 0x0112E3, 0x0112EA, "Mn" },
{ 0x0112EB, 0x0112EF, "Cn" },
{ 0x0112F0, 0x0112F9, "Nd" },
{ 0x0112FA, 0x0112FF, "Cn" },
{ 0x011300, 0x011301, "Mn" },
{ 0x011302, 0x011303, "Mc" },
{ 0x011305, 0x01130C, "Lo" },
{ 0x01130D, 0x01130E, "Cn" },
{ 0x01130F, 0x011310, "Lo" },
{ 0x011311, 0x011312, "Cn" },
{ 0x011313, 0x011328, "Lo" },
{ 0x01132A, 0x011330, "Lo" },
{ 0x011332, 0x011333, "Lo" },
{ 0x011335, 0x011339, "Lo" },
{ 0x01133B, 0x01133C, "Mn" },
{ 0x01133E, 0x01133F, "Mc" },
{ 0x011341, 0x011344, "Mc" },
{ 0x011345, 0x011346, "Cn" },
{ 0x011347, 0x011348, "Mc" },
{ 0x011349, 0x01134A, "Cn" },
{ 0x01134B, 0x01134D, "Mc" },
{ 0x01134E, 0x01134F, "Cn" },
{ 0x011351, 0x011356, "Cn" },
{ 0x011358, 0x01135C, "Cn" },
{ 0x01135D, 0x011361, "Lo" },
{ 0x011362, 0x011363, "Mc" },
{ 0x011364, 0x011365, "Cn" },
{ 0x011366, 0x01136C, "Mn" },
{ 0x01136D, 0x01136F, "Cn" },
{ 0x011370, 0x011374, "Mn" },
{ 0x011375, 0x01137F, "Cn" },
{ 0x011380, 0x011389, "Lo" },
{ 0x01138C, 0x01138D, "Cn" },
{ 0x011390, 0x0113B5, "Lo" },
{ 0x0113B8, 0x0113BA, "Mc" },
{ 0x0113BB, 0x0113C0, "Mn" },
{ 0x0113C3, 0x0113C4, "Cn" },
{ 0x0113C7, 0x0113CA, "Mc" },
{ 0x0113CC, 0x0113CD, "Mc" },
{ 0x0113D4, 0x0113D5, "Po" },
{ 0x0113D7, 0x0113D8, "Po" },
{ 0x0113D9, 0x0113E0, "Cn" },
{ 0x0113E1, 0x0113E2, "Mn" },
{ 0x0113E3, 0x0113FF, "Cn" },
{ 0x011400, 0x011434, "Lo" },
{ 0x011435, 0x011437, "Mc" },
{ 0x011438, 0x01143F, "Mn" },
{ 0x011440, 0x011441, "Mc" },
{ 0x011442, 0x011444, "Mn" },
{ 0x011447, 0x01144A, "Lo" },
{ 0x01144B, 0x01144F, "Po" },
{ 0x011450, 0x011459, "Nd" },
{ 0x01145A, 0x01145B, "Po" },
{ 0x01145F, 0x011461, "Lo" },
{ 0x011462, 0x01147F, "Cn" },
{ 0x011480, 0x0114AF, "Lo" },
{ 0x0114B0, 0x0114B2, "Mc" },
{ 0x0114B3, 0x0114B8, "Mn" },
{ 0x0114BB, 0x0114BE, "Mc" },
{ 0x0114BF, 0x0114C0, "Mn" },
{ 0x0114C2, 0x0114C3, "Mn" },
{ 0x0114C4, 0x0114C5, "Lo" },
{ 0x0114C8, 0x0114CF, "Cn" },
{ 0x0114D0, 0x0114D9, "Nd" },
{ 0x0114DA, 0x01157F, "Cn" },
{ 0x011580, 0x0115AE, "Lo" },
{ 0x0115AF, 0x0115B1, "Mc" },
{ 0x0115B2, 0x0115B5, "Mn" },
{ 0x0115B6, 0x0115B7, "Cn" },
{ 0x0115B8, 0x0115BB, "Mc" },
{ 0x0115BC, 0x0115BD, "Mn" },
{ 0x0115BF, 0x0115C0, "Mn" },
{ 0x0115C1, 0x0115D7, "Po" },
{ 0x0115D8, 0x0115DB, "Lo" },
{ 0x0115DC, 0x0115DD, "Mn" },
{ 0x0115DE, 0x0115FF, "Cn" },
{ 0x011600, 0x01162F, "Lo" },
{ 0x011630, 0x011632, "Mc" },
{ 0x011633, 0x01163A, "Mn" },
{ 0x01163B, 0x01163C, "Mc" },
{ 0x01163F, 0x011640, "Mn" },
{ 0x011641, 0x011643, "Po" },
{ 0x011645, 0x01164F, "Cn" },
{ 0x011650, 0x011659, "Nd" },
{ 0x01165A, 0x01165F, "Cn" },
{ 0x011660, 0x01166C, "Po" },
{ 0x01166D, 0x01167F, "Cn" },
{ 0x011680, 0x0116AA, "Lo" },
{ 0x0116AE, 0x0116AF, "Mc" },
{ 0x0116B0, 0x0116B5, "Mn" },
{ 0x0116BA, 0x0116BF, "Cn" },
{ 0x0116C0, 0x0116C9, "Nd" },
{ 0x0116CA, 0x0116CF, "Cn" },
{ 0x0116D0, 0x0116E3, "Nd" },
{ 0x0116E4, 0x0116FF, "Cn" },
{ 0x011700, 0x01171A, "Lo" },
{ 0x01171B, 0x01171C, "Cn" },
{ 0x011720, 0x011721, "Mc" },
{ 0x011722, 0x011725, "Mn" },
{ 0x011727, 0x01172B, "Mn" },
{ 0x01172C, 0x01172F, "Cn" },
{ 0x011730, 0x011739, "Nd" },
{ 0x01173A, 0x01173B, "No" },
{ 0x01173C, 0x01173E, "Po" },
{ 0x011740, 0x011746, "Lo" },
{ 0x011747, 0x0117FF, "Cn" },
{ 0x011800, 0x01182B, "Lo" },
{ 0x01182C, 0x01182E, "Mc" },
{ 0x01182F, 0x011837, "Mn" },
{ 0x011839, 0x01183A, "Mn" },
{ 0x01183C, 0x01189F, "Cn" },
{ 0x0118A0, 0x0118BF, "Lu" },
{ 0x0118C0, 0x0118DF, "Ll" },
{ 0x0118E0, 0x0118E9, "Nd" },
{ 0x0118EA, 0x0118F2, "No" },
{ 0x0118F3, 0x0118FE, "Cn" },
{ 0x0118FF, 0x011906, "Lo" },
{ 0x011907, 0x011908, "Cn" },
{ 0x01190A, 0x01190B, "Cn" },
{ 0x01190C, 0x011913, "Lo" },
{ 0x011915, 0x011916, "Lo" },
{ 0x011918, 0x01192F, "Lo" },
{ 0x011930, 0x011935, "Mc" },
{ 0x011937, 0x011938, "Mc" },
{ 0x011939, 0x01193A, "Cn" },
{ 0x01193B, 0x01193C, "Mn" },
{ 0x011944, 0x011946, "Po" },
{ 0x011947, 0x01194F, "Cn" },
{ 0x011950, 0x011959, "Nd" },
{ 0x01195A, 0x01199F, "Cn" },
{ 0x0119A0, 0x0119A7, "Lo" },
{ 0x0119A8, 0x0119A9, "Cn" },
{ 0x0119AA, 0x0119D0, "Lo" },
{ 0x0119D1, 0x0119D3, "Mc" },
{ 0x0119D4, 0x0119D7, "Mn" },
{ 0x0119D8, 0x0119D9, "Cn" },
{ 0x0119DA, 0x0119DB, "Mn" },
{ 0x0119DC, 0x0119DF, "Mc" },
{ 0x0119E5, 0x0119FF, "Cn" },
{ 0x011A01, 0x011A0A, "Mn" },
{ 0x011A0B, 0x011A32, "Lo" },
{ 0x011A33, 0x011A38, "Mn" },
{ 0x011A3B, 0x011A3E, "Mn" },
{ 0x011A3F, 0x011A46, "Po" },
{ 0x011A48, 0x011A4F, "Cn" },
{ 0x011A51, 0x011A56, "Mn" },
{ 0x011A57, 0x011A58, "Mc" },
{ 0x011A59, 0x011A5B, "Mn" },
{ 0x011A5C, 0x011A89, "Lo" },
{ 0x011A8A, 0x011A96, "Mn" },
{ 0x011A98, 0x011A99, "Mn" },
{ 0x011A9A, 0x011A9C, "Po" },
{ 0x011A9E, 0x011AA2, "Po" },
{ 0x011AA3, 0x011AAF, "Cn" },
{ 0x011AB0, 0x011AF8, "Lo" },
{ 0x011AF9, 0x011AFF, "Cn" },
{ 0x011B00, 0x011B09, "Po" },
{ 0x011B0A, 0x011B5F, "Cn" },
{ 0x011B62, 0x011B64, "Mn" },
{ 0x011B68, 0x011BBF, "Cn" },
{ 0x011BC0, 0x011BE0, "Lo" },
{ 0x011BE2, 0x011BEF, "Cn" },
{ 0x011BF0, 0x011BF9, "Nd" },
{ 0x011BFA, 0x011BFF, "Cn" },
{ 0x011C00, 0x011C08, "Lo" },
{ 0x011C0A, 0x011C2E, "Lo" },
{ 0x011C30, 0x011C36, "Mn" },
{ 0x011C38, 0x011C3D, "Mn" },
{ 0x011C41, 0x011C45, "Po" },
{ 0x011C46, 0x011C4F, "Cn" },
{ 0x011C50, 0x011C59, "Nd" },
{ 0x011C5A, 0x011C6C, "No" },
{ 0x011C6D, 0x011C6F, "Cn" },
{ 0x011C70, 0x011C71, "Po" },
{ 0x011C72, 0x011C8F, "Lo" },
{ 0x011C90, 0x011C91, "Cn" },
{ 0x011C92, 0x011CA7, "Mn" },
{ 0x011CAA, 0x011CB0, "Mn" },
{ 0x011CB2, 0x011CB3, "Mn" },
{ 0x011CB5, 0x011CB6, "Mn" },
{ 0x011CB7, 0x011CFF, "Cn" },
{ 0x011D00, 0x011D06, "Lo" },
{ 0x011D08, 0x011D09, "Lo" },
{ 0x011D0B, 0x011D30, "Lo" },
{ 0x011D31, 0x011D36, "Mn" },
{ 0x011D37, 0x011D39, "Cn" },
{ 0x011D3C, 0x011D3D, "Mn" },
{ 0x011D3F, 0x011D45, "Mn" },
{ 0x011D48, 0x011D4F, "Cn" },
{ 0x011D50, 0x011D59, "Nd" },
{ 0x011D5A, 0x011D5F, "Cn" },
{ 0x011D60, 0x011D65, "Lo" },
{ 0x011D67, 0x011D68, "Lo" },
{ 0x011D6A, 0x011D89, "Lo" },
{ 0x011D8A, 0x011D8E, "Mc" },
{ 0x011D90, 0x011D91, "Mn" },
{ 0x011D93, 0x011D94, "Mc" },
{ 0x011D99, 0x011D9F, "Cn" },
{ 0x011DA0, 0x011DA9, "Nd" },
{ 0x011DAA, 0x011DAF, "Cn" },
{ 0x011DB0, 0x011DD8, "Lo" },
{ 0x011DDA, 0x011DDB, "Lo" },
{ 0x011DDC, 0x011DDF, "Cn" },
{ 0x011DE0, 0x011DE9, "Nd" },
{ 0x011DEA, 0x011EDF, "Cn" },
{ 0x011EE0, 0x011EF2, "Lo" },
{ 0x011EF3, 0x011EF4, "Mn" },
{ 0x011EF5, 0x011EF6, "Mc" },
{ 0x011EF7, 0x011EF8, "Po" },
{ 0x011EF9, 0x011EFF, "Cn" },
{ 0x011F00, 0x011F01, "Mn" },
{ 0x011F04, 0x011F10, "Lo" },
{ 0x011F12, 0x011F33, "Lo" },
{ 0x011F34, 0x011F35, "Mc" },
{ 0x011F36, 0x011F3A, "Mn" },
{ 0x011F3B, 0x011F3D, "Cn" },
{ 0x011F3E, 0x011F3F, "Mc" },
{ 0x011F43, 0x011F4F, "Po" },
{ 0x011F50, 0x011F59, "Nd" },
{ 0x011F5B, 0x011FAF, "Cn" },
{ 0x011FB1, 0x011FBF, "Cn" },
{ 0x011FC0, 0x011FD4, "No" },
{ 0x011FD5, 0x011FDC, "So" },
{ 0x011FDD, 0x011FE0, "Sc" },
{ 0x011FE1, 0x011FF1, "So" },
{ 0x011FF2, 0x011FFE, "Cn" },
{ 0x012000, 0x012399, "Lo" },
{ 0x01239A, 0x0123FF, "Cn" },
{ 0x012400, 0x01246E, "Nl" },
{ 0x012470, 0x012474, "Po" },
{ 0x012475, 0x01247F, "Cn" },
{ 0x012480, 0x012543, "Lo" },
{ 0x012544, 0x012F8F, "Cn" },
{ 0x012F90, 0x012FF0, "Lo" },
{ 0x012FF1, 0x012FF2, "Po" },
{ 0x012FF3, 0x012FFF, "Cn" },
{ 0x013000, 0x01342F, "Lo" },
{ 0x013430, 0x01343F, "Cf" },
{ 0x013441, 0x013446, "Lo" },
{ 0x013447, 0x013455, "Mn" },
{ 0x013456, 0x01345F, "Cn" },
{ 0x013460, 0x0143FA, "Lo" },
{ 0x0143FB, 0x0143FF, "Cn" },
{ 0x014400, 0x014646, "Lo" },
{ 0x014647, 0x0160FF, "Cn" },
{ 0x016100, 0x01611D, "Lo" },
{ 0x01611E, 0x016129, "Mn" },
{ 0x01612A, 0x01612C, "Mc" },
{ 0x01612D, 0x01612F, "Mn" },
{ 0x016130, 0x016139, "Nd" },
{ 0x01613A, 0x0167FF, "Cn" },
{ 0x016800, 0x016A38, "Lo" },
{ 0x016A39, 0x016A3F, "Cn" },
{ 0x016A40, 0x016A5E, "Lo" },
{ 0x016A60, 0x016A69, "Nd" },
{ 0x016A6A, 0x016A6D, "Cn" },
{ 0x016A6E, 0x016A6F, "Po" },
{ 0x016A70, 0x016ABE, "Lo" },
{ 0x016AC0, 0x016AC9, "Nd" },
{ 0x016ACA, 0x016ACF, "Cn" },
{ 0x016AD0, 0x016AED, "Lo" },
{ 0x016AEE, 0x016AEF, "Cn" },
{ 0x016AF0, 0x016AF4, "Mn" },
{ 0x016AF6, 0x016AFF, "Cn" },
{ 0x016B00, 0x016B2F, "Lo" },
{ 0x016B30, 0x016B36, "Mn" },
{ 0x016B37, 0x016B3B, "Po" },
{ 0x016B3C, 0x016B3F, "So" },
{ 0x016B40, 0x016B43, "Lm" },
{ 0x016B46, 0x016B4F, "Cn" },
{ 0x016B50, 0x016B59, "Nd" },
{ 0x016B5B, 0x016B61, "No" },
{ 0x016B63, 0x016B77, "Lo" },
{ 0x016B78, 0x016B7C, "Cn" },
{ 0x016B7D, 0x016B8F, "Lo" },
{ 0x016B90, 0x016D3F, "Cn" },
{ 0x016D40, 0x016D42, "Lm" },
{ 0x016D43, 0x016D6A, "Lo" },
{ 0x016D6B, 0x016D6C, "Lm" },
{ 0x016D6D, 0x016D6F, "Po" },
{ 0x016D70, 0x016D79, "Nd" },
{ 0x016D7A, 0x016E3F, "Cn" },
{ 0x016E40, 0x016E5F, "Lu" },
{ 0x016E60, 0x016E7F, "Ll" },
{ 0x016E80, 0x016E96, "No" },
{ 0x016E97, 0x016E9A, "Po" },
{ 0x016E9B, 0x016E9F, "Cn" },
{ 0x016EA0, 0x016EB8, "Lu" },
{ 0x016EB9, 0x016EBA, "Cn" },
{ 0x016EBB, 0x016ED3, "Ll" },
{ 0x016ED4, 0x016EFF, "Cn" },
{ 0x016F00, 0x016F4A, "Lo" },
{ 0x016F4B, 0x016F4E, "Cn" },
{ 0x016F51, 0x016F87, "Mc" },
{ 0x016F88, 0x016F8E, "Cn" },
{ 0x016F8F, 0x016F92, "Mn" },
{ 0x016F93, 0x016F9F, "Lm" },
{ 0x016FA0, 0x016FDF, "Cn" },
{ 0x016FE0, 0x016FE1, "Lm" },
{ 0x016FE5, 0x016FEF, "Cn" },
{ 0x016FF0, 0x016FF1, "Mc" },
{ 0x016FF2, 0x016FF3, "Lm" },
{ 0x016FF4, 0x016FF6, "Nl" },
{ 0x016FF7, 0x016FFF, "Cn" },
{ 0x017000, 0x018CD5, "Lo" },
{ 0x018CD6, 0x018CFE, "Cn" },
{ 0x018CFF, 0x018D1E, "Lo" },
{ 0x018D1F, 0x018D7F, "Cn" },
{ 0x018D80, 0x018DF2, "Lo" },
{ 0x018DF3, 0x01AFEF, "Cn" },
{ 0x01AFF0, 0x01AFF3, "Lm" },
{ 0x01AFF5, 0x01AFFB, "Lm" },
{ 0x01AFFD, 0x01AFFE, "Lm" },
{ 0x01B000, 0x01B122, "Lo" },
{ 0x01B123, 0x01B131, "Cn" },
{ 0x01B133, 0x01B14F, "Cn" },
{ 0x01B150, 0x01B152, "Lo" },
{ 0x01B153, 0x01B154, "Cn" },
{ 0x01B156, 0x01B163, "Cn" },
{ 0x01B164, 0x01B167, "Lo" },
{ 0x01B168, 0x01B16F, "Cn" },
{ 0x01B170, 0x01B2FB, "Lo" },
{ 0x01B2FC, 0x01BBFF, "Cn" },
{ 0x01BC00, 0x01BC6A, "Lo" },
{ 0x01BC6B, 0x01BC6F, "Cn" },
{ 0x01BC70, 0x01BC7C, "Lo" },
{ 0x01BC7D, 0x01BC7F, "Cn" },
{ 0x01BC80, 0x01BC88, "Lo" },
{ 0x01BC89, 0x01BC8F, "Cn" },
{ 0x01BC90, 0x01BC99, "Lo" },
{ 0x01BC9A, 0x01BC9B, "Cn" },
{ 0x01BC9D, 0x01BC9E, "Mn" },
{ 0x01BCA0, 0x01BCA3, "Cf" },
{ 0x01BCA4, 0x01CBFF, "Cn" },
{ 0x01CC00, 0x01CCEF, "So" },
{ 0x01CCF0, 0x01CCF9, "Nd" },
{ 0x01CCFA, 0x01CCFC, "So" },
{ 0x01CCFD, 0x01CCFF, "Cn" },
{ 0x01CD00, 0x01CEB3, "So" },
{ 0x01CEB4, 0x01CEB9, "Cn" },
{ 0x01CEBA, 0x01CED0, "So" },
{ 0x01CED1, 0x01CEDF, "Cn" },
{ 0x01CEE0, 0x01CEEF, "So" },
{ 0x01CEF1, 0x01CEFF, "Cn" },
{ 0x01CF00, 0x01CF2D, "Mn" },
{ 0x01CF2E, 0x01CF2F, "Cn" },
{ 0x01CF30, 0x01CF46, "Mn" },
{ 0x01CF47, 0x01CF4F, "Cn" },
{ 0x01CF50, 0x01CFC3, "So" },
{ 0x01CFC4, 0x01CFFF, "Cn" },
{ 0x01D000, 0x01D0F5, "So" },
{ 0x01D0F6, 0x01D0FF, "Cn" },
{ 0x01D100, 0x01D126, "So" },
{ 0x01D127, 0x01D128, "Cn" },
{ 0x01D129, 0x01D164, "So" },
{ 0x01D165, 0x01D166, "Mc" },
{ 0x01D167, 0x01D169, "Mn" },
{ 0x01D16A, 0x01D16C, "So" },
{ 0x01D16D, 0x01D172, "Mc" },
{ 0x01D173, 0x01D17A, "Cf" },
{ 0x01D17B, 0x01D182, "Mn" },
{ 0x01D183, 0x01D184, "So" },
{ 0x01D185, 0x01D18B, "Mn" },
{ 0x01D18C, 0x01D1A9, "So" },
{ 0x01D1AA, 0x01D1AD, "Mn" },
{ 0x01D1AE, 0x01D1EA, "So" },
{ 0x01D1EB, 0x01D1FF, "Cn" },
{ 0x01D200, 0x01D241, "So" },
{ 0x01D242, 0x01D244, "Mn" },
{ 0x01D246, 0x01D2BF, "Cn" },
{ 0x01D2C0, 0x01D2D3, "No" },
{ 0x01D2D4, 0x01D2DF, "Cn" },
{ 0x01D2E0, 0x01D2F3, "No" },
{ 0x01D2F4, 0x01D2FF, "Cn" },
{ 0x01D300, 0x01D356, "So" },
{ 0x01D357, 0x01D35F, "Cn" },
{ 0x01D360, 0x01D378, "No" },
{ 0x01D379, 0x01D3FF, "Cn" },
{ 0x01D400, 0x01D419, "Lu" },
{ 0x01D41A, 0x01D433, "Ll" },
{ 0x01D434, 0x01D44D, "Lu" },
{ 0x01D44E, 0x01D454, "Ll" },
{ 0x01D456, 0x01D467, "Ll" },
{ 0x01D468, 0x01D481, "Lu" },
{ 0x01D482, 0x01D49B, "Ll" },
{ 0x01D49E, 0x01D49F, "Lu" },
{ 0x01D4A0, 0x01D4A1, "Cn" },
{ 0x01D4A3, 0x01D4A4, "Cn" },
{ 0x01D4A5, 0x01D4A6, "Lu" },
{ 0x01D4A7, 0x01D4A8, "Cn" },
{ 0x01D4A9, 0x01D4AC, "Lu" },
{ 0x01D4AE, 0x01D4B5, "Lu" },
{ 0x01D4B6, 0x01D4B9, "Ll" },
{ 0x01D4BD, 0x01D4C3, "Ll" },
{ 0x01D4C5, 0x01D4CF, "Ll" },
{ 0x01D4D0, 0x01D4E9, "Lu" },
{ 0x01D4EA, 0x01D503, "Ll" },
{ 0x01D504, 0x01D505, "Lu" },
{ 0x01D507, 0x01D50A, "Lu" },
{ 0x01D50B, 0x01D50C, "Cn" },
{ 0x01D50D, 0x01D514, "Lu" },
{ 0x01D516, 0x01D51C, "Lu" },
{ 0x01D51E, 0x01D537, "Ll" },
{ 0x01D538, 0x01D539, "Lu" },
{ 0x01D53B, 0x01D53E, "Lu" },
{ 0x01D540, 0x01D544, "Lu" },
{ 0x01D547, 0x01D549, "Cn" },
{ 0x01D54A, 0x01D550, "Lu" },
{ 0x01D552, 0x01D56B, "Ll" },
{ 0x01D56C, 0x01D585, "Lu" },
{ 0x01D586, 0x01D59F, "Ll" },
{ 0x01D5A0, 0x01D5B9, "Lu" },
{ 0x01D5BA, 0x01D5D3, "Ll" },
{ 0x01D5D4, 0x01D5ED, "Lu" },
{ 0x01D5EE, 0x01D607, "Ll" },
{ 0x01D608, 0x01D621, "Lu" },
{ 0x01D622, 0x01D63B, "Ll" },
{ 0x01D63C, 0x01D655, "Lu" },
{ 0x01D656, 0x01D66F, "Ll" },
{ 0x01D670, 0x01D689, "Lu" },
{ 0x01D68A, 0x01D6A5, "Ll" },
{ 0x01D6A6, 0x01D6A7, "Cn" },
{ 0x01D6A8, 0x01D6C0, "Lu" },
{ 0x01D6C2, 0x01D6DA, "Ll" },
{ 0x01D6DC, 0x01D6E1, "Ll" },
{ 0x01D6E2, 0x01D6FA, "Lu" },
{ 0x01D6FC, 0x01D714, "Ll" },
{ 0x01D716, 0x01D71B, "Ll" },
{ 0x01D71C, 0x01D734, "Lu" },
{ 0x01D736, 0x01D74E, "Ll" },
{ 0x01D750, 0x01D755, "Ll" },
{ 0x01D756, 0x01D76E, "Lu" },
{ 0x01D770, 0x01D788, "Ll" },
{ 0x01D78A, 0x01D78F, "Ll" },
{ 0x01D790, 0x01D7A8, "Lu" },
{ 0x01D7AA, 0x01D7C2, "Ll" },
{ 0x01D7C4, 0x01D7C9, "Ll" },
{ 0x01D7CC, 0x01D7CD, "Cn" },
{ 0x01D7CE, 0x01D7FF, "Nd" },
{ 0x01D800, 0x01D9FF, "So" },
{ 0x01DA00, 0x01DA36, "Mn" },
{ 0x01DA37, 0x01DA3A, "So" },
{ 0x01DA3B, 0x01DA6C, "Mn" },
{ 0x01DA6D, 0x01DA74, "So" },
{ 0x01DA76, 0x01DA83, "So" },
{ 0x01DA85, 0x01DA86, "So" },
{ 0x01DA87, 0x01DA8B, "Po" },
{ 0x01DA8C, 0x01DA9A, "Cn" },
{ 0x01DA9B, 0x01DA9F, "Mn" },
{ 0x01DAA1, 0x01DAAF, "Mn" },
{ 0x01DAB0, 0x01DEFF, "Cn" },
{ 0x01DF00, 0x01DF09, "Ll" },
{ 0x01DF0B, 0x01DF1E, "Ll" },
{ 0x01DF1F, 0x01DF24, "Cn" },
{ 0x01DF25, 0x01DF2A, "Ll" },
{ 0x01DF2B, 0x01DFFF, "Cn" },
{ 0x01E000, 0x01E006, "Mn" },
{ 0x01E008, 0x01E018, "Mn" },
{ 0x01E019, 0x01E01A, "Cn" },
{ 0x01E01B, 0x01E021, "Mn" },
{ 0x01E023, 0x01E024, "Mn" },
{ 0x01E026, 0x01E02A, "Mn" },
{ 0x01E02B, 0x01E02F, "Cn" },
{ 0x01E030, 0x01E06D, "Lm" },
{ 0x01E06E, 0x01E08E, "Cn" },
{ 0x01E090, 0x01E0FF, "Cn" },
{ 0x01E100, 0x01E12C, "Lo" },
{ 0x01E12D, 0x01E12F, "Cn" },
{ 0x01E130, 0x01E136, "Mn" },
{ 0x01E137, 0x01E13D, "Lm" },
{ 0x01E13E, 0x01E13F, "Cn" },
{ 0x01E140, 0x01E149, "Nd" },
{ 0x01E14A, 0x01E14D, "Cn" },
{ 0x01E150, 0x01E28F, "Cn" },
{ 0x01E290, 0x01E2AD, "Lo" },
{ 0x01E2AF, 0x01E2BF, "Cn" },
{ 0x01E2C0, 0x01E2EB, "Lo" },
{ 0x01E2EC, 0x01E2EF, "Mn" },
{ 0x01E2F0, 0x01E2F9, "Nd" },
{ 0x01E2FA, 0x01E2FE, "Cn" },
{ 0x01E300, 0x01E4CF, "Cn" },
{ 0x01E4D0, 0x01E4EA, "Lo" },
{ 0x01E4EC, 0x01E4EF, "Mn" },
{ 0x01E4F0, 0x01E4F9, "Nd" },
{ 0x01E4FA, 0x01E5CF, "Cn" },
{ 0x01E5D0, 0x01E5ED, "Lo" },
{ 0x01E5EE, 0x01E5EF, "Mn" },
{ 0x01E5F1, 0x01E5FA, "Nd" },
{ 0x01E5FB, 0x01E5FE, "Cn" },
{ 0x01E600, 0x01E6BF, "Cn" },
{ 0x01E6C0, 0x01E6DE, "Lo" },
{ 0x01E6E0, 0x01E6E2, "Lo" },
{ 0x01E6E4, 0x01E6E5, "Lo" },
{ 0x01E6E7, 0x01E6ED, "Lo" },
{ 0x01E6EE, 0x01E6EF, "Mn" },
{ 0x01E6F0, 0x01E6F4, "Lo" },
{ 0x01E6F6, 0x01E6FD, "Cn" },
{ 0x01E700, 0x01E7DF, "Cn" },
{ 0x01E7E0, 0x01E7E6, "Lo" },
{ 0x01E7E8, 0x01E7EB, "Lo" },
{ 0x01E7ED, 0x01E7EE, "Lo" },
{ 0x01E7F0, 0x01E7FE, "Lo" },
{ 0x01E800, 0x01E8C4, "Lo" },
{ 0x01E8C5, 0x01E8C6, "Cn" },
{ 0x01E8C7, 0x01E8CF, "No" },
{ 0x01E8D0, 0x01E8D6, "Mn" },
{ 0x01E8D7, 0x01E8FF, "Cn" },
{ 0x01E900, 0x01E921, "Lu" },
{ 0x01E922, 0x01E943, "Ll" },
{ 0x01E944, 0x01E94A, "Mn" },
{ 0x01E94C, 0x01E94F, "Cn" },
{ 0x01E950, 0x01E959, "Nd" },
{ 0x01E95A, 0x01E95D, "Cn" },
{ 0x01E95E, 0x01E95F, "Po" },
{ 0x01E960, 0x01EC70, "Cn" },
{ 0x01EC71, 0x01ECAB, "No" },
{ 0x01ECAD, 0x01ECAF, "No" },
{ 0x01ECB1, 0x01ECB4, "No" },
{ 0x01ECB5, 0x01ED00, "Cn" },
{ 0x01ED01, 0x01ED2D, "No" },
{ 0x01ED2F, 0x01ED3D, "No" },
{ 0x01ED3E, 0x01EDFF, "Cn" },
{ 0x01EE00, 0x01EE03, "Lo" },
{ 0x01EE05, 0x01EE1F, "Lo" },
{ 0x01EE21, 0x01EE22, "Lo" },
{ 0x01EE25, 0x01EE26, "Cn" },
{ 0x01EE29, 0x01EE32, "Lo" },
{ 0x01EE34, 0x01EE37, "Lo" },
{ 0x01EE3C, 0x01EE41, "Cn" },
{ 0x01EE43, 0x01EE46, "Cn" },
{ 0x01EE4D, 0x01EE4F, "Lo" },
{ 0x01EE51, 0x01EE52, "Lo" },
{ 0x01EE55, 0x01EE56, "Cn" },
{ 0x01EE61, 0x01EE62, "Lo" },
{ 0x01EE65, 0x01EE66, "Cn" },
{ 0x01EE67, 0x01EE6A, "Lo" },
{ 0x01EE6C, 0x01EE72, "Lo" },
{ 0x01EE74, 0x01EE77, "Lo" },
{ 0x01EE79, 0x01EE7C, "Lo" },
{ 0x01EE80, 0x01EE89, "Lo" },
{ 0x01EE8B, 0x01EE9B, "Lo" },
{ 0x01EE9C, 0x01EEA0, "Cn" },
{ 0x01EEA1, 0x01EEA3, "Lo" },
{ 0x01EEA5, 0x01EEA9, "Lo" },
{ 0x01EEAB, 0x01EEBB, "Lo" },
{ 0x01EEBC, 0x01EEEF, "Cn" },
{ 0x01EEF0, 0x01EEF1, "Sm" },
{ 0x01EEF2, 0x01EFFF, "Cn" },
{ 0x01F000, 0x01F02B, "So" },
{ 0x01F02C, 0x01F02F, "Cn" },
{ 0x01F030, 0x01F093, "So" },
{ 0x01F094, 0x01F09F, "Cn" },
{ 0x01F0A0, 0x01F0AE, "So" },
{ 0x01F0AF, 0x01F0B0, "Cn" },
{ 0x01F0B1, 0x01F0BF, "So" },
{ 0x01F0C1, 0x01F0CF, "So" },
{ 0x01F0D1, 0x01F0F5, "So" },
{ 0x01F0F6, 0x01F0FF, "Cn" },
{ 0x01F100, 0x01F10C, "No" },
{ 0x01F10D, 0x01F1AD, "So" },
{ 0x01F1AE, 0x01F1E5, "Cn" },
{ 0x01F1E6, 0x01F202, "So" },
{ 0x01F203, 0x01F20F, "Cn" },
{ 0x01F210, 0x01F23B, "So" },
{ 0x01F23C, 0x01F23F, "Cn" },
{ 0x01F240, 0x01F248, "So" },
{ 0x01F249, 0x01F24F, "Cn" },
{ 0x01F250, 0x01F251, "So" },
{ 0x01F252, 0x01F25F, "Cn" },
{ 0x01F260, 0x01F265, "So" },
{ 0x01F266, 0x01F2FF, "Cn" },
{ 0x01F300, 0x01F3FA, "So" },
{ 0x01F3FB, 0x01F3FF, "Sk" },
{ 0x01F400, 0x01F6D8, "So" },
{ 0x01F6D9, 0x01F6DB, "Cn" },
{ 0x01F6DC, 0x01F6EC, "So" },
{ 0x01F6ED, 0x01F6EF, "Cn" },
{ 0x01F6F0, 0x01F6FC, "So" },
{ 0x01F6FD, 0x01F6FF, "Cn" },
{ 0x01F700, 0x01F7D9, "So" },
{ 0x01F7DA, 0x01F7DF, "Cn" },
{ 0x01F7E0, 0x01F7EB, "So" },
{ 0x01F7EC, 0x01F7EF, "Cn" },
{ 0x01F7F1, 0x01F7FF, "Cn" },
{ 0x01F800, 0x01F80B, "So" },
{ 0x01F80C, 0x01F80F, "Cn" },
{ 0x01F810, 0x01F847, "So" },
{ 0x01F848, 0x01F84F, "Cn" },
{ 0x01F850, 0x01F859, "So" },
{ 0x01F85A, 0x01F85F, "Cn" },
{ 0x01F860, 0x01F887, "So" },
{ 0x01F888, 0x01F88F, "Cn" },
{ 0x01F890, 0x01F8AD, "So" },
{ 0x01F8AE, 0x01F8AF, "Cn" },
{ 0x01F8B0, 0x01F8BB, "So" },
{ 0x01F8BC, 0x01F8BF, "Cn" },
{ 0x01F8C0, 0x01F8C1, "So" },
{ 0x01F8C2, 0x01F8CF, "Cn" },
{ 0x01F8D0, 0x01F8D8, "Sm" },
{ 0x01F8D9, 0x01F8FF, "Cn" },
{ 0x01F900, 0x01FA57, "So" },
{ 0x01FA58, 0x01FA5F, "Cn" },
{ 0x01FA60, 0x01FA6D, "So" },
{ 0x01FA6E, 0x01FA6F, "Cn" },
{ 0x01FA70, 0x01FA7C, "So" },
{ 0x01FA7D, 0x01FA7F, "Cn" },
{ 0x01FA80, 0x01FA8A, "So" },
{ 0x01FA8B, 0x01FA8D, "Cn" },
{ 0x01FA8E, 0x01FAC6, "So" },
{ 0x01FAC9, 0x01FACC, "Cn" },
{ 0x01FACD, 0x01FADC, "So" },
{ 0x01FADD, 0x01FADE, "Cn" },
{ 0x01FADF, 0x01FAEA, "So" },
{ 0x01FAEB, 0x01FAEE, "Cn" },
{ 0x01FAEF, 0x01FAF8, "So" },
{ 0x01FAF9, 0x01FAFF, "Cn" },
{ 0x01FB00, 0x01FB92, "So" },
{ 0x01FB94, 0x01FBEF, "So" },
{ 0x01FBF0, 0x01FBF9, "Nd" },
{ 0x01FBFB, 0x01FFFF, "Cn" },
{ 0x020000, 0x02A6DF, "Lo" },
{ 0x02A6E0, 0x02A6FF, "Cn" },
{ 0x02A700, 0x02B81D, "Lo" },
{ 0x02B81E, 0x02B81F, "Cn" },
{ 0x02B820, 0x02CEAD, "Lo" },
{ 0x02CEAE, 0x02CEAF, "Cn" },
{ 0x02CEB0, 0x02EBE0, "Lo" },
{ 0x02EBE1, 0x02EBEF, "Cn" },
{ 0x02EBF0, 0x02EE5D, "Lo" },
{ 0x02EE5E, 0x02F7FF, "Cn" },
{ 0x02F800, 0x02FA1D, "Lo" },
{ 0x02FA1E, 0x02FFFF, "Cn" },
{ 0x030000, 0x03134A, "Lo" },
{ 0x03134B, 0x03134F, "Cn" },
{ 0x031350, 0x033479, "Lo" },
{ 0x03347A, 0x0E0000, "Cn" },
{ 0x0E0002, 0x0E001F, "Cn" },
{ 0x0E0020, 0x0E007F, "Cf" },
{ 0x0E0080, 0x0E00FF, "Cn" },
{ 0x0E0100, 0x0E01EF, "Mn" },
{ 0x0E01F0, 0x0EFFFF, "Cn" },
{ 0x0F0000, 0x0FFFFD, "Co" },
{ 0x0FFFFE, 0x0FFFFF, "Cn" },
{ 0x100000, 0x10FFFD, "Co" },
{ 0x10FFFE, 0x10FFFF, "Cn" },
},
long_names = {
C = "Other",
Cc = "Control",
Cf = "Format",
Cn = "Unassigned",
Co = "Private_Use",
Cs = "Surrogate",
L = "Letter",
LC = "Cased_Letter",
Ll = "Lowercase_Letter",
Lm = "Modifier_Letter",
Lo = "Other_Letter",
Lt = "Titlecase_Letter",
Lu = "Uppercase_Letter",
M = "Mark",
Mc = "Spacing_Mark",
Me = "Enclosing_Mark",
Mn = "Nonspacing_Mark",
N = "Number",
Nd = "Decimal_Number",
Nl = "Letter_Number",
No = "Other_Number",
P = "Punctuation",
Pc = "Connector_Punctuation",
Pd = "Dash_Punctuation",
Pe = "Close_Punctuation",
Pf = "Final_Punctuation",
Pi = "Initial_Punctuation",
Po = "Other_Punctuation",
Ps = "Open_Punctuation",
S = "Symbol",
Sc = "Currency_Symbol",
Sk = "Modifier_Symbol",
Sm = "Math_Symbol",
So = "Other_Symbol",
Z = "Separator",
Zl = "Line_Separator",
Zp = "Paragraph_Separator",
Zs = "Space_Separator",
},
}
data.ranges.length = #data.ranges
return data
n9bcmzw78nw2si18u2c8t8bwgg4p9d8
Module:Unicode data/Hangul
828
3661
43494
2026-05-13T23:02:25Z
Kambai Akau
8
Created page with "-- Data used to generate the names of characters in the Hangul Syllables block -- (U+AC00 to U+D7A3). local Hangul = {} -- The following leads, vowels, and trails come from here: -- http://www.unicode.org/Public/UNIDATA/Jamo.txt Hangul.leads = { [0] = "G", "GG", "N", "D", "DD", "R", "M", "B", "BB", "S", "SS", "", "J", "JJ", "C", "K", "T", "P", "H" } -- not actually used: Hangul.lead_count = #Hangul.leads + 1 Hangul.vowels = { [0] = "A", "AE", "YA", "YAE", "EO", "E",..."
43494
Scribunto
text/plain
-- Data used to generate the names of characters in the Hangul Syllables block
-- (U+AC00 to U+D7A3).
local Hangul = {}
-- The following leads, vowels, and trails come from here:
-- http://www.unicode.org/Public/UNIDATA/Jamo.txt
Hangul.leads = {
[0] = "G", "GG", "N", "D", "DD", "R", "M", "B", "BB", "S", "SS",
"", "J", "JJ", "C", "K", "T", "P", "H"
}
-- not actually used:
Hangul.lead_count = #Hangul.leads + 1
Hangul.vowels = {
[0] = "A", "AE", "YA", "YAE", "EO", "E", "YEO", "YE", "O", "WA",
"WAE", "OE", "YO", "U", "WEO", "WE", "WI", "YU", "EU", "YI",
"I"
}
Hangul.vowel_count = #Hangul.vowels + 1
Hangul.trails = {
[0] = "", "G", "GG", "GS", "N", "NJ", "NH", "D", "L", "LG", "LM", "LB",
"LS", "LT", "LP", "LH", "M", "B", "BS", "S", "SS", "NG", "J", "C", "K",
"T", "P", "H"
}
Hangul.trail_count = #Hangul.trails + 1
--For the term "final", see [[Syllable#Chinese model]].
Hangul.final_count = Hangul.vowel_count * Hangul.trail_count
return Hangul
3nkgw7i7ttbjq883k4e90d809x4s1n8
Module:User:Erutuon/Unicode
828
3662
43495
2026-05-13T23:03:56Z
Kambai Akau
8
Created page with "local export = {} local Array = require("Module:array") local fun = require("Module:fun") local m_table = require("Module:table") local m_Unicode_data = require("Module:Unicode data") local lookup_name = m_Unicode_data.lookup_name local is_combining = m_Unicode_data.is_combining local capitalize = m_table.listToSet { "latin", "greek", "coptic", "cyrillic", "armenian", "hebrew", "arabic", "syriac", "thaana", "samaritan", "mandaic", "devanagari", "bengali", } -- ... lo..."
43495
Scribunto
text/plain
local export = {}
local Array = require("Module:array")
local fun = require("Module:fun")
local m_table = require("Module:table")
local m_Unicode_data = require("Module:Unicode data")
local lookup_name = m_Unicode_data.lookup_name
local is_combining = m_Unicode_data.is_combining
local capitalize = m_table.listToSet { "latin", "greek", "coptic", "cyrillic",
"armenian", "hebrew", "arabic", "syriac", "thaana", "samaritan", "mandaic",
"devanagari", "bengali", } -- ...
local special = { ipa = "IPA", nko = "NKo", }
local function get_name(char)
return lookup_name(mw.ustring.codepoint(char))
:lower()
:gsub("%S+",
function(word)
if capitalize[word] then
return word:gsub("^%l", string.upper)
else
return special[word]
end
end)
end
function export.get_names(frame)
local text = fun.map(get_name, frame.args[1])
return table.concat(text, ", ")
end
local function exists(pagename)
return mw.title.new(pagename).exists
end
local function safe_exists(pagename)
local success, exists = pcall(exists, pagename)
return success and exists
end
local output_mt = {}
function output_mt:insert(str)
self.n = self.n + 1
self[self.n] = str
end
function output_mt:insert_format(...)
self:insert(string.format(...))
end
output_mt.join = table.concat
output_mt.__index = output_mt
local function Output()
return setmetatable({ n = 0 }, output_mt)
end
function export.show_modules()
local output = Output()
output:insert [[
{| class="wikitable" style="text-align: center;"'
|+ Unicode name and image modules,<br>organized by first three digits of codepoint in hexadecimal base]]
for i = -1, 0xF do
if i >= 0 then
output:insert_format('\n! %X', i)
else
output:insert '\n!'
end
end
output:insert '\n|-'
local function range(start, ending)
if not (start and ending) then
error("Expected two arguments")
end
local output = Array()
for i = start, ending do
output:insert(i)
end
return output
end
local function make_module_row(type, first_two_digits, attribute)
local found_module = false
local output = range(0x0, 0xF):map(
function(i)
local first_three_digits = first_two_digits * 0x10 + i
local module = ('Module:Unicode data/' .. type .. '/%03X')
:format(first_three_digits)
local link
if safe_exists(module) then
link = ("[[%s|" .. type .. "]]"):format(module)
found_module = true
else
link = " "
end
return ('| title="U+%Xxxx"%s | %s')
:format(first_three_digits, attribute, link)
end)
:concat "\n"
return output, found_module
end
local first = true
for first_two_digits = 0x00, 0x0E do
local attribute = not first and ' style="border-top-width: 3px;"'
or ""
local row1, found_module1 = make_module_row("names", first_two_digits, attribute)
local row2, found_module2 = make_module_row("images", first_two_digits, "")
local row3, found_module3 = make_module_row("emoji images", first_two_digits, "")
if found_module1 or found_module2 or found_module3 then
output:insert_format('\n|-\n! rowspan="3"%s | %02Xx\n%s\n|-\n%s\n|-\n%s',
attribute, first_two_digits, row1, row2, row3)
first = false
end
end
output:insert "\n|}"
return output:join()
end
export.show = export.show_modules
return export
4v9ep9a2dsz9icuyqxc081zz5q88t0m
Ta̱mpi̱let:character info/var
10
3663
43496
2026-05-13T23:05:42Z
Kambai Akau
8
Created page with "<div class="NavFrame floatright" style="width:25em;max-width:100%;box-sizing:border-box;text-align:left;margin-bottom:7px;font-size:100%;"> <div class="NavHead">Character variations</div> <div class="NavContent"><!-- -->{{#if:{{{1|}}}|{{character info|{{{1}}}}}}}<!-- -->{{#if:{{{2|}}}|{{character info|{{{2}}}}}}}<!-- -->{{#if:{{{3|}}}|{{character info|{{{3}}}}}}}<!-- -->{{#if:{{{4|}}}|{{character info|{{{4}}}}}}}<!-- -->{{#if:{{{5|}}}|{{character info|{{{5}}}}}}}<!-- -->..."
43496
wikitext
text/x-wiki
<div class="NavFrame floatright" style="width:25em;max-width:100%;box-sizing:border-box;text-align:left;margin-bottom:7px;font-size:100%;">
<div class="NavHead">Character variations</div>
<div class="NavContent"><!--
-->{{#if:{{{1|}}}|{{character info|{{{1}}}}}}}<!--
-->{{#if:{{{2|}}}|{{character info|{{{2}}}}}}}<!--
-->{{#if:{{{3|}}}|{{character info|{{{3}}}}}}}<!--
-->{{#if:{{{4|}}}|{{character info|{{{4}}}}}}}<!--
-->{{#if:{{{5|}}}|{{character info|{{{5}}}}}}}<!--
-->{{#if:{{{6|}}}|{{character info|{{{6}}}}}}}<!--
-->{{#if:{{{7|}}}|{{character info|{{{7}}}}}}}<!--
-->{{#if:{{{8|}}}|{{character info|{{{8}}}}}}}<!--
-->{{#if:{{{9|}}}|{{character info|{{{9}}}}}}}<!--
-->{{#if:{{{10|}}}|{{character info|{{{10}}}}}}}<!--
-->{{#if:{{{11|}}}|{{character info|{{{11}}}}}}}<!--
-->{{#if:{{{12|}}}|{{character info|{{{12}}}}}}}<!--
-->{{#if:{{{13|}}}|{{character info|{{{13}}}}}}}<!--
-->{{#if:{{{14|}}}|{{character info|{{{14}}}}}}}<!--
-->{{#if:{{{15|}}}|{{character info|{{{15}}}}}}}<!--
-->{{#if:{{{16|}}}|{{character info|{{{16}}}}}}}<!--
-->{{#if:{{{17|}}}|{{character info|{{{17}}}}}}}<!--
-->{{#if:{{{18|}}}|{{character info|{{{18}}}}}}}<!--
-->{{#if:{{{19|}}}|{{character info|{{{19}}}}}}}<!--
-->{{#if:{{{20|}}}|{{character info|{{{20}}}}}}}<!--
--></div></div><noinclude>{{documentation}}</noinclude>
suc1c67zg5s2tp088c60v3mkbs6sxfq
Module:JSON data
828
3664
43498
2026-05-13T23:17:25Z
Kambai Akau
8
Created page with "local export = {} -- optimisation: local variable lookup is slightly faster than global lookup local tab_concat, type, tostring, pairs, ipairs = table.concat, type, tostring, pairs, ipairs local function export_str(s) -- rudimentary escaping, to save time return '"' .. tostring(s):gsub('["\\]', '\\%0') .. '"' end local function export_array(tab) local items = {} for key, value in ipairs(tab) do if type(value) == 'string' then items[#items + 1] = export_str(va..."
43498
Scribunto
text/plain
local export = {}
-- optimisation: local variable lookup is slightly faster than global lookup
local tab_concat, type, tostring, pairs, ipairs = table.concat, type, tostring, pairs, ipairs
local function export_str(s)
-- rudimentary escaping, to save time
return '"' .. tostring(s):gsub('["\\]', '\\%0') .. '"'
end
local function export_array(tab)
local items = {}
for key, value in ipairs(tab) do
if type(value) == 'string' then
items[#items + 1] = export_str(value)
elseif type(value) == 'boolean' or type(value) == 'number' then
items[#items + 1] = tostring(value)
else
error("serialisation failed: unsupported array element type '" .. type(value) .. "'")
end
end
return "[" .. tab_concat(items, ",") .. "]"
end
-- the second argument is a rudimentary "schema" which specifies
-- whether a table value at a given key should be serialised
-- as an array or an object; Lua uses the same table type for both
local function export_object(tab, schema)
local items = {}
if tab == nil then
return "null"
end
for key, value in pairs(tab) do
if type(value) == 'string' then
items[#items + 1] = export_str(key) .. ':' .. export_str(value)
elseif type(value) == 'boolean' or type(value) == 'number' then
items[#items + 1] = export_str(key) .. ':' .. tostring(value)
elseif type(value) == 'table' then
if not schema then
error("no schema given for array with table values, key '" .. key .. "'")
end
local ktype = {}
if type(schema) == 'table' then
ktype = schema[key]
end
-- false indicates array, true indicates un-schematised object
if ktype == false then
items[#items + 1] = export_str(key) .. ':' .. export_array(value)
else
items[#items + 1] = export_str(key) .. ':' .. export_object(value, ktype)
end
else
error("serialisation failed: unsupported object value type '" .. type(value) .. "'")
end
end
return "{" .. tab_concat(items, ",") .. "}"
end
function export.export_languages(item_filter, key_filter, skip_nulls)
if type(item_filter) == "table" then
key_filter = {}
local i = 2
while item_filter.args[i] do
if tonumber(item_filter.args[i]) ~= nil then
key_filter[#key_filter + 1] = tonumber(item_filter.args[i])
else
key_filter[#key_filter + 1] = item_filter.args[i]
end
i = i + 1
end
if #key_filter == 0 then
key_filter = nil
end
skip_nulls = require('Module:yesno')(item_filter.args.nulls)
item_filter = item_filter.args[1]
end
item_filter = (item_filter ~= "") and item_filter or function() return true end
if type(item_filter) == 'string' then
if item_filter == "TWO_LETTER" then
function item_filter(key, value)
return #key == 2
end
elseif item_filter == "TWO_THREE_LETTER" then
function item_filter(key, value)
return #key <= 3
end
elseif item_filter == "TWO_THREE_LETTER_REGULAR" then
function item_filter(key, value)
return (#key <= 3) and value.type == 'regular'
end
elseif item_filter:sub(1, 1) == '=' then
local list = {}
for item in mw.text.gsplit(item_filter:sub(2), ',') do
list[item] = true
end
function item_filter(key, value)
return list[key]
end
else
local t = item_filter
function item_filter(key, value)
return value.type == t
end
end
end
local data = mw.loadData("Module:languages/data/all")
local items = {}
-- false indicates array, true indicates un-schematised object (just dump raw)
local schema = {
canonicalName = false,
type = false,
scripts = false,
family = false,
otherNames = false,
ancestors = false,
wikimedia_codes = false,
aliases = false,
varieties = false,
sort_key = true,
entry_name = true
}
for key, value in pairs(data) do
if item_filter(key, value) then
if key_filter then
if #key_filter == 1 then
local item = value[key_filter[1]]
local itsc = schema[key_filter[1]]
if item == nil then
if not skip_nulls then
items[#items + 1] = export_str(key) .. ':null'
end
else
items[#items + 1] = export_str(key) .. ':' ..
((type(item) == "string" and export_str(item))
or (itsc == false and export_array(item))
or export_object(item, true))
end
else
local langobj = {}
for _, fkey in pairs(key_filter) do
langobj[fkey] = value[fkey]
end
items[#items + 1] = export_str(key) .. ':' .. export_object(langobj, schema)
end
else
items[#items + 1] = export_str(key) .. ':' .. export_object(value, schema)
end
end
end
return "{" .. tab_concat(items, ",") .. "}"
end
function export.export_scripts()
local data = mw.loadData("Module:scripts/data")
local items = {}
for key, value in pairs(data) do
items[#items + 1] = export_str(key) .. ':' .. export_object(value, {
canonicalName = false,
characters = false,
systems = false,
otherNames = false,
aliases = false,
varieties = false,
})
end
return "{" .. tab_concat(items, ",") .. "}"
end
function export.export_etymology_languages()
local data = mw.loadData("Module:etymology languages/data")
local items = {}
for key, value in pairs(data) do
items[#items + 1] = export_str(key) .. ':' .. export_object(value, {
canonicalName = false,
parent = false,
wikipedia_article = false,
otherNames = false,
ancestors = false,
aliases = false,
entry_name = false,
varieties = true,
strip_diacritics = { -- this is a bit of a hack...
remove_exceptions = false,
},
})
end
return "{" .. tab_concat(items, ",") .. "}"
end
function export.export_families()
local data = mw.loadData("Module:families/data")
local items = {}
for key, value in pairs(data) do
items[#items + 1] = export_str(key) .. ':' .. export_object(value, {
canonicalName = false,
otherNames = false,
family = false,
aliases = false,
varieties = false
})
end
return "{" .. tab_concat(items, ",") .. "}"
end
function export.export_labels()
local data = mw.loadData("Module:labels/data")
local labels = {}
for key, value in pairs(data.labels) do
if type(value) == "string" then
labels[#labels + 1] = export_str(key) .. ':' .. export_str(value)
else
labels[#labels + 1] = export_str(key) .. ':' .. export_object(value, {
plain_categories = false,
topical_categories = false,
pos_categories = false,
regional_categories = false
})
end
end
return "{" .. tab_concat(labels, ',') .. "}"
end
function export.export_wgs()
local m_wgdata = mw.loadData('Module:workgroup ping/data')
local items = {}
for key, value in pairs(m_wgdata) do
if type(value) == 'string' then
items[#items + 1] = export_str(key) .. ':' .. export_str(value)
else
local item = { desc = value.desc; category = value.category; members = {} }
for _, user in ipairs(value) do
item.members[#item.members + 1] = user
end
items[#items + 1] = export_str(key) .. ':' .. export_object(item, {
members = false
})
end
end
return "{" .. tab_concat(items, ",") .. "}"
end
-- replacement for using the [[mw:API]] to do [[Special:PrefixIndex/Template:langrev/]]
-- TODO: limits?
function export.complete_langname(frame)
local m_langs = mw.loadData("Module:languages/data/all")
local target = frame.args[1]
local items = {}
for code, data in pairs(m_langs) do
for _, name in ipairs(data.names) do
if name:sub(1, #target) == target then
items[#items + 1] = export_str(name) .. ":" .. export_str(code)
end
end
end
return "{" .. tab_concat(items, ",") .. "}"
end
return export
12hmkmvs173hshdu6qbnqo2vzecn4pu
Ta̱mpi̱let:m+
10
3665
43499
2026-05-13T23:19:58Z
Kambai Akau
8
Created page with "<includeonly><onlyinclude>{{safesubst:<noinclude/>#invoke:links/templates|l_term_t|langname=1|face=term}}</onlyinclude></includeonly><!-- -->{{m+|en|term}}{{documentation}}"
43499
wikitext
text/x-wiki
<includeonly><onlyinclude>{{safesubst:<noinclude/>#invoke:links/templates|l_term_t|langname=1|face=term}}</onlyinclude></includeonly><!--
-->{{m+|en|term}}{{documentation}}
24rcsskporjthfd9afqocn7986e9qpg
Ta̱mpi̱let:swp
10
3666
43500
2026-05-13T23:22:00Z
Kambai Akau
8
Redirected page to [[Ta̱mpi̱let:slim-wikipedia]]
43500
wikitext
text/x-wiki
#REDIRECT [[Template:slim-wikipedia]]
h5xlz8iqxo8nvmr246q5phqxsa3v50e
Ta̱mpi̱let:lw
10
3667
43502
2026-05-13T23:26:34Z
Kambai Akau
8
Created page with "{{#invoke:checkparams|warn}}<!-- Validate template parameters --><includeonly><!-- -->{{l<!-- -->|1={{{1}}}<!-- -->|2={{{pref|}}}[[w:{{{1}}}:{{{2}}}|{{{3|{{{2}}}}}}]]{{{suf|}}}{{#ifeq:{{{1|}}}|zh|//}}<!-- -->|tr={{{tr|}}}<!-- -->|t={{{4|{{{t|}}}}}}<!-- -->|sc={{{sc|}}}<!-- -->|lit={{{lit|}}}<!-- -->|pos={{{pos|}}}<!-- -->}}</includeonly><!-- --><noinclude>{{documentation}}</noinclude>"
43502
wikitext
text/x-wiki
{{#invoke:checkparams|warn}}<!-- Validate template parameters
--><includeonly><!--
-->{{l<!--
-->|1={{{1}}}<!--
-->|2={{{pref|}}}[[w:{{{1}}}:{{{2}}}|{{{3|{{{2}}}}}}]]{{{suf|}}}{{#ifeq:{{{1|}}}|zh|//}}<!--
-->|tr={{{tr|}}}<!--
-->|t={{{4|{{{t|}}}}}}<!--
-->|sc={{{sc|}}}<!--
-->|lit={{{lit|}}}<!--
-->|pos={{{pos|}}}<!--
-->}}</includeonly><!--
--><noinclude>{{documentation}}</noinclude>
m3dn3iccmokerwuzp8jrm23cjx9bem1
nta̱m
0
3668
43510
2026-05-14T10:09:25Z
Danjuma Anthony
44
Created page with "=={{-kcg-}}== ===Yei=== * {{IPA|kcg|//}} ** {{audio|kcg|Kcg-á̱tyok.ogg|a=}} ** {{hyphenation|kcg|n|ta̱m}} (a̱ka̱tyong 2) ===A̱lyoot=== # ''a̱nyiung'' '''[[nta̱m]]''' [[na#Ati̱kut|na]] ====Bwan di̱ lilyem ghyáng==== {{trans-top|Á̱sam ba kan nyia̱ nyeang a̱ni}} * [[Dot]]: {{t+|nl|echtgenooten}} * [[Shong]]: {{t+|en|husbands}} {{trans-bottom}} [[Sa:a̱lyoot]] [[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]"
43510
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|//}}
** {{audio|kcg|Kcg-á̱tyok.ogg|a=}}
** {{hyphenation|kcg|n|ta̱m}} (a̱ka̱tyong 2)
===A̱lyoot===
# ''a̱nyiung'' '''[[nta̱m]]''' [[na#Ati̱kut|na]]
====Bwan di̱ lilyem ghyáng====
{{trans-top|Á̱sam ba kan nyia̱ nyeang a̱ni}}
* [[Dot]]: {{t+|nl|echtgenooten}}
* [[Shong]]: {{t+|en|husbands}}
{{trans-bottom}}
[[Sa:a̱lyoot]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
f4sbnjjueft50e6xugi98dshkskff8f
43511
43510
2026-05-14T10:24:29Z
Danjuma Anthony
44
/* Tyap */
43511
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|//}}
** {{audio|kcg|Kcg-nta̱m.ogg|a=}}
** {{hyphenation|kcg|n|ta̱m}} (a̱ka̱tyong 2)
===A̱lyoot===
# ''a̱nyiung'' '''[[nta̱m]]''' [[na#Ati̱kut|na]]
====Bwan di̱ lilyem ghyáng====
{{trans-top|Á̱sam ba kan nyia̱ nyeang a̱ni}}
* [[Dot]]: {{t+|nl|echtgenooten}}
* [[Shong]]: {{t+|en|husbands}}
{{trans-bottom}}
[[Sa:a̱lyoot]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
2bv7atizxmlzmifc94e75mtaji0776b
43512
43511
2026-05-14T10:27:06Z
Danjuma Anthony
44
/* A̱lyoot */
43512
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|//}}
** {{audio|kcg|Kcg-nta̱m.ogg|a=}}
** {{hyphenation|kcg|n|ta̱m}} (a̱ka̱tyong 2)
===A̱lyoot===
# ''a̱nyiung'' '''[[ta̱m]]''' [[na#Ati̱kut|na]]
====Bwan di̱ lilyem ghyáng====
{{trans-top|Ka̱nang a̱yin nat a̱yit}}
* [[Dot]]: {{t+|nl| ?}}
* [[Shong]]: {{t+|en|works}}
{{trans-bottom}}
[[Sa:Ta̱m]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
6syg8lxf2jolrq0772ebnb9rfaocpbg
43513
43512
2026-05-14T10:27:40Z
Danjuma Anthony
44
/* A̱lyoot */
43513
wikitext
text/x-wiki
=={{-kcg-}}==
===Yei===
* {{IPA|kcg|//}}
** {{audio|kcg|Kcg-nta̱m.ogg|a=}}
** {{hyphenation|kcg|n|ta̱m}} (a̱ka̱tyong 2)
===A̱lyoot===
# ''a̱nyiung'' '''[[ta̱m]]''' [[hu#Ati̱kut|hu]]
====Bwan di̱ lilyem ghyáng====
{{trans-top|Ka̱nang a̱yin nat a̱yit}}
* [[Dot]]: {{t+|nl| ?}}
* [[Shong]]: {{t+|en|works}}
{{trans-bottom}}
[[Sa:Ta̱m]]
[[Sa:Swánga̱lyiatwuki-WikiWoordenboek project part 3]]
916pyi4gjbwjhx3zxqikkjjvhb0xmci