Apparent errors have been silently corrected.
GRETIL normalizes all contributed texts consistently for each language.
Sanskrit texts are normalized in accordance with the scheme of the International Alphabet of Sanskrit Transliteration (IAST) in order to facilitate word search across the GRETIL corpus without additional transformations.
All characters in the contributed text file that were fully equivalent to an IAST-conformant character have been made to conform to the character list below.
Non-conformant characters that carry additional information, including accents, capitalization, and whitespace, have been preserved in orig elements inside choice elements, with their IAST-conformant equivalents in reg elements so that a plain text can be created.
This additional information is available only in those transformations of this file that display the original information.
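The choice/orig/reg mechanism described above can be illustrated with a minimal Python sketch. The fragment, the function name plain_text, and the omission of the TEI namespace are assumptions made for illustration; they are not taken from any GRETIL file or tool.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment, not taken from a GRETIL file. Real TEI files
# declare the TEI namespace, which is omitted here for brevity.
SAMPLE = (
    "<l>"
    "<choice><orig>Dharma</orig><reg>dharma</reg></choice>"
    "kṣetre kurukṣetre"
    "</l>"
)


def plain_text(elem, keep="reg"):
    """Return the text of elem, keeping only one branch of each choice.

    keep="reg" yields the regularized (IAST-conformant) reading,
    keep="orig" yields the reading of the contributed file.
    """
    drop = "orig" if keep == "reg" else "reg"
    parts = [elem.text or ""]
    for child in elem:
        if child.tag != drop:
            parts.append(plain_text(child, keep))
        # Text following a child belongs to the parent and is always kept.
        parts.append(child.tail or "")
    return "".join(parts)


line = ET.fromstring(SAMPLE)
regularized = plain_text(line)             # plain text for searching
original = plain_text(line, keep="orig")   # display of the original form
```

Under these assumptions, regularized holds the lower-case searchable form, while original retains the capitalization of the contributed file.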
Characters Used for the Transliteration of Sanskrit according to IAST
Character | Character Name | Unicode Code Point
a | LATIN SMALL LETTER A | U+0061
ā | LATIN SMALL LETTER A WITH MACRON | U+0101
i | LATIN SMALL LETTER I | U+0069
ī | LATIN SMALL LETTER I WITH MACRON | U+012B
u | LATIN SMALL LETTER U | U+0075
ū | LATIN SMALL LETTER U WITH MACRON | U+016B
ṛ | LATIN SMALL LETTER R WITH DOT BELOW | U+1E5B
ṝ | LATIN SMALL LETTER R WITH DOT BELOW AND MACRON | U+1E5D
ḷ | LATIN SMALL LETTER L WITH DOT BELOW | U+1E37
ḹ | LATIN SMALL LETTER L WITH DOT BELOW AND MACRON | U+1E39
e | LATIN SMALL LETTER E | U+0065
o | LATIN SMALL LETTER O | U+006F
ṃ | LATIN SMALL LETTER M WITH DOT BELOW | U+1E43
ḥ | LATIN SMALL LETTER H WITH DOT BELOW | U+1E25
' | APOSTROPHE | U+0027
k | LATIN SMALL LETTER K | U+006B
g | LATIN SMALL LETTER G | U+0067
ṅ | LATIN SMALL LETTER N WITH DOT ABOVE | U+1E45
c | LATIN SMALL LETTER C | U+0063
j | LATIN SMALL LETTER J | U+006A
ñ | LATIN SMALL LETTER N WITH TILDE | U+00F1
ṭ | LATIN SMALL LETTER T WITH DOT BELOW | U+1E6D
ḍ | LATIN SMALL LETTER D WITH DOT BELOW | U+1E0D
ṇ | LATIN SMALL LETTER N WITH DOT BELOW | U+1E47
t | LATIN SMALL LETTER T | U+0074
d | LATIN SMALL LETTER D | U+0064
n | LATIN SMALL LETTER N | U+006E
p | LATIN SMALL LETTER P | U+0070
b | LATIN SMALL LETTER B | U+0062
m | LATIN SMALL LETTER M | U+006D
y | LATIN SMALL LETTER Y | U+0079
r | LATIN SMALL LETTER R | U+0072
l | LATIN SMALL LETTER L | U+006C
v | LATIN SMALL LETTER V | U+0076
ś | LATIN SMALL LETTER S WITH ACUTE | U+015B
ṣ | LATIN SMALL LETTER S WITH DOT BELOW | U+1E63
s | LATIN SMALL LETTER S | U+0073
h | LATIN SMALL LETTER H | U+0068
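A text can be checked against the character list above with a short Python sketch. The function name non_conformant and the decision to allow space and newline as separators are assumptions for illustration. Input is normalized to NFC first, on the assumption that decomposed sequences (a base letter followed by combining marks) should count as equivalent to the precomposed characters in the list.

```python
import unicodedata

# Code points copied from the character list above (38 entries).
IAST_CODE_POINTS = [
    0x0061, 0x0101, 0x0069, 0x012B, 0x0075, 0x016B, 0x1E5B, 0x1E5D,
    0x1E37, 0x1E39, 0x0065, 0x006F, 0x1E43, 0x1E25, 0x0027,
    0x006B, 0x0067, 0x1E45, 0x0063, 0x006A, 0x00F1,
    0x1E6D, 0x1E0D, 0x1E47, 0x0074, 0x0064, 0x006E,
    0x0070, 0x0062, 0x006D, 0x0079, 0x0072, 0x006C, 0x0076,
    0x015B, 0x1E63, 0x0073, 0x0068,
]
IAST_CHARS = {chr(cp) for cp in IAST_CODE_POINTS}


def non_conformant(text, extra_allowed=" \n"):
    """Return the sorted distinct characters of text outside the IAST list.

    extra_allowed holds separators tolerated in addition to the list;
    allowing space and newline is an assumption of this sketch.
    """
    # Compose base letter + combining marks into precomposed characters.
    normalized = unicodedata.normalize("NFC", text)
    allowed = IAST_CHARS | set(extra_allowed)
    return sorted(set(normalized) - allowed)


# A capital letter and a comma are reported; the IAST letters are not.
offenders = non_conformant("Dharma, kṛṣṇa")
```

Characters reported by such a check are candidates for the orig/reg treatment described earlier rather than for silent replacement.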
If indicated unambiguously in the contributed text file, quotation marks have been replaced with the q element and, where a reference was provided, nested inside a cit element with the reference given in a ref element after the quote.
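The following Python sketch shows how quotations and their references might be read back from this markup. The sample fragment, the placeholder reference "Example 1.1", and the function name quotations are hypothetical, and the TEI namespace is omitted for brevity.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment: one quotation with a reference (inside cit)
# and one without. "Example 1.1" is a placeholder, not a real citation.
SAMPLE = (
    "<p>yathoktam "
    "<cit><q>tat tvam asi</q> <ref>Example 1.1</ref></cit>"
    " iti <q>evam eva</q></p>"
)


def quotations(root):
    """Return (quote text, reference or None) pairs in document order."""
    # ElementTree has no parent pointers, so build a child -> parent map.
    parent = {child: p for p in root.iter() for child in p}
    pairs = []
    for q in root.iter("q"):
        holder = parent.get(q)
        ref = None
        if holder is not None and holder.tag == "cit":
            ref = holder.find("ref")
        ref_text = None if ref is None else "".join(ref.itertext())
        pairs.append(("".join(q.itertext()), ref_text))
    return pairs


pairs = quotations(ET.fromstring(SAMPLE))
```

A q element outside any cit is returned with None as its reference, which mirrors the case where the contributed file provided no reference.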
Unnumbered div elements are used to structure the text, using the type and n attributes in accordance with the refsDecl.
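As an illustration of how the type and n attributes can drive referencing, the Python sketch below derives a reference for each innermost div. The type values, the dotted reference format, and the function name div_references are assumptions; the actual scheme is whatever the refsDecl of the file defines.

```python
import xml.etree.ElementTree as ET

# Hypothetical structure; the type values and text are placeholders.
SAMPLE = (
    '<body>'
    '<div type="chapter" n="1">'
    '<div type="verse" n="1"><l>ka</l></div>'
    '<div type="verse" n="2"><l>kha</l></div>'
    '</div>'
    '<div type="chapter" n="2">'
    '<div type="verse" n="1"><l>ga</l></div>'
    '</div>'
    '</body>'
)


def div_references(elem, prefix=()):
    """Return dotted references (e.g. "1.2") for the innermost div elements."""
    refs = []
    for div in elem.findall("div"):          # direct child divs only
        path = prefix + (div.get("n", "?"),)  # "?" marks a missing n
        nested = div_references(div, path)
        # A div without nested divs is a leaf and gets its own reference.
        refs.extend(nested if nested else [".".join(path)])
    return refs


references = div_references(ET.fromstring(SAMPLE))
```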
Interpretive markup, which is visible only in analytic transformations of this file, consists of:
- Highlighting in hi elements,
- Corruptions in sic elements,
- Remarks in note elements with the resp attribute identifying the agency.
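An analytic transformation needs to locate these three element types. The Python sketch below takes an inventory of them; the sample fragment, the resp value "#ed", and the function name interpretive_markup are hypothetical, and the TEI namespace is omitted for brevity.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment containing all three kinds of interpretive markup.
SAMPLE = (
    '<p>rāmo <hi>vanaṃ</hi> <sic>gacchatti</sic>'
    '<note resp="#ed">read gacchati</note></p>'
)

INTERPRETIVE = ("hi", "sic", "note")


def interpretive_markup(root):
    """Return (tag, text, resp) for each hi, sic, and note element.

    resp is None where the attribute is absent, as is expected for
    hi and sic in this sketch.
    """
    found = []
    for elem in root.iter():
        if elem.tag in INTERPRETIVE:
            found.append((elem.tag, "".join(elem.itertext()), elem.get("resp")))
    return found


inventory = interpretive_markup(ET.fromstring(SAMPLE))
```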