closed
4.2.1.1
2005-05-28T19:12:26.618-06:00
2004-06-12T21:31:53.009-06:00
TSD4.2
Archaeological Markup Language (ArchaeoML), version 0.9, February 2006.
Created by David Schloen of the University of Chicago.
DictionaryUnit document type.
A DictionaryUnit document can represent a volume of a multivolume dictionary, a fascicle within a volume, or a single dictionary entry.
A complete dictionary is represented by a Tree document that organizes DictionaryUnit documents into a hierarchy.
When used as the starting point of a display, indicates how the content is to be formatted.
Values include: tabbed, linear, index.
The exportIntermediateURI refers to an XSLT stylesheet used for converting from the underlying XSTAR database format to an intermediate XML format that is used for exchange and further customized processing.
The importIntermediateURI refers to an XSLT stylesheet used for converting from the intermediate XML format to the underlying XSTAR database format.
The outputIntermediateURI refers to an XSL:FO stylesheet used for generating presentation output (PDF/RTF/HTML/Java graphics) from the intermediate XML format.
The outputDatabaseURI refers to an XSL:FO stylesheet used for generating presentation output (PDF/RTF/HTML/Java graphics) directly from the underlying XSTAR database format.
A DictionaryUnit may represent a single dictionary entry, described by the entry element.
If a DictionaryUnit represents a group of dictionary entries (i.e., an entire dictionary, a volume, or a fascicle), then the entry element will be absent.
The citation form of a lemma will often be a phonetic transcription using the Latin alphabet (perhaps with special diacritics) or the International Phonetic Alphabet.
This transcription is stored in a string element. In some cases it will not be a full phonetic transcription but rather a Latin transliteration of consonants without vowels (e.g., Egyptian).
The citation form of a lemma may be rendered in the native script in addition to, or instead of, being rendered in Latin phonetic transcription.
For example, the citation forms of dictionary words in Hindi or Marathi may be given in both Devanagari script and in Latin phonetic transcription.
Note that in an English dictionary, the nativeForm element would contain the word as spelled normally, in distinction from the phonetic transcription of the word.
In some cases the native-script form of a dictionary lemma is epigraphically complex (e.g., Egyptian hieroglyphs) and needs to be represented by a hierarchy of EpigraphicUnit documents rather than by string elements.
This epigraphic hierarchy is not part of a text and is not referenced elsewhere in the database. It represents the abstract citation form of a word rather than a particular instance of the word in a text.
A meaning may have submeanings, which are represented by recursive child elements.
The optional language attribute indicates the modern language in which the dictionary unit is written.
The grammaticalForms element can be used strictly as a grouping element when necessary for proper organization of types.
A text citation is linked to a meaningful discourse unit (a phrase, clause, or sentence), but the actual sign-by-sign transliteration (as opposed to a transcription) is obtained by following links from the discourse unit to the relevant epigraphic units.
nestedVariableIndex
nestedValueIndex
nestedDocIdIndex
A reference element contains reference information about a textual excerpt that is cited somewhere within the dictionary entry. A reference element can have optional docID links to Text, Bibliography, or other types of documents.
This represents links to discourse units (e.g. words) in texts in which the cited example or form appears.
The source element can have mixed content, so it can contain just plain text with no child elements, or if special formatting is needed, it can contain string elements.
Alternatively, it may point to a docID that references a Text document.
The location element can have mixed content, so it can contain just plain text with no child elements, or if special formatting is needed, it can contain string elements.
The category element can have mixed content, so it can contain just plain text with no child elements, or if special formatting is needed, it can contain string elements.
The period element can have mixed content, so it can contain just plain text with no child elements, or if special formatting is needed, it can contain string elements.
nestedLinkIndex