Differences

This shows you the differences between two versions of the page.

annotation [2016/07/05 20:21]
gatherk
annotation [2016/09/13 11:28]
kleiberi
Line 1: Line 1:
 ====== Annotation ====== ====== Annotation ======
 ### ###
-XML-markup ​includes the linguistic levels ​of descriptions and single grammar topics, as well as all kinds of judgemental statements on language use. +XML-markup ​consists ​of two kinds of annotation. Structural markup (Markup I) contains elements such as paragraphs, headings, and tables. Project-specific markup (Markup II) contains elements relevant to network analyses, e.g. referenced grammarians.
-\\+
 \\ \\
 The XML structure is defined by a [[doc:​schema|documented RELAX NG schema]] that is provided alongside the corpus files. ​ The XML structure is defined by a [[doc:​schema|documented RELAX NG schema]] that is provided alongside the corpus files. ​
 +###
  
-### 
 ===== Annotation Philosophy ===== ===== Annotation Philosophy =====
-### +The idea behind the annotation of this corpus is that we only preserve relevant information instead of trying to copy the grammar book's layout as closely as possible. We, for instance, mark italicised elements if they have a function in the grammar, e.g. emphasis or highlighting. Lists, tables and tree diagrams are rebuilt as structural elements, preserving the logic behind them, and not necessarily their layout.\\
-Die idea behind the annotation of this corpus is that we only preserve relevant information instead of trying to copy the grammar book's layout as closely as possible. We, for instance, mark italicised elements if they have a function in the grammar, e.g. emphasis or highlighting. Lists, tables and tree diagrams are rebuilt as structural elements, preserving the logic behind them, and not necessarily their layout.\\ ​\\ +
-The planned annotation of references to other authors or grammar books will help us to make the networks visible. Furthermore,​ the markup of judgemental statements will illustrate a grammar book's degree of prescriptivism and might also hint at its potential for innovation.\\ \\ +
-Attributes of elements provide additional information that is not part of the text itself, e.g. hierarchies of headings, and different kinds of paragraph. \\ \\  +
-###+
  
-===== Tags ===== +In the following, the most important and most frequent elements are documented and exemplified. A list of all available tags and their attributes is provided [[taglist|here]].
-A list of all [[taglist|available tags and their attributes]]. ​+
  
 +===== Markup I: Structural Annotation =====
 ==== Inline Elements ==== ==== Inline Elements ====
 === "​bold",​ "​italic",​ "​underline"​ - Font Weight === === "​bold",​ "​italic",​ "​underline"​ - Font Weight ===
Line 39: Line 34:
  
 Footnotes are inserted where the footnote indicator (e.g. Asterisk, Dagger, etc.) occurs. If footnotes continue beyond pagebreaks, the pagebreak is omitted within the footnote because it exists within the main text and should not be doubled. Footnotes are inserted where the footnote indicator (e.g. Asterisk, Dagger, etc.) occurs. If footnotes continue beyond pagebreaks, the pagebreak is omitted within the footnote because it exists within the main text and should not be doubled.
-\\ \\ \\ 
  
-==== Structural ​Elements ====+==== Tabular ​Elements ====
 === "​list"​ - Tree Diagrams === === "​list"​ - Tree Diagrams ===
 Lists can either be "​simple",​ "​bulleted",​ or "​numbered"​. If the list is "​numbered",​ the label element is required. Lists can either be "​simple",​ "​bulleted",​ or "​numbered"​. If the list is "​numbered",​ the label element is required.
Line 114: Line 108:
 </​tree>​ </​tree>​
 </​code>​ </​code>​
-\\ \\  
  
-==== Evaluative and Normative Utterances ​==== +==== Miscellaneous ​==== 
-Evaluative and normative statements by the author on other authors, grammars, society, etc. Single words, phrases, ​or sentences.+=== "​paragraph"​ - Paragraph === 
 +A paragraph is a self-contained structural unit which denotes one coherent line of thought ​or idea
  
 <code xml> <code xml>
-<judgement tendency="positive" type="praise" addressee_explicit="KiganJohn" addresse_implicit="KiganJohn">Appraisal</judgement>+<paragraph>ALEXANDER IRELAND &amp; CO. Pall Mall Court Manchester, propose to issue, at intervals, a SERIES OF SCHOOL BOOKS, Under the above title.</paragraph>
 </​code>​ </​code>​
-\\ \\  
  
-==== References ​and Quotations ==== +=== "​heading" ​and "heading_undefined" - Headings ​=== 
-=== "quotation" - Quotations=== +There are two types of heading elements - hierarchical ones and undefined ones. The hierarchical depth (level) is theoretically unlimited.
-The attribute "​source_added"​ marks if "​title"​ or "​author"​ have been added by the editors+
  
 <code xml> <code xml>
-<quotation author="Surname, First name" ​title="Title" source_added="​0">A quotation</quotation>+<heading level="1">PART I. - OF WORDS.</​heading>​ 
 +<heading level="2">Chapter I. Nouns.</​heading>​ 
 + 
 +<​heading_undefined>​An Easy English Grammar for Beginners; being a Plain Doctrine of Words and Sentences.</heading_undefined>
 </​code>​ </​code>​
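As a rough illustration of how the "level" attribute might be consumed, the following Python sketch builds an indented outline from the hierarchical headings and skips the undefined ones. The `<text>` wrapper element and the function name `outline` are assumptions made for this example only; they are not taken from the schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: the <text> wrapper is an assumption, added only to
# obtain a well-formed document. The headings come from the example above.
SAMPLE = """<text>
<heading level="1">PART I. - OF WORDS.</heading>
<heading level="2">Chapter I. Nouns.</heading>
<heading_undefined>An Easy English Grammar for Beginners</heading_undefined>
</text>"""


def outline(xml_source):
    """Return one indented line per hierarchical heading, in document order."""
    root = ET.fromstring(xml_source)
    lines = []
    # iter("heading") matches only <heading>, not <heading_undefined>.
    for heading in root.iter("heading"):
        level = int(heading.get("level", "1"))
        title = "".join(heading.itertext()).strip()
        lines.append("  " * (level - 1) + title)
    return lines


print("\n".join(outline(SAMPLE)))
```

Because the depth is theoretically unlimited, the sketch derives the indentation from the attribute value instead of hard-coding a fixed number of levels.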
  
-=== "reference" - References=== +=== "pagebreak" - Pagebreaks ​=== 
-Indicates a referenced author. ​The element ​encapsulates ​the author'​s name and title.+The pagebreak ​element ​denotes ​the break of the physical page in the original documentThe "​page_no"​ attribute contains the number of the new page as given in the original text. 
 <code xml> <code xml>
-<reference referenced="Referenced Author" ​referencing="​Referencing Author"​ type="​dedication"​ judgemental="​0"​ source="​Source (Year)">​Dedication<​/reference>+<pagebreak page_no="12" />
 </​code>​ </​code>​
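A minimal Python sketch of how the "page_no" attribute could be read back from a file. The `<text>` wrapper, the sample text, the second page number, and the placement of pagebreaks inside paragraphs are assumptions for illustration only.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: wrapper element, text, and page 13 are illustrative only.
SAMPLE = """<text>
<paragraph>End of one page.<pagebreak page_no="12" />Start of the next.</paragraph>
<paragraph>More text.<pagebreak page_no="13" /></paragraph>
</text>"""


def page_numbers(xml_source):
    """Return the page_no values of all pagebreaks in document order."""
    root = ET.fromstring(xml_source)
    # Values are kept as strings, since page numbers in the original
    # text are not guaranteed to be numeric.
    return [pb.get("page_no") for pb in root.iter("pagebreak")]


print(page_numbers(SAMPLE))
```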
-\\ \\  
  
-==== Miscellaneous ==== 
 === "​ed_note"​ - Editor'​s Note === === "​ed_note"​ - Editor'​s Note ===
 //​Attributes://​ type = addition | correction | omission | note //​Attributes://​ type = addition | correction | omission | note
Line 160: Line 154:
 </​code>​ </​code>​
  
 +\\
 +
 +===== Markup II: Project Specific Annotation =====
 +==== Evaluative and Normative Utterances ====
 +Marks evaluative and normative statements by the author on other authors, grammars, society, etc. The marked span may be a single word, a phrase, or a sentence.
 +
 +<code xml>
 +<judgement tendency="positive" type="praise" addressee_explicit="Kigan, John" addresse_implicit="Kigan, John">Appraisal</judgement>
 +</​code>​
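One way such markup might be used to gauge a grammar's evaluative stance is to tally judgement elements by their "tendency" attribute. The Python sketch below assumes a hypothetical `<text>` wrapper and invented sample sentences; only the element and attribute names are taken from the example above.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample: wrapper element and sentence text are illustrative only.
SAMPLE = """<text>
<paragraph>A <judgement tendency="positive" type="praise"
 addressee_explicit="Kigan, John" addresse_implicit="Kigan, John">most
 useful</judgement> treatise.</paragraph>
<paragraph>An <judgement tendency="positive" type="praise"
 addressee_explicit="Kigan, John" addresse_implicit="Kigan, John">excellent</judgement>
 arrangement.</paragraph>
</text>"""


def judgement_counts(xml_source):
    """Count judgement elements per value of the tendency attribute."""
    root = ET.fromstring(xml_source)
    return Counter(j.get("tendency") for j in root.iter("judgement"))


print(judgement_counts(SAMPLE))
```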
 +
 +==== References and Quotations ====
 +=== "​quotation"​ - Quotations===
 +The attribute "source_added" indicates whether "title" or "author" has been added by the editors.
 +
 +<code xml>
 +<quotation author="Surname, First name" title="Title" source_added="0">A quotation</quotation>
 +</​code>​
 +
 +=== "​reference"​ - References===
 +Indicates a referenced author. The element encapsulates the author'​s name and title.
 +<code xml>
 +<reference referenced="Referenced Author" referencing="Referencing Author" type="dedication" judgemental="0" source="Source (Year)">Dedication</reference>
 +</​code>​
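Since the project-specific markup is meant to support network analyses, the reference elements could, for instance, be turned into directed edges from the referencing to the referenced author. The Python sketch below is one possible reading, not the project's actual tooling; the `<text>` wrapper and the function name `reference_edges` are assumptions.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: the <text> wrapper is an assumption. The reference
# element reuses the placeholder values from the example above.
SAMPLE = """<text>
<paragraph><reference referenced="Referenced Author"
 referencing="Referencing Author" type="dedication" judgemental="0"
 source="Source (Year)">Dedication</reference></paragraph>
</text>"""


def reference_edges(xml_source):
    """Return (referencing, referenced) pairs, one per reference element."""
    root = ET.fromstring(xml_source)
    return [
        (ref.get("referencing"), ref.get("referenced"))
        for ref in root.iter("reference")
    ]


print(reference_edges(SAMPLE))
```

The resulting list of pairs can be passed to any graph tool as an edge list; the "type" and "judgemental" attributes could be carried along as edge labels if needed.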