HiOA HiOA

Introduction and well-formed documents

Tor Arne Dahl, 22.08.2011

Plan for the summer school

Monday 22 August
Introduction and well-formed documents
Tuesday 23 August
DTDs and valid documents
Wednesday 24 August
Making use of XML

Exercises every day.

Today's overview

The World Wide Web (WWW)

Unstructured text

The deconstructive paradigm of reality and cultural deappropriation John Hubbard Department of English, Oxford University D. Hans Hanfkopf Department of Literature, Miskatonic University, Arkham, Mass. Cultural deappropriation and postdialectic desituationism The primary theme of d'Erlette's1 analysis of the capitalist paradigm of context is the genre, and some would say the paradigm, of preconceptual class. Therefore, Bataille uses the term 'postdialectic desituationism' to denote not theory, as Lyotard would have it, but subtheory. The characteristic theme of the works of Rushdie is a textual totality. But Finnis2 states that the works of Rushdie are not postmodern. Marx uses the term 'the deconstructive paradigm of reality' to denote not, in fact, structuralism, but poststructuralism. Therefore, if postdialectic desituationism holds, we have to choose between cultural deappropriation and neodialectic discourse. Sontag promotes the use of postdialectic desituationism to analyse and attack society. In a sense, a number of narratives concerning cultural deappropriation may be revealed. The premise of capitalist capitalism holds that narrative is created by communication.

(Text randomly generated by the Postmodernism Generator on August 2, 2007.)

Marked-up text

<article> <header1>The deconstructive paradigm of reality and cultural deappropriation</header1> <author>John Hubbard</author> <institution>Department of English, Oxford University</institution> <author>D. Hans Hanfkopf</author> <institution>Department of Literature, Miskatonic University, Arkham, Mass.</institution> <paragraph>Cultural deappropriation and postdialectic desituationism The primary theme of d'Erlette's<note>1</note> analysis of the capitalist paradigm of context is the genre, and some would say the paradigm, of preconceptual class. Therefore, Bataille uses the term <term>'postdialectic desituationism'</term>to denote not theory, as Lyotard would have it, but subtheory. The characteristic theme of the works of Rushdie is a textual totality.</paragraph> <paragraph>But Finnis<note>2</note> states that the works of Rushdie are not postmodern. Marx uses the term <term>'the deconstructive paradigm of reality'</term> to denote not, in fact, structuralism, but poststructuralism.</paragraph> <paragraph>Therefore, if postdialectic desituationism holds, we have to choose between cultural deappropriation and neodialectic discourse. Sontag promotes the use of postdialectic desituationism to analyse and attack society.</paragraph> <paragraph>In a sense, a number of narratives concerning cultural deappropriation may be revealed. The premise of capitalist capitalism holds that narrative is created by communication.</paragraph> </article>

HTML

SGML

SGML: Advantages

SGML: Usage

From SGML to XML

XML

XML applications: Examples 1

The relationship between SGML, XML, HTML and XHTML

(Illustration by Gerd Berget, 2005)

XML applications: Examples 2

The World Wide Web Consortium (W3C)

W3C's goals2

Principles

Vision

The W3C Process

  1. Working draft
  2. Candidate Recommendation
  3. Proposed Recommendation
  4. W3C Recommendation

XML-related W3C Recommendations

The XML specification

XML concepts

<message type="welcome">Welcome to Oslo and Akershus University College of Applied Sciences!</message>

Document types

XML documents

XML editors

Well-formed XML document: An example

<?xml version="1.0" encoding="UTF-8"?>
<books>
   <book number="b1">
      <title>How to Build a Digital Library</title>
      <authors>
         <author>
            <first_name>Ian H.</first_name>
            <family_name>Witten</family_name>
         </author>
         <author>
            <first_name>David</first_name>
            <family_name>Bainbridge</family_name>
         </author>
      </authors>
      <year>2003</year>
      <publisher>Morgan Kaufmann Publishers</publisher>
      <isbn>1-55860-790-0</isbn>
   </book>
   <book number="b2">
      <title>Understanding Digital Libraries</title>
      <edition>2</edition>
      <authors>
         <author>
            <first_name>Michael</first_name>
            <family_name>Lesk</family_name>
         </author>
      </authors>
      <year>2005</year>
      <publisher>Morgan Kaufmann Publishers</publisher>
      <isbn>1-55860-924-5</isbn>
   </book>
</books>

Document tree

Tree view of the XML document

Relationships

Relationships between nodes in the XML document tree

Well-formed XML

Example 1

<message type="greeting">Happy
birthday!</message>
<message type="greeting">Hello!</message>

We can make it well-formed by adding a root element:

<messages>
   <message type="greeting">Happy
   birthday!</message>
   <message type="greeting">Hello!</message>
</messages>

Example 2

<message type="warning"><caution>Beware of <strong>the dog!</caution></strong></message>

We can fix this by nesting the elements correctly:

<message type="warning"><caution>Beware of <strong>the dog!</strong></caution></message>

XML applications versus HTML

The XML declaration

Elements

Attributes

Namespaces

Comments

Processing instructions

Whitespace

Predefined character entities

Exercise

We will be working with digital book reviews in the exercises and the workshop. The first task is to create an XML language for book reviews.

Links

This lecture is loosely based on Chapter 2 in Ray, E. T. (2003). Learning XML (2nd ed.). Sebastopol: O'Reilly & Associates. Online: http://oreilly.com/catalog/learnxml2/chapter/ch02.pdf (PDF) [8 August 2011]

Still in doubt about XML? Check out XML in 10 points.

Notes

1W3C: About the World Wide Web Consortium (W3C). Available: http://www.w3.org/Consortium/ [8 August 2011]

2W3C: About W3C: Goals. Available: http://www.w3.org/Consortium/mission [8 August 2011]

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

Tor Arne Dahl
Tor-Arne.Dahl@hioa.no
Last modified: 19 August 2011