XML Simple Tutorial Three

Author：Eve Cole Update Time：2009-07-07 16:09:32

The Future of XML Now you know XML. It's true that the structure is a bit complex, and the DTD has various options for defining what the document can contain. But that's not all.

Consider an industry for which data exchange is important, such as banking. Banks use ownership systems to track transactions internally, but if they use a common XML format on the Web, then they must describe the transaction information to another institution or application (such as Quicken or MS Money). Of course, they can also represent data on Web pages. FYI: This tag does not exist. It's called OFEX, Open Financial Exchange.

Under certain circumstances, if IE 4 on a PC encounters a <SOFTPKG> tag, a function will be initiated to give the user the opportunity to update installed software. If you are using Windows 98, you may have seen this situation, but did not know that it is an XML application.

Here we have three XML applications that look different from the adding machines, typewriters and pencils Andy Grove saw in the 1970s. But similar to the applications that eventually appeared on PCs, the benefits of XML can be described generally as: "When you use human- and machine-readable tags to describe your data, good things happen."

Those good things

happen.

What is it? I have no idea. But I also don't know what the next generation of programs on my PC will look like. As long as the data is tagged in this way, different applications can be generated.

Are you starting to think about how far it might expand?

We have a lot of practical applications of XML to talk about, and I'll be covering them in the near future. Since we are all Internet users, the future will be XSL (Extensible Style Language-
eXtensible Style Language).

By the way, this recipe is indeed my mom's and it's outstanding. If you are using that, add another half cup of grated coconut.

I’m writing this because I genuinely care about what you think of me. My concern is this: if you read my introduction to XML and are ready to start writing your own XML documents. So you start looking for an already established DTD to represent your information. You find one, as shown below:

<!ATTLIST fn

%attr.lang;

value CDATA #FIXED "TEXT">

<!ENTITY % attr.img "

img.type CDATA #REQUIRED

img.data ENTITY #REQUIRED">

Right off the bat you think Jay must be an idiot. He didn't say anything about ATTLIST and ENTITY - whatever they were.

So let’s talk about this, first with a little patience.

The lines above may not look good, but they're actually nothing. They are used in DTDs to define attributes and entities in XML documents. Anyone who knows HTML will know this very well. Attributes are entries with HTML tags that describe the tags more accurately. In the frequently appearing <img src="my.gif" height="20" width="20">, there are two attributes: height and width. As you'll see later, using attributes in XML documents is very similar.

There's nothing new about entities either. If you've used &, you already know the basics. A string surrounded by & and semicolons represents another character or set of characters. (A complete list of ISO entities is available here.)

Of course, attributes and entities in XML have other functions. This inevitably introduces syntax, although not too much. Once you know this, working with XML documents will be effortless.

Simplified Recipes

If you read my introduction to XML, you'll remember that the ingredients in a recipe are represented by simple tags, such as <item>2 cups flour</item>. After writing that article, I was roaming around the web and found another XML document about recipes. The recipe elements are as follows:

<ingredient quantity="2" units="cups">flour</ingredient>

This approach has a practical benefit: it makes it easier to control the data. With the first approach, the <item> tag is used to hold a bunch of different information. If I wanted to extract a list of ingredients without the amounts of each ingredient, I wouldn't do that.

I can achieve similar functionality using the following structure:

<item>flour

This can be handled, but there are two problems: First, the item element contains mixed content: text and other markup. I quickly discovered that this structure should be avoided whenever possible. The second is that markers have almost no independent meaning. It's hard to imagine a situation where there are only units but no actual components. These items can be described simply, I prefer to think of them as properties.

The first thing to note is that the attribute names, quantities and units are only meaningful when processed by an application that can translate them.

The DTD should be told to allow it before being included in a valid document. For the ingredient element above, we only included the following code in the DTD:

<!ELEMENT ingredient #PCDATA>