The Hardest Part
Storing markup in markup
Creating an XML format to directly store other XML, especially XML that is itself likely to contain unresolved entity references, is a difficult problem.
CDATA sections not the answer
CDATA sounds like a good idea, but first, it's kind of misleading if you glance at the document, and second, you can't nest CDATA sections. If the entity value contains a CDATA section... oh well.
There are times when I wish every character in a document came with an extra byte or two for color, and I could use colors to paint things like "markup" and "not markup", but this doesn't yet exist.
In the end, I chose to use existing delimiters - << and >> to set off containers for XML. It's still a good idea, but unfortunately the code I wrote to do that doesn't work properly (despite what I said in the paper) and I'm still working on fixing it.
Previous Page <
> Next Page