| TOC | 
| 
 | 
This document is an Internet-Draft and is in full conformance with all provisions of Section 10 of RFC2026.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.
The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.
This Internet-Draft will expire on April 20, 2003.
Copyright (C) The Internet Society (2002). All Rights Reserved.
This document specifies an xpath1() scheme for use in XPointer-based fragment identifiers. This scheme, like other XPointer Framework[11] schemes, is designed primarily for use with the XML Media Types defined in RFC 3023[5], to identify locations within a given XML representation of a resource. The xpath1() scheme uses XPath 1.0 syntax
| TOC | 
| TOC | 
The XPath 1.0-based xpath1() scheme is intended to be used within the XPointer Framework[11] to support the addressing of nodes within XML documents. Like the xpointer() scheme, the xpath() scheme supports addressing into the internal structures of XML documents and external parsed entities, supporting addressing document's hierarchical structure and choice of its internal parts based on various properties, such as element types, attribute values, character content, and relative position. Unlike the xpointer() scheme[14], the xpath1() scheme only identifies complete nodes and has no support for character-based ranges.
| TOC | 
As the W3C's xpointer() scheme[14] already provides a superset of the functionality provided by the xpath1() scheme, some consideration of why the xpath1() scheme is useful seems worthwhile.
The xpointer() scheme, designed to support the out-of-line linking capabilities of XLink[10], provides support for character ranges which may arbitrarily cross node boundaries. While this is extremely useful for many hypertext applications, it is unnecessary for a wide variety of simpler projects, and XPath 1.0 is generally far more widely supported than the xpointer() scheme.
While the XPointer Framework explicitly supports multiple levels of conformance, the xpointer() scheme states that "Conforming XPointer processors claiming to support the xpointer() scheme must conform to the behavior defined in this specification and may conform to additional XPointer scheme specifications." Conforming xpointer() processors must implement both XPath and the xpointer() scheme's own extensions, and while applications might use only the subset of xpointer() that is pure XPath, processors built for that approach are non-conformant.
The XPointer set of specifications also includes shorthand pointers (based on ID values with their own complications) and support for an element()[13] scheme that is effectively a subset of XPath, but these offer considerably less functionality than XPath.
The xpath1() scheme strikes a balance between the simple implementation but limited functionality of shorthand pointers and the element() scheme, and the complex implementation but great capabilities of the xpointer() scheme. Perhaps more importantly, it strikes that balance using processing capabilities that are already widely deployed.
| TOC | 
The scheme name is "xpath1". The scheme data syntax is as follows; if scheme data in a pointer part with the xpath1() scheme does not conform to the syntax defined in this section, it is an error and the pointer part fails.
xpath1() Scheme Syntax:
ptrpart ::= xpath1( xpath1schemedata ) xpath1schemedata ::= Expr
Expr is as defined in the XPath 1.0 Recommendation[7]. To suport identifiers operating on external parsed entities with multiple root elements or text nodes, the xpath1() scheme extends the XPath data model to permit the root node to contain any sequence of nodes that an element node may contain. (XSLT provides the same extension.)
| TOC | 
The XML namespace[9] context for the XPath should be set by previous XPointer parts using the XPointer xmlns() scheme[12], and the document context is set by the URI containing the XPointer expression.
Evaluating the expression contained in xpath1schemedata in that namespace and document context returns a location-set or an error. If the XPath expression identifies a single node, then the location-set contains a single location corresponding to that node. If the XPath expression identifies a node-set, this XPointer identifies a location-set corresponding to the node-set. If the result of the XPath expression is something other than a node or node-set, the XPointer result is an error.
| TOC | 
MIME Media type registrations should indicate whether or not the xpath1() scheme is applicable to their contents. While this scheme is obviously most directly connected to XML registrations made in accordance with RFC 3023[5], it could conceivably be used with any registration made in accordance with RFC 2046[1] and RFC 2048[2], provided that the registration provides an explicit mapping between an XML structure and the contents of the type.
| TOC | 
At the time of writing, the W3C is developing Version 2.0 of XPath[8]. While this version has many more features, it also has many more dependencies and will require a much more sophisticated processor. Developers who want to use XPath 2.0 for fragment identifiers should define a new XPointer scheme (perhaps xpath2()). All xpath1() scheme processors must reject expressions which do not conform to the syntax and feature set of XPath 1.0[7].
| TOC | 
Current versions of XPath provide support for navigation from the later points in a document to prior points, a useful set of tools but only in contexts where the complete document (or substantial portions of it) remain available throughout expression processing. The xpath1() scheme supports that functionality and therefore makes a purely streaming processor difficult to write. Developers who need to be able to support streaming processing of fragment identifiers should consider or create other schemes.
| TOC | 
This specification normatively depends on the XPointer Framework[11] and XPath 1.0[7]. It also normatively depends on the XPointer xmlns() scheme[12]. XPointer processors claiming to conform to this specification must also conform to the xmlns() specification.
The scheme data for the xpath1() scheme conforms to this specification if it is processable using XPath 1.0[7], with support for multi-rooted external parsed entities if applied in that context.
Conforming XPointer processors claiming to support the xpath1() scheme must conform to the behavior defined in this specification and may conform to additional XPointer scheme specifications, including in particular the xpointer() scheme[14] which is a superset of the xpath1() scheme.
| TOC | 
While it is conceivable that faulty XPath expressions could be used to overflow or overload XPath processors used in processing of the xpath1() scheme data, the author is not aware of any such assaults having taken place. XPath processors should be robust enough to handle or reject XPaths of all kinds in order to process this scheme.
| TOC | 
None.
| TOC | 
| [1] | Freed, N. and N. Borenstein, "Multipurpose Internet Mail Extensions (MIME) Part Two: Media Types", RFC 2046, November 1996. | 
| [2] | Freed, N., Klensin, J. and J. Postel, "Multipurpose Internet Mail Extensions (MIME) Part Four: Registration Procedures", RFC 2048, November 1996. | 
| [3] | Berners-Lee, T., Fielding, R. and L. Masinter, "Uniform Resource Identifiers (URI): Generic Syntax", RFC 2396, August 1998. | 
| [4] | Rose, M., "Writing I-Ds and RFCs using XML", RFC 2629, June 1999. | 
| [5] | Murata, M., St.Laurent, S. and D. Kohn, "XML Media Types", RFC 3023, January 2001. | 
| [6] | Bray, T., Paoli, J. and C. Sperberg-McQueen, "Extensible Markup Language (XML) 1.0", World Wide Web Consortium Recommendation REC-xml, February 1998. | 
| [7] | DeRose, S. and J. Clark, "XML Path Language (XPath) Version 1.0", World Wide Web Consortium Recommendation REC-xpath-19991116, November 1999. | 
| [8] | Berglund, A., Boag, S., Chamberlin, D., Fernandez, M., Kay, M., Robie, J. and J. Simeon, "XML Path Language (XPath) 2.0", World Wide Web Consortium Working Draft WD-xpath20-20020816, November 1999. | 
| [9] | Bray, T., Hollander, D. and A. Layman, "Namespaces in XML", World Wide Web Consortium Recommendation REC-xml-names, January 1999. | 
| [10] | DeRose, S., Maler, E. and D. Orchard, "XML Linking Language (XLink)", World Wide Web Consortium Recommendation XLink, June 2001. | 
| [11] | Grosso, P., Maler, E., Marsh, J. and N. Walsh, "XPointer Framework", World Wide Web Consortium Working Draft XPointer Framework, July 2002. | 
| [12] | DeRose, S., Daniel Jr., R. and E. Maler, "XPointer xmlns() Scheme", World Wide Web Consortium Working Draft XPointer xmlns() Scheme, July 2002. | 
| [13] | Grosso, P., Maler, E., Marsh, J. and N. Walsh, "XPointer element() Scheme", World Wide Web Consortium Working Draft XPointer element() Scheme, July 2002. | 
| [14] | DeRose, S., Daniel Jr., R. and E. Maler, "XPointer xpointer() Scheme", World Wide Web Consortium Working Draft XPointer xpointer() Scheme, July 2002. | 
| TOC | 
| Simon St.Laurent | |
| O'Reilly & Associates | |
| 1259 Dryden Road | |
| Ithaca, New York 14850 | |
| USA | |
| EMail: | simonstl@simonstl.com | 
| URI: | http://www.simonstl.com/ | 
| TOC | 
Thanks to Uche Ogbuji and Eric van der Vlist for inspiration.
| TOC | 
00 - First version.
[To be deleted before publication.]
| TOC | 
Copyright (C) The Internet Society (2002). All Rights Reserved.
This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this paragraph are included on all such copies and derivative works. However, this document itself may not be modified in any way, such as by removing the copyright notice or references to the Internet Society or other Internet organizations, except as needed for the purpose of developing Internet standards in which case the procedures for copyrights defined in the Internet Standards process must be followed, or as required to translate it into languages other than English.
The limited permissions granted above are perpetual and will not be revoked by the Internet Society or its successors or assigns.
This document and the information contained herein is provided on an "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Funding for the RFC Editor function is currently provided by the Internet Society.