<! attlist := ATTLIST
character buffer, for names
character buffer, for names
The library and compiler parsers had the interesting distinction of different behavior for nextch (a function for which there are a total of two plausible behaviors, so we know the design space was fully explored.) One of them returned the value of nextch before the increment and one of them the new value.
The library and compiler parsers had the interesting distinction of different behavior for nextch (a function for which there are a total of two plausible behaviors, so we know the design space was fully explored.) One of them returned the value of nextch before the increment and one of them the new value. So to unify code we have to at least temporarily abstract over the nextchs.
callback method invoked by MarkupParser after parsing comment.
callback method invoked by MarkupParser after parsing comment.
content1 ::= '<' content1 | '&' charref ...
'<' content1 ::= ...
[22] prolog ::= XMLDecl? Misc* (doctypedecl Misc*)? [23] XMLDecl ::= ' VersionInfo EncodingDecl? SDDecl? S? '?>' [24] VersionInfo ::= S 'version' Eq ("'" VersionNum "'" | '"' VersionNum '"') [25] Eq ::= S? '=' S? [26] VersionNum ::= '1.0' [27] Misc ::= Comment | PI | S
callback method invoked by MarkupParser after parsing an element, between the elemStart and elemEnd callbacks
callback method invoked by MarkupParser after parsing an element, between the elemStart and elemEnd callbacks
the position in the source file
the prefix
the local name
the attributes (metadata)
true
if the element was previously empty; false
otherwise.
callback method invoked by MarkupParser after end-tag of element.
callback method invoked by MarkupParser after end-tag of element.
the position in the source file
the prefix
the local name
callback method invoked by MarkupParser after start-tag of element.
callback method invoked by MarkupParser after start-tag of element.
the position in the sourcefile
the prefix
the local name
the attributes (metadata)
'<' element ::= xmlTag1 '>' { xmlExpr | '{' simpleExpr '}' } ETag | xmlTag1 '/' '>'
<! element := ELEMENT
<! element := ELEMENT
<! element := ELEMENT
callback method invoked by MarkupParser after parsing entity ref.
callback method invoked by MarkupParser after parsing entity ref.
expanding entity references
externalID ::= SYSTEM S syslit PUBLIC S pubid S syslit
As the current code requires you to call nextch once manually after construction, this method formalizes that suboptimal reality.
As the current code requires you to call nextch once manually after construction, this method formalizes that suboptimal reality.
stack of inputs
stack of inputs
"rec-xml/#ExtSubset" pe references may not occur within markup declarations
"rec-xml/#ExtSubset" pe references may not occur within markup declarations
These are 99% sure to be redundant but refactoring on the safe side.
These are 99% sure to be redundant but refactoring on the safe side.
Name ::= ( Letter | '_' ) (NameChar)*
See [5] of XML 1.0 specification.
NameChar ::= Letter | Digit | '.' | '-' | '_' | ':' | CombiningChar | Extender
See [4] and Appendix B of XML 1.0 specification.
NameStart ::= ( Letter | '_' )
where Letter means in one of the Unicode general
categories { Ll, Lu, Lo, Lt, Nl }
.
We do not allow a name to start with :
.
See [3] and Appendix B of XML 1.0 specification
(#x20 | #x9 | #xD | #xA)+
(#x20 | #x9 | #xD | #xA)
Returns true
if the encoding name is a valid IANA encoding.
Returns true
if the encoding name is a valid IANA encoding.
This method does not verify that there is a decoder available
for this encoding, only that the characters are valid for an
IANA encoding name.
The IANA encoding name.
returns true is this markup handler is validating
returns true is this markup handler is validating
Create a lookahead reader which does not influence the input
Create a lookahead reader which does not influence the input
holds the next character
holds the next character
this method tells ch to get the next character when next called
this method tells ch to get the next character when next called
'N' notationDecl ::= "OTATION"
parses document type declaration and assigns it to instance variable dtd.
parses document type declaration and assigns it to instance variable dtd.
<! parseDTD ::= DOCTYPE name ... >
holds the position in the source file
holds the position in the source file
if true, does not remove surplus whitespace
if true, does not remove surplus whitespace
callback method invoked by MarkupParser after parsing PI.
callback method invoked by MarkupParser after parsing PI.
<? prolog ::= xml S?
// this is a bit more lenient than necessary...
[12] PubidLiteral ::= '"' PubidChar* '"' | "'" (PubidChar - "'")* "'"
append Unicode character to name buffer
append Unicode character to name buffer
Apply a function and return the passed value
Apply a function and return the passed value
Execute body with a variable saved and restored after execution
Execute body with a variable saved and restored after execution
attribute value, terminated by either ' or ".
attribute value, terminated by either ' or ". value may not contain <.
AttValue ::= `'` { _ } `'`
| `"` { _ } `"`
callback method invoked by MarkupParser after parsing text.
callback method invoked by MarkupParser after parsing text.
prolog, but without standalone
prolog, but without standalone
holds temporary values of pos
holds temporary values of pos
attribute value, terminated by either '
or "
.
attribute value, terminated by either '
or "
. value may not contain <
.
either '
or "
parse attribute and create namespace scope, metadata
parse attribute and create namespace scope, metadata
[41] Attributes ::= { S Name Eq AttValue }
'<! CharData ::= [CDATA[ ( {char} - {char}"]]>"{char} ) ']]>' see [15]
CharRef ::= "&#" '0'..'9' {'0'..'9'} ";" | "&#x" '0'..'9'|'A'..'F'|'a'..'f' { hexdigit } ";"
CharRef ::= "&#" '0'..'9' {'0'..'9'} ";" | "&#x" '0'..'9'|'A'..'F'|'a'..'f' { hexdigit } ";"
see [66]
Comment ::= '' see [15]
scan [S] '=' [S]
scan [S] '=' [S]
[42] '<' xmlEndTag ::= '<' '/' Name S? '>'
[42] '<' xmlEndTag ::= '<' '/' Name S? '>'
entity value, terminated by either ' or ".
entity value, terminated by either ' or ". value may not contain <.
AttValue ::= `'` { _ } `'`
| `"` { _ } `"`
actually, Name ::= (Letter | '_' | ':') (NameChar)* but starting with ':' cannot happen Name ::= (Letter | '_') (NameChar)*
actually, Name ::= (Letter | '_' | ':') (NameChar)* but starting with ':' cannot happen Name ::= (Letter | '_') (NameChar)*
see [5] of XML 1.0 specification
pre-condition: ch != ':' // assured by definition of XMLSTART token post-condition: name does neither start, nor end in ':'
'<?' ProcInstr ::= Name [S ({Char} - ({Char}'>?' {Char})]'?>'
'<?' ProcInstr ::= Name [S ({Char} - ({Char}'>?' {Char})]'?>'
see [15]
scan [3] S ::= (#x20 | #x9 | #xD | #xA)+
scan [3] S ::= (#x20 | #x9 | #xD | #xA)+
skip optional space S?
skip optional space S?
parse a start or empty tag.
parse a start or empty tag. [40] STag ::= '<' Name { S Attribute } [S] [44] EmptyElemTag ::= '<' Name { S Attribute } [S]
Take characters from input stream until given String "until" is seen.
Take characters from input stream until given String "until" is seen. Once seen, the accumulated characters are passed along with the current Position to the supplied handler function.
<? prolog ::= xml S ... ?>
(Since version 2.11) This method and its usages will be removed. Use a debugger to debug code.
An xml parser. parses XML and invokes callback methods of a MarkupHandler. Don't forget to call next.ch on a freshly instantiated parser in order to initialize it. If you get the parser from the object method, initialization is already done for you.