<br><br><div><span class="gmail_quote">On 8/10/06, <b class="gmail_sendername">Peter Saint-Andre</b> <<a href="mailto:stpeter@jabber.org">stpeter@jabber.org</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Scott Cotton wrote:<br>><br>><br>> On 8/9/06, *Michal vorner Vaner* <<a href="mailto:michal.vaner@kdemail.net">michal.vaner@kdemail.net</a><br>> <mailto:<a href="mailto:michal.vaner@kdemail.net">michal.vaner@kdemail.net
</a>>> wrote:<br>><br>> On Wed, Aug 09, 2006 at 08:34:28PM +0200, Scott Cotton wrote:<br>> > Hi all,<br>> ><br><br>><br>> I wouldn't equate removing text with ignoring it, but this is certainly
<br>> sensible for embedded<br>> dtds. Removing all such restricted content might lead to confusion, if<br>> say a message contains non-default entity references which are standard<br>> in in some common format like xhtml. These may even be crucial to the
<br>> communication (like dollar sign vs. euro) Should those be silently<br>> removed too? If it were up to me, I'd either pass it all through, reject<br>> it all, or return a warning to the initiator to all restricted content.
</blockquote><div>[ignoring restricted xml data]<br> </div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">In RFC 3920, ignore means "treat it as if it did not exist". Probably we
<br>can make this clearer in rfc3921bis -- i.e., what this means both for<br>XML routers (servers) and for the stanza recipient.</blockquote><div><br>Hi,<br><br>I'm still unclear on what "treat as if does not exist" means.
<br>First and foremost, I don't know whether ignoring is <br>passing through untouched and uninterpreted or <br>removing it. <br><br>A smaller more technical issue is that some restricted content,<br>like embedded dtds, has its own structure. Since an implementation
<br>is bound to accept such input (but ignore it), it has to parse it <br>in order to identify it (which hardly counts as ignoring it). What if<br>the restricted input doesn't actually parse according to xml 1.0?<br>Then a server returns a stream error? For example:
<br><!DOCTYPE[<br> <jibberish><br>]> <br>Since it's not a valid embedded DOCTYPE, its not restricted xml and so an implementation is not bound to accept it. But if it were a valid xml 1.0<br>embedded doctype, the implementation must accept the input (parse
<br>it and validate that it's xml 1.0 compliant) and the implementation must ignore it. But by that time, the implementation can't ignore it because it already parsed it. <br><br>Well, enough games :) what is the reason for the assymetry in rfc3920?
<br>I mean why is it that everyone conforming to the protocol MUST <br>use the xml subset which is not restricted, but then again everyone MUST accept and ignore restricted xml? <br> <br><br><br> </div><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Peter<br><br>--<br>Peter Saint-Andre<br>Jabber Software Foundation<br><a href="http://www.jabber.org/people/stpeter.shtml">http://www.jabber.org/people/stpeter.shtml</a><br><br><br><br></blockquote></div><br><br clear="all">
<br>-- <br>scott