[JDEV] Problem with accentuation...
Daniel Veillard
veillard at redhat.com
Thu Mar 15 08:02:52 CST 2001
On Thu, Mar 15, 2001 at 01:24:44PM +0000, Michael Wilson wrote:
> I still don't know why Jabber insists on unescaping all character
> entities though.
Unless I misunderstood, there is a set of good reasons:
- you don't want to have to load and use a DTD to check those
entities (yes it's a major pain on top of SAX and would definitely
be too costly on the servers !).
- you don't want to use an internal subset either (I wonder how many
Jabber clients or servers would choke on this [1])
- all those entities would be used to express single chars which are
cleanly, unambiguously and easily expressible and processable as
character references instead.
- the DTD doesn't define any entities
http://protocol.jabber.org/jabber/jabber.dtd
and is not written in an extensible way which would allow
defining them as extensions.
Of course this conflicts with the possibility of extending message content
with HTML basic (or SMIL basic) but again all the entities they define can
be conveniently replaced by character references.
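
To illustrate that replacement, here is a minimal Python sketch that turns
HTML-style named entities into numeric character references before the text
goes on the wire. The function name to_char_refs is hypothetical, and the
entity table is Python's own html.entities.name2codepoint (the HTML 4 set),
not anything defined by Jabber. The five entities predefined by XML are left
alone since they need no DTD.

```python
import re
from html.entities import name2codepoint

# Entities predefined by XML itself; they need no DTD, so leave them as-is.
XML_PREDEFINED = {"amp", "lt", "gt", "quot", "apos"}

# Matches a named entity reference such as &eacute; (not &#233; or &#xE9;).
_ENTITY_RE = re.compile(r"&([A-Za-z][A-Za-z0-9]*);")


def to_char_refs(text):
    """Replace known HTML named entities with decimal character references.

    Hypothetical helper: unknown names and the XML predefined entities
    are returned unchanged.
    """
    def repl(match):
        name = match.group(1)
        if name in XML_PREDEFINED or name not in name2codepoint:
            return match.group(0)
        return "&#%d;" % name2codepoint[name]

    return _ENTITY_RE.sub(repl, text)
```

For example, to_char_refs("caf&eacute; &amp; th&eacute;") leaves &amp;
untouched and rewrites each &eacute; as &#233;, which any XML parser can
handle without loading a DTD.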
No entities in Jabber please, thanks !
Daniel
[1] and unless I have missed it there is nothing about entities, PIs
or DTDs defined in the Jabber spec. It would be a good idea to
define a similar rule as in SOAP where they are all forbidden
(well, more precisely, DOCTYPE and PIs are forbidden, which means
that no entity can be declared and in this case finding an
entity reference is a well-formedness error).
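
A rough sketch of what such a rule could look like on the receiving side,
using Python's expat bindings. This is only an assumption about how one
might enforce it, not something taken from the Jabber or SOAP specs; the
names ForbiddenConstruct and parse_strict are hypothetical.

```python
import xml.parsers.expat


class ForbiddenConstruct(Exception):
    """Raised when the input contains a DOCTYPE or a processing instruction."""


def parse_strict(data):
    """Parse data, rejecting DOCTYPE declarations and PIs (hypothetical helper).

    With no DOCTYPE allowed, no entity can be declared, so expat itself
    reports any named entity reference other than the five predefined
    ones as an error.
    """
    parser = xml.parsers.expat.ParserCreate()

    def on_doctype(name, system_id, public_id, has_internal_subset):
        raise ForbiddenConstruct("DOCTYPE declaration is not allowed")

    def on_pi(target, pi_data):
        raise ForbiddenConstruct("processing instruction is not allowed")

    parser.StartDoctypeDeclHandler = on_doctype
    parser.ProcessingInstructionHandler = on_pi
    parser.Parse(data, True)
```

A stanza using only character references, such as
<message>caf&#233;</message>, goes through, while <message>&eacute;</message>
fails with an ExpatError and anything carrying a DOCTYPE or a PI fails with
ForbiddenConstruct.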
--
Daniel Veillard | Red Hat Network http://redhat.com/products/network/
veillard at redhat.com | libxml Gnome XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/