[JDEV] Writings from the Journal of TCharron

Scott Robinson scott at tranzoa.com
Wed Aug 4 13:00:42 CDT 1999


Interleaved response.

Scott.

* arh14 at cornell.edu translated into ASCII [Wed, Aug 04, 1999 at 01:23:24PM -0400][<Pine.SOL.3.91.990804131428.3558D-100000 at travelers.mail.cornell.edu>]
> 
> On Wed, 4 Aug 1999, Scott Robinson wrote:
> 
> > What you said is along the lines in my head. I'll spew my thoughts some
> > more, since they have been a bit more refined.
> > 
> > First off, since we love being able to debug manually with telnet, the C/S
> > MUST support ASCII. Moreover, since UTF-8 has ASCII and it is the XML
> > standard, therefore the C/S should support UTF-8. There is really nothing
> > suprising here, but I'll just put that down.
> > 
> > Second, I was waiting for the proper time to discuss UNICODE... which was to
> > be my suggestion. Personally, and I'll admit I have not yet screwed around
> > with expat, although I've received the vibes it is quite difficult to change
> > charsets in mid-stream, I believe that since the XML standard allows for a
> 
> Sorry if I'm thick, but what would be the reason for switching 
> charsets in mid-stream of document parsing?  Wouldn't the entire XML doc be 
> normalized to one standard, and, given a message encoding parameter, the 
> client would decide what it wants to do with the normalized characters?  My 
> understanding is that the XML markup itself should never deviate from a 
> pre-stated charset, but the CDATA might (which, really, the parser doesn't 
> care about, right?).  If a standard is set, it will ultimately be the 
> client's responsibility to make sure all outgoing messages are 
> normalized, and all incoming messages are reconstituted in their favorite 
> Star Trek dialect.
> 

Hmm. Let me think a sec.

Ok, I'm about to make an idioitic comment, but it's only because I'm the
kinda guy that thinks this way. I see no reason not to allow for alternate
characters in XML. I'll allow the point that it would only cause confusion
later on and gives no functionality; however, in some future bizarre
universe everyone _could_ be sending data across whatever we use instead of
sockets in some strange charset. I would build in the functionality for the
_XML_ (not CDATA) being in alternate charsets.

Moving to the current CDATA topic... I believe many messages ago the
suggestion for adding a package for specifying what charset the CDATA is in
was made. There were arguments again, but they were
anti-internationalization ones. The only alternative given was a tag. A
<message charset="charset/unicode>...</message> solution is the nicest one
in my mind.

> > charset different from UTF-8, that the C/S should be able to use that
> > particular feature. I would note, that if the C/S cannot understand UNICODE
> > (just an example) there should be a way of saying it. ala HTTP's "Accept:
> > charset/ascii, charset/utf-8" and "Deny: charset/unicode".
> 
> Should you really rely on the facility of XML to use different charsets?  
> Really the only thing that needs to change charsets is the CDATA of 
> users' messages.  The markup itself never needs to deviate from a set 
> standard encoding.  This standard encoding should be broad enough to be 
> able to store every other encoding clients might want to use.  You don't 
> want to change the nature of the messenger based on the characteristics 
> of the message (if that makes any sense).
> 

I believe my drivel was becoming overlapping. Let me seperate. The
"messenger" should be able to support different charsets and the "message"
inside should be able to be completely different.

> > 
> > Standardizing on UNICODE, though, might be a way to go. I'm not sure, but if
> > the C/S plain receives/sends ASCII, it could just convert inside and
> > everyone could be happy.
> > 
> > The following comments are certified werid.
> > 
> > Scott.
> 
> an interloper,
> Aaron
> 
> _______________________________________________
> jdev mailing list
> jdev at jabber.org
> http://mailman.jabber.org/listinfo/jdev
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 240 bytes
Desc: not available
URL: <https://www.jabber.org/jdev/attachments/19990804/61aeda1c/attachment-0002.pgp>


More information about the JDev mailing list