[jdev] Ok, here's my tiny little Perl script

John Talbot jtalbot at proionta.gr
Thu Sep 29 13:17:42 CDT 2005


Tijl Houtbeckers wrote:
> On Thu, 29 Sep 2005 17:57:04 +0200, John Talbot <jtalbot at proionta.gr> 
> wrote:
>
>>
>> That is very surprising. Since Perl probably has nothing to do with the
>> unicode here, the culprit has to be jabberd then. I'll try to upgrade
>> (though I use the apt-get system for which the most recent versions
>> don't always exist).
>
> Back up a second there ;) Perl is notorious for being bad with 
> unicode, wether that's a repuation still deserved I don't know, but it 
> was justified in the past. And afaik never in the history of jabberd 
> has there been such a serious problem with UTF-8 handeling.

You are absolutely right. I tried using another public jabber server, 
before installing one myself, and same malfunction happened.
I even tried Psi, and that didn't make it work either.

> I think you should consider first:
> - is the file UTF-8? (you seem to have this covered)

Yes.

> - is your version of Perl configured right to read unicode UTF-8files..

I'm not sure about this... Perl can get configured? During compile time 
you mean? Also I didn't think that Perl could have anything to do with 
this, because the libraries (Net::XMPP::etc and Net::Jabber::etc) don't 
contain the string 'utf8' anywhere, so I was guessing that these 
libraries were just passing whatever data they found inside the <body> 
tags without regard for utf8 compliance... but I guess I was wrong?

> - *and* to use unicode for string handeling by default

No, it doesn't have that (and 5.8.6 has that?). It has got to be Perl's 
fault, but perhaps there's a way to avoid installing a second version of 
Perl on my system (I've got to keep the old one too, so many .debs are 
dependent on it in some way) - is this advice you're giving accurate? 
i.e. are newer versions of Perl handling unicode by default? If so, 
maybe I can just plug a 'use utf8;' command or type-in some CLI switch 
and make it all happen...

And what do you mean by Perl being able to read unicode files right? 
Aren't utf8 files just a series of bytes in Perl's eyes, just like any 
other file?

> - do you have the most recent Net::Jabber

Yes, and there's only two versions of Net::Jabber (0.1 and 1.0) so all 
have the same.

> Not critizing you or anything, but I'm worried you'll loose a lot of 
> time accomplishing very little trying to find a UTF-8 bug in jabberd. 
> Jabberd's unicode handeling is independant of how your system is 
> setup, how you build it, etc. The same can not be said for Perl. If 
> you're still not convinced it's not jabberd, try another server (a 
> public one).

Yes, and thanks, you did save me. Installing jabberd 1.4.4 was the next 
thing I was going to do... but it's Perl's fault actually :-)



More information about the JDev mailing list