i'll be posting some information. i think i'm going to try to write a series of artices on staying sane with php5's lack of native unicode support.
"Do you know your character encodings?" SitePoint
http://www.sitepoint.com/blogs/2006/03/15/do- … encodings/
"A tutorial on character code issues," Jukka Korpela
http://www.cs.tut.fi/~jkorpela/chars.html
"The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)," Joel Spolsky
http://www.joelonsoftware.com/articles/Unicode.html
"Scripters' UTF-8 Survival Guide," Harry Fuecks
http://www.webtuesday.ch/_media/meetings/utf-8_survival.pdf
interesting - i've had some problems with broken xml feeds on the site i admin (or rather did admin because i'm transferring)
This means that if you want to make a PHP application which supports any language and runs on the average PHP host out there, that there's only one option: use UTF-8 internally, and write your own functions for string truncation, email header encoding, validation, etc.
http://acko.net/php-no-unicode-support
When I discovered that the popular web development tool PHP has almost complete ignorance of character encoding issues, blithely using 8 bits for characters, making it darn near impossible to develop good international web applications, I thought, enough is enough.
http://www.joelonsoftware.com/articles/Unicode.html
"Character Sets / Character Encoding Issues," Web Application Component Toolkit
http://www.phpwact.org/php/i18n/charsets
"Unicode for the working PHP programmer," IBM developerWorks
http://www.ibm.com/developerworks/library/os- … index.html
"Handling UTF-8 with PHP," Web Application Component Toolkit
http://www.phpwact.org/php/i18n/utf-8
phputf8, a
GPL library
http://phputf8.sourceforge.net/