Specifically, based on the request from a user in Germany, I attempted to change the output to encode content as UTF-8 using the new features on Magpie RSS 0.7. However, I have gotten an email and a comment from people with apparently French language sites who claim it has broken their french accents and characters.
However, when I preview the feeds in question from our site using the Build a Feed tool, they look okay.
For one comment to the site, I was suspicious since the URL provided had its own encoding set in the HEAD meta tags as iso-8559-1… does that mean French language sites break under UTF-8?? I am really ignorant of this stuff. But if it breaks more sites than it helps, I will have to revert the encoding to what it was before (Magpie does not allow a per feed encoding setting, it is all or nothing).
What’s a character to do?
Update: Until I can sort this out, I am reverting Feed2JS so it uses default iso-8559-1 encoding. Feeds may need an hour to refresh from our cache.
Another Update: Another attempt. A new paraemter
utf=8 sent to the script on our server, should fork it to a different Magpie for the UTF encoding (see the examples on the Feed2JS log site)
The post "Oh, those messy character encodings.." was originally pulled from under moldy cheese at the back of the fridge at CogDogBlog (http://cogdogblog.com/2005/01/oh-those/) on January 13, 2005.