Rant: ISO-8859-1 vs UTF-8: FIGHT
Friday, June 17th, 2005So I ran into a problem today. Someone had an XML file with an Ñ in it, and the encoding was UTF-8. But, alas, the file encoding was actually ISO-8859-1.
What’s the difference? Start here to read about encoding types and Unicode. Then view some examples here.
What happens is that UTF-8 has strict high-bit format rules for “What constitutes a non-single-byte character” whereas ISO 8859-1 uses all 8 bits to produce 191 characters.