>>101 Granted, the Unicode spec is complex. But you can use UTF-8 as an encoding without having to worry about most of the additional technical standards. I mean, you should probably at least normalize input when saving it to a persistent store, but hell - you can just ignore it and fail hard on all kinds of foreign language edge cases like all American software. UTF-8 don't care.