Well, the subject pretty much says it all. I have some jank XML docs which have non-ascii chars in them. I just want to remove them, or translate them to their equiv. The big thing here though is, I want to retain the file as an XML file, so I don't want to convert the standard ascii chars. I've found many Perl scripts which will do the whole thing, but I really don't know how to do it for non-ascii.