So that's a somewhat confusing title, but basically this is what I want to know:
If I have the string '& # 31169 ;' (without the spaces, I had to add those so it would display the string rather than the character), which represents a Unicode character (in this case 私), how can I use python to recognize that as a single Unicode character rather than just the string of characters that makes up the Unicode representation? More to the point, if I had a whole string of such character representations (say a sentence in Japanese but represented in the ampersand pound format), how could I get Python to convert it to an actual Unicode string?