Japanese - http://www.w3.org/TR/japanese-xml/
http://www.charbase.com/
http://www.unicode.org/
http://www-archive.mozilla.org/projects/intl/UniversalCharsetDetection.html
http://cs229.stanford.edu/proj2007/KimPark-AutomaticDetectionOfCharacterEncodingAndLanguages.pdf
http://l0.cm/encodings/list/
http://www.iana.org/assignments/character-sets/character-sets.xml
http://demo.icu-project.org/icu-bin/convexp
http://site.icu-project.org/charts/charset