Universal Encoding Detector currently supports over two dozen character encodings.
ISO-2022-CN (Traditional and Simplified Chinese)
windows-1252 (Western European languages)
windows-1255 (Visual and Logical Hebrew)
UTF-32 BE, LE, 3412-ordered, or 2143-ordered (with a BOM)
UTF-16 BE or LE (with a BOM)
UTF-8 (with or without a BOM)
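Several entries in the list depend on a byte order mark (BOM). As a rough illustration of what "with a BOM" means in practice, here is a minimal sketch using only the BOM constants from Python's standard `codecs` module. It is not the detector's own implementation: the function name `sniff_bom` and the returned labels are assumptions made for this example, and the unusual 3412 and 2143 orderings are left out.

```python
import codecs

# UTF-32 is checked before UTF-16 because the UTF-32 LE BOM (FF FE 00 00)
# starts with the same two bytes as the UTF-16 LE BOM (FF FE).
_BOMS = [
    (codecs.BOM_UTF32_BE, "UTF-32BE"),
    (codecs.BOM_UTF32_LE, "UTF-32LE"),
    (codecs.BOM_UTF8, "UTF-8"),
    (codecs.BOM_UTF16_BE, "UTF-16BE"),
    (codecs.BOM_UTF16_LE, "UTF-16LE"),
]


def sniff_bom(raw):
    """Return an illustrative encoding label if raw starts with a known BOM, else None."""
    for bom, name in _BOMS:
        if raw.startswith(bom):
            return name
    return None
```

A `None` result only means that no BOM was found. UTF-8 text without a BOM, and every other encoding in the list, has to be recognized from the content itself.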
Due to inherent similarities between certain encodings, some encodings may
be detected incorrectly. In my tests, the most problematic case was
Hungarian text encoded as ISO-8859-2 or windows-1250 (encoded as
one but reported as the other). Also, Greek text encoded as ISO-8859-7
was often misreported as ISO-8859-2. Your mileage may vary.
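Because a guess can be wrong, it may help to treat the detector's answer as a hint rather than a fact. The following is a minimal sketch, assuming the detector is a callable that returns a dictionary with `encoding` and `confidence` keys (the shape returned by `chardet.detect`). The helper name `decode_with_fallback`, the 0.8 threshold, and the UTF-8 fallback are assumptions chosen for illustration, not recommendations from the library.

```python
def decode_with_fallback(raw, detect, min_confidence=0.8, fallback="utf-8"):
    """Decode raw bytes with the detected encoding, or with fallback if the guess is weak.

    detect is any callable that takes bytes and returns a dict with
    "encoding" and "confidence" keys, for example chardet.detect.
    Returns a (text, encoding_used) tuple.
    """
    result = detect(raw)
    encoding = result.get("encoding")
    confidence = result.get("confidence") or 0.0

    # Hypothetical policy: distrust missing or low-confidence guesses.
    if encoding is None or confidence < min_confidence:
        encoding = fallback

    try:
        return raw.decode(encoding, errors="replace"), encoding
    except LookupError:
        # The detector named an encoding that Python has no codec for.
        return raw.decode(fallback, errors="replace"), fallback
```

With chardet installed, calling `decode_with_fallback(data, chardet.detect)` would apply this policy to a real detection result. A confidence threshold cannot catch a confident but wrong guess, such as the Hungarian and Greek cases described above, so a known encoding from an HTTP header or file metadata should still take precedence when one is available.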