The purpose of this library is to "guess" the encoding of files, and retrieve a reader that is properly configured to use the right encoding as guessed. The library is able to recognize the various Unicode encoding variants:
- UTF-16LE - Low Endian
- UTF-16BE - Big Endian
If a Unicode encoding isn't recognized, it's an 8-bit encoding. If the 8-bit encoding is not US-ASCII, the default platform 8-bit encoding is assumed whatever it is. However, the library cannot guess between different 8-bit encodings. Only statistical analysis, n-grams and similar techniques specific to each language used in those files can help guessing the encoding, but this is not supported by the library.
Use Patent Claims
These details are provided for information only. No information here is legal advice and should not be used as such.