The purpose of this library is to "guess" the encoding of files, and retrieve a reader that is properly configured to use the right encoding as guessed. The library is able to recognize the various Unicode encoding variants:
- UTF-8
- UTF-16LE - Low Endian
- UTF-16BE - Big Endian
- UTF-32
If a Unicode encoding isn't recognized, it's an 8-bit encoding. If the 8-bit encoding is not US-ASCII, the default platform 8-bit encoding is assumed whatever it is. However, the library cannot guess between different 8-bit encodings. Only statistical analysis, n-grams and similar techniques specific to each language used in those files can help guessing the encoding, but this is not supported by the library.
Commercial Use
Modify
Distribute
Place Warranty
Sub-License
Private Use
Use Patent Claims
Hold Liable
Use Trademarks
Include Copyright
State Changes
Include License
Include Notice
These details are provided for information only. No information here is legal advice and should not be used as such.