Thomas Goyne
6ca879938d
Always feed the entire file into uchardet when detection is needed
...
uchardet will report that a file is "ascii" if the first page has no bytes
>127, so we need to actually look at the entire file in case the first higher
byte is later in the file.
2019-09-28 18:13:44 -07:00
Thomas Goyne
8d17a0e88a
Assume that files which start with a unicode BOM are valid files of that type
2019-09-28 18:13:44 -07:00
wangqr
bb5090ac3a
Detect EBML magic number to skip encoding detection for MKV
...
MKV loads slow after f733297499
2019-09-22 17:40:46 -07:00
Thomas Goyne
45315476bd
Switch to a maintained fork of universalchardet
2016-02-09 20:29:29 -08:00
Thomas Goyne
879788fe83
Eliminate some gratuitous exceptions
2015-02-15 08:19:10 -08:00
Thomas Goyne
09e325a1c3
Clean up unused includes
2014-05-23 07:28:24 -07:00
Thomas Goyne
3d21f00356
Use read_file_mapping for charset detection
2014-03-21 08:06:41 -07:00
Thomas Goyne
ea96c6e2ad
Make everything final that can be
...
Apparently gcc does use final for devirtualization.
2014-03-12 19:07:30 -07:00
Thomas Goyne
33a4a056a4
Move everything up a level since the root dir no longer has stuff
2014-03-11 12:14:57 -07:00