content_inspector
A simple library for fast inspection of binary buffers to guess/determine the type of content.
This is mainly intended to quickly determine whether a given buffer contains "binary" or "text" data. The analysis is based on a very simple heuristic: detection of special byte order marks (BOMs) and a search for NULL bytes. Note that this analysis can fail. For example, even if unlikely, UTF-8-encoded text can legally contain NULL bytes. Also, for performance reasons, only the first 1024 bytes are checked for NULL bytes (if no BOM is detected).
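The heuristic described above can be sketched in a few lines of standard-library Rust. This is only an illustration under stated assumptions, not the crate's actual implementation: the `Guess` enum, the `guess_content` function, the `MAX_SCAN` constant, and the exact set of BOMs checked are hypothetical names and choices made for this example.

```rust
/// Hypothetical result type for this sketch (not the crate's own type).
#[derive(Debug, PartialEq, Eq, Clone, Copy)]
pub enum Guess {
    Utf8Bom,
    Utf16Le,
    Utf16Be,
    Utf32Le,
    Utf32Be,
    /// No BOM and no NULL byte in the scanned window.
    Text,
    /// No BOM, but a NULL byte was found in the scanned window.
    Binary,
}

/// Only this many leading bytes are searched for NULL bytes.
const MAX_SCAN: usize = 1024;

/// Guess the content type of `buf`: BOM check first, then a NULL-byte search.
pub fn guess_content(buf: &[u8]) -> Guess {
    // The UTF-32 BOMs are tested before the UTF-16 ones because the
    // UTF-32LE BOM (FF FE 00 00) begins with the UTF-16LE BOM (FF FE).
    if buf.starts_with(&[0xEF, 0xBB, 0xBF]) {
        return Guess::Utf8Bom;
    }
    if buf.starts_with(&[0x00, 0x00, 0xFE, 0xFF]) {
        return Guess::Utf32Be;
    }
    if buf.starts_with(&[0xFF, 0xFE, 0x00, 0x00]) {
        return Guess::Utf32Le;
    }
    if buf.starts_with(&[0xFE, 0xFF]) {
        return Guess::Utf16Be;
    }
    if buf.starts_with(&[0xFF, 0xFE]) {
        return Guess::Utf16Le;
    }

    // No BOM: look for a NULL byte, but only in the first MAX_SCAN bytes.
    let window = &buf[..buf.len().min(MAX_SCAN)];
    if window.contains(&0) {
        Guess::Binary
    } else {
        Guess::Text
    }
}
```

In this sketch, a buffer such as `b"Hello"` is classified as `Guess::Text`, while a buffer whose first 1024 bytes contain a NULL byte is classified as `Guess::Binary`. A NULL byte that first appears after the scanned window goes unnoticed, which is one way the heuristic can misclassify binary data as text.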
Usage
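use content_inspector::{ContentType, inspect};
assert_eq!(ContentType::UTF_8, inspect(b"Hello"));
assert_eq!(ContentType::BINARY, inspect(b"\xFF\xE0\x00\x10\x4A\x46\x49\x46\x00"));
assert!(inspect(b"Hello").is_text());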
CLI example
This crate also comes with a small example command-line program (see examples/inspect.rs) that demonstrates the usage:
> inspect
> inspect
If you only want to detect whether something is a binary or text file, this is about 250 times faster than file --mime ....