Struct encoding::codec::utf_8::UTF8Encoding

pub struct UTF8Encoding;

UTF-8 (UCS Transformation Format, 8-bit).

This is a Unicode encoding compatible with ASCII (ISO/IEC 646:US) that can represent every Unicode codepoint uniquely and unambiguously. It has a variable-length design: a codepoint takes 1 byte (up to U+007F), 2 bytes (up to U+07FF), 3 bytes (up to U+FFFF) or 4 bytes (up to U+10FFFF), depending on its value. The first byte of a sequence is distinct from the "continuation" bytes that follow it, which makes UTF-8 self-synchronizing and easy to handle. It has a fixed endianness, and sorting encoded strings lexicographically by byte yields the same order as sorting them by codepoint.
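The properties above can be checked with a short sketch that uses only the Rust standard library. The helper names (`utf8_len`, `is_continuation`, `sequence_start`, `same_order`) are made up for illustration and are not part of this crate; the crate's own scanner is a state machine and works differently.

```rust
/// Number of bytes UTF-8 uses for a codepoint, following the ranges above.
/// Hypothetical helper for illustration, not part of this crate.
fn utf8_len(c: char) -> usize {
    match c as u32 {
        0x0000..=0x007F => 1,
        0x0080..=0x07FF => 2,
        0x0800..=0xFFFF => 3,
        _ => 4, // U+10000 to U+10FFFF; a `char` cannot hold anything larger
    }
}

/// Continuation bytes always have the bit pattern 10xxxxxx,
/// so they can never be mistaken for the first byte of a sequence.
fn is_continuation(byte: u8) -> bool {
    byte & 0b1100_0000 == 0b1000_0000
}

/// Self-synchronization: starting from any in-bounds byte offset of valid
/// UTF-8, step backwards over continuation bytes to reach the first byte
/// of the sequence that contains that offset.
fn sequence_start(bytes: &[u8], mut pos: usize) -> usize {
    while pos > 0 && is_continuation(bytes[pos]) {
        pos -= 1;
    }
    pos
}

/// Byte-wise comparison of two strings agrees with comparing them
/// codepoint by codepoint.
fn same_order(a: &str, b: &str) -> bool {
    a.as_bytes().cmp(b.as_bytes()) == a.chars().cmp(b.chars())
}
```

For instance, `utf8_len('\u{20AC}')` is 3, which agrees with `char::len_utf8` from the standard library, and `sequence_start("a\u{20AC}b".as_bytes(), 3)` returns 1, the offset of the first byte of the three-byte sequence.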

The UTF-8 scanner used by this module is heavily based on Bjoern Hoehrmann's Flexible and Economical UTF-8 Decoder.

Trait Implementations

impl Copy for UTF8Encoding

impl Clone for UTF8Encoding

Returns a copy of the value.

Performs copy-assignment from source.

impl Encoding for UTF8Encoding

Returns the canonical name of the given encoding. This name is guaranteed to be unique across built-in encodings, but it is not normative and may be arbitrary.

Returns the name of the given encoding as defined in the WHATWG Encoding standard, if any. This name often differs from name for compatibility reasons.

Creates a new encoder.

Creates a new decoder.

An easy-to-use interface to RawEncoder. On an encoder error, trap is called; it may either return a replacement sequence so that processing continues, or fail, in which case the error is returned.

Encode into a ByteWriter.

An easy-to-use interface to RawDecoder. On a decoder error, trap is called; it may either return a replacement string so that processing continues, or fail, in which case the error is returned.

Decode into a StringWriter.
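The two outcomes of a decoder trap (fail with the error, or substitute a replacement and continue) can be illustrated by analogy with the standard library alone. This sketch does not use this crate's API; `decode_strict` and `decode_replace` are hypothetical names, and the standard library functions they wrap stand in for a failing trap and a replacing trap respectively.

```rust
use std::borrow::Cow;
use std::string::FromUtf8Error;

/// Analogue of a trap that fails: the first invalid sequence aborts
/// decoding and the error is returned to the caller.
fn decode_strict(input: &[u8]) -> Result<String, FromUtf8Error> {
    String::from_utf8(input.to_vec())
}

/// Analogue of a trap that replaces: each invalid sequence becomes
/// U+FFFD (the replacement character) and decoding continues.
fn decode_replace(input: &[u8]) -> Cow<'_, str> {
    String::from_utf8_lossy(input)
}
```

With the input bytes `[0x61, 0xFF, 0x62]`, `decode_strict` returns an error because 0xFF can never appear in UTF-8, while `decode_replace` yields `"a\u{FFFD}b"`.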