If you like BoxMatrix then please contribute Supportdata, Supportdata2, Firmware and/or Hardware (get in touch).
My metamonk@yahoo.com is not reachable by me since years. Please use hippie2000@webnmail.de instead.
Property:UTF-8
BoxMatrix >> Lexicon >> Internationalisation >> UTF-8 | @ BoxMatrix - IRC-Chat - Translate: de es fr it nl pl |
News | Selectors | Models | Accessories | Components | Environment | Config | Commands | System | Webif | Software | Develop | Lexicon | Community | Project | Media |
Computer | FRITZ | I18N | Telephony | Smarthome | Internet | Protocols | Multimedia | Formats | Hardware | Software | Research |
Term
Term: | UTF-8 | Wiki | Freetz | IPPF | whmf | AVM | Web |
Short for: | Unicode Transformation Format 8-bit | ||||||
Location: | Lexicon >> Internationalisation | ||||||
Weblinks: | Wikipedia.int - Wikipedia.de - RFC-3629 - ISO/IEC-10646 (nonpub) | ||||||
Description: | Unicode variable-width 8-bit Charset |
Goto: FRITZ!OS - SMW-Browser
Details
UTF-8, is an Unicode standardized 8-bit variable-width (1-4 Bytes) encoded Charset.
Unicode introduced the term code point
, which may be a Character, but a character may consist of multiple code points.
The first 128 code points (0x00 - 0x7F) are US-ASCII compatible, so single Byte (8-bit encoded) ASCII is valid UTF-8 encoding,
if the MSB (0x80) is cleared. All further code points use 2-byte, 3-byte or 4-byte encoding with MSB set and special bit pattern:
0xxxxxxx
- 1-byte - 128 code points 8-bit ASCII110xxxxx 10xxxxxx
- 2-byte - 1920 code points covering most Alphabets1110xxxx 10xxxxxx 10xxxxxx
- 3-byte11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
- 4-byte
These patterns make UTF-8 pretty detectable in Network-Protocols which do not use MIME, like IRC.
For more details see the Wikipedia articles above.
FRITZ!OS
Webinterface
Starting with generation x2xx (ie 7270) and fw 4.63 the Webinterface switched to UTF-8 charset. See CONFIG_UTF8.
More details in the Charset article.
NAS
Linux filesystem NLS support for UTF-8 was added in fw 4.57 - see nls_utf8.ko.
The Kernel-Config variable enabling it is CONFIG_NLS_UTF8. More details in the NLS article.
Starting with fw 5.55 FRITZ!OS converts all filenames on the NAS to UTF-8 using conv2utf8. See CONFIG_UTF8_FULL.