Unicode control characters  

From The Art and Popular Culture Encyclopedia


Many Unicode control characters are used to control the interpretation or display of text, but these characters themselves have no visual or spatial representation. For example, the null character (U+0000 NULL) is used in C programming environments to mark the end of a string of characters. Such programs therefore need only a single starting memory address for a string (as opposed to a starting address and a length), since the string ends where the program reads the null character.
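The null-terminated convention can be sketched in a few lines. This is only an illustration in Python rather than C: the helper name `read_c_string` and the sample buffer are hypothetical, and a `bytes` object stands in for raw memory.

```python
def read_c_string(memory: bytes, start: int) -> bytes:
    """Return the bytes from `start` up to, but not including, the first NUL byte."""
    # Only a starting offset is needed; the terminator marks the end.
    # bytes.index raises ValueError if no NUL byte follows `start`.
    end = memory.index(0, start)
    return memory[start:end]


# Two strings packed into one buffer, each ended by a NUL byte.
memory = b"hello\x00world\x00"
print(read_c_string(memory, 0))  # b'hello'
print(read_c_string(memory, 6))  # b'world'
```

No length is stored anywhere: the caller supplies an offset and the scan stops at the first null byte.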

ISO 6429 control characters (C0 and C1)

The control characters U+0000–U+001F and U+007F come from ASCII. Additionally, U+0080–U+009F were used in conjunction with ISO 8859 character sets (among others). They are specified in ISO 6429 and often referred to as C0 and C1 control codes respectively.
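As a rough sketch, the ranges above can be checked against Python's `unicodedata` module, which reports the general category `Cc` (control) for these code points. The function name `control_set` is hypothetical, and grouping U+007F with the C0 range follows the wording of this article rather than any normative definition.

```python
import unicodedata


def control_set(code_point: int):
    """Classify a code point using the ranges given in the text (an assumption)."""
    if 0x0000 <= code_point <= 0x001F or code_point == 0x007F:
        return "C0"  # control characters inherited from ASCII
    if 0x0080 <= code_point <= 0x009F:
        return "C1"  # control characters used alongside ISO 8859 sets
    return None      # not one of the control characters described here


# Within U+0000..U+00FF, the classified code points are exactly those
# that the Unicode database puts in general category "Cc".
classified = [cp for cp in range(0x100) if control_set(cp) is not None]
category_cc = [cp for cp in range(0x100) if unicodedata.category(chr(cp)) == "Cc"]
print(classified == category_cc)
print(len(classified))
```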

Most of these characters play no explicit role in Unicode text handling. A few of them, such as the horizontal tab (U+0009), line feed (U+000A), and carriage return (U+000D), are commonly used in text processing as formatting characters.
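To illustrate how a control character can still act as a formatting character, the sketch below inspects the tab, line feed, and carriage return; picking these three is an assumption made for the example, and `describe` is a hypothetical helper. Control characters have no individual names in the Unicode database, so `unicodedata.name` is given the fallback label `<control>`.

```python
import unicodedata


def describe(ch: str):
    """Return (code point label, general category, treated-as-whitespace flag)."""
    return (f"U+{ord(ch):04X}", unicodedata.category(ch), ch.isspace())


for ch in ("\t", "\n", "\r"):
    label, category, is_space = describe(ch)
    # The second argument to unicodedata.name is returned when no name exists.
    print(label, category, is_space, unicodedata.name(ch, "<control>"))
```

All three report category `Cc`, yet `str.isspace` treats them as whitespace, which is why ordinary text-processing code handles them as formatting.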

Unicode introduced separators

In an attempt to simplify the several newline characters used in legacy text, UCS introduces its own newline characters to separate either lines or paragraphs: U+2028 LINE SEPARATOR (HTML &#8232; · LSEP) and U+2029 PARAGRAPH SEPARATOR (HTML &#8233; · PSEP). These characters serve only as text formatting characters and are not <control> characters.
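A short sketch of how the two separators behave in practice, using Python as an example environment; the sample string is made up. `unicodedata.category` reports `Zl` (line separator) and `Zp` (paragraph separator) rather than `Cc`, and `str.splitlines` treats both as line boundaries.

```python
import unicodedata

LINE_SEPARATOR = "\u2028"
PARAGRAPH_SEPARATOR = "\u2029"

# Neither character is in the control category "Cc".
print(unicodedata.category(LINE_SEPARATOR))       # Zl
print(unicodedata.category(PARAGRAPH_SEPARATOR))  # Zp

# str.splitlines recognises both separators as line boundaries.
text = "first line" + LINE_SEPARATOR + "second line" + PARAGRAPH_SEPARATOR + "new paragraph"
print(text.splitlines())  # ['first line', 'second line', 'new paragraph']
```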

Unless indicated otherwise, the text in this article is either based on Wikipedia article "Unicode control characters" or another language Wikipedia page thereof used under the terms of the GNU Free Documentation License; or on research by Jahsonic and friends. See Art and Popular Culture's copyright notice.
