Legato
Legato

GoFiler Legato Script Reference

 

Legato v 1.4j

Application v 5.22b

  

 

Chapter FiveGeneral Functions (continued)

5.24 Unicode Functions

5.24.1 Overview 

Legato provides a number of functions to work with converting and processing data to and from Unicode and Unicode Text Format. Unicode specifies a wide range of characters used throughout languages of the world. In addition, Unicode contains a plethora of special characters for various branches of science, mathematics and graphics.

The base Unicode set in 16-bits specifying up to 65,535 characters. This set incorporates ASCII and it control character and ISO-8859-1 latin or accented characters common to romance languages which accounts for about 200 characters. Unicode-16 extends that another approximately 60,000 characters by defining positions for various languages and symbols. As it has expanded from its original set, Unicode not has more than 136,000 characters and requires 24-bits for full expression. See Unicode 10.0.0 for more information.

The wchar and wstring types Legato support 16-bit characters and strings (common for Windows Unicode support). In addition, there is a method of encoding Unicode into an 8-bit text stream known as Unicode Text Format, or UTF. Legato supports working with UTF-8 (Unicode-16). UTF encoding uses the top two bits of the 8-bit character to indicate an encoded sequence. It is important to note that 8-bit UTF is not ANSI and special or extended characters will appear as garbage if not interpreted correctly.

5.24.2 Functions