Concept in Definition ABC
Miscellanea / / July 04, 2021
By Guillem Alsina González, in Aug. 2017
In order to transmit information, you must first agree on how it will be done. This is the function of both languages to speak, and alphabets to to write, or communication protocols for computers and electronic devices.
To transmit textual information between hardware different and different operating systems that we will have the problem that these machines can be configured in very different languages (for example, one written alphabetically, and the other in ideograms), and that even present a greater or lesser number of letters comparing them with each other. others.
In this framework, the creation of a text encoding standard that is independent of hardware Y software, valid for use with multiple languages, which is precisely what Unicode is.
Born at the end of 1987, Unicode sought to solve precisely the exchange of documents between different computer systems, so that users do not have to worry about the language in which these documents were written.
The key to Unicode is the assignment of a unique code for each symbol, invariable across the platforms that support it (both hardware What software)
To do this, Unicode uses two bytes (16 bits), giving it an address space of 65,536 characters. Those that are shared by several alphabets, are not repeated. Thus, for example, there is only one character Ç, which is shared by languages such as Portuguese, Catalan, Turkish or Albanian.
In this way, there is enough space to accommodate all the necessary characters corresponding to languages that require a larger range, such as Chinese or Japanese.
Unicode was created by Apple and Xerox, followed by other companies such as Sun Microsystems and Microsoft.
Along with the basic characters, they are included with accents, circumflex accents, and other modifiers commonly used, such as accented vowels in Spanish.
In addition to characters belonging to living language writing systems, in Unicode also we find characters corresponding to dead languages, such as ancient Greek, cuneiform, or runic.
In addition, we find characters that do not correspond to letters and numbers, such as musical notes, icons, arrows, or frames and borders.
When opening a document containing characters that do not correspond to the writing of the language in which the software system, what it does is retrieve the corresponding character from the Unicode standard, thus displaying the representation appropriate at all times, and not a symbol rare product of a bad interpretation of the coding.
Currently, all operating systems support Unicode.
Photos: Fotolia - Carballo / Sangeeta
Unicode themes