Universal Character Set - Definition. Was ist Universal Character Set
Diclib.com
Wörterbuch ChatGPT
Geben Sie ein Wort oder eine Phrase in einer beliebigen Sprache ein 👆
Sprache:

Übersetzung und Analyse von Wörtern durch künstliche Intelligenz ChatGPT

Auf dieser Seite erhalten Sie eine detaillierte Analyse eines Wortes oder einer Phrase mithilfe der besten heute verfügbaren Technologie der künstlichen Intelligenz:

  • wie das Wort verwendet wird
  • Häufigkeit der Nutzung
  • es wird häufiger in mündlicher oder schriftlicher Rede verwendet
  • Wortübersetzungsoptionen
  • Anwendungsbeispiele (mehrere Phrasen mit Übersetzung)
  • Etymologie

Was (wer) ist Universal Character Set - definition

TYPE OF CHARACTER ENCODING SCHEME
Variable width encoding; Multi-byte character set; Variable-width; Multi-Byte Character Set; Multibyte character; Multi-byte character; Multibyte character set; Multi Byte Character Set; Multi-byte Character Set; MBCS (encoding); Multi byte character set; MB (character set); Multiple-byte character set; Multiple-Byte Character Set; Multiple Byte Character Set; Multiple-byte Character Set; Multiple byte character set; Multiple-byte character; Mixed Multi-Byte Character Set; PCMB (encoding); Mixed Multi-byte Character Set; Mixed multi-byte character set; Mixed multi byte character set; Mixed Multi Byte Character Set; Mixed Multiple-Byte Character Set; Mixed Multiple-byte Character Set; Mixed multiple-byte character set; Mixed Multiple Byte Character Set; Mixed multiple byte character set

Universal Character Set         
STANDARD SET OF CHARACTERS DEFINED BY THE INTERNATIONAL STANDARD ISO/IEC 10646
UCS-16; UCS-2; ISO 10646; ISO/IEC 10646; IEC 10646; ISO10646; Universal code (typography); ISO-10646; Universal Code (Typography); ISO/IEC 10646-1; List of Unicode entities; Iso 10646-1; ISO/IEC 10646-1:2000; Universal character set; ISO/IEC JTC1/SC2/WG2; ISO/IEC 10646-1:2000(E); Universal Character Set; 10646-1:1993; ISO/CEI 10646; ISO/IEC 10646-2; ISO/CEI 10646-1; ISO/CEI 10646-2; ISO/IEC 10646-1:1993; ISO/IEC 10646-2:2001; ISO/CEI 10646-2:2001; ISO/CEI 10646-1:1993; ISO/CEI 10646-1:2000; ISO/IEC 10646:1993; ISO/IEC 10646:2000; ISO/IEC 10646:2001; ISO/IEC 10646:2003; ISO/IEC 10646:2011; ISO/IEC 10646:2012; ISO/IEC 10646:2014; ISO/CEI 10646:1993; ISO/CEI 10646:2000; ISO/CEI 10646:2001; ISO/CEI 10646:2003; ISO/CEI 10646:2011; ISO/CEI 10646:2012; ISO/CEI 10646:2014
<character, standard> (UCS, ISO/IEC 10646) A 1993 ISO and IEC standard character set, also known as "Universal Multiple-Octet Coded Character Set". UCS comes in a 16-bit variant called UCS-2 and a 32-bit variant called UCS-4, which is composed of 16-bit UCS-2 "planes". So far only one 16-bit plane has been defined, which is known as the Basic Multilingual Plane. The implementation of UCS is still in its infancy, though some moves, such as the Java language defining a character to be 16 bits, are suggestive. [Relationship with Unicode?] (1997-07-04)
Universal Coded Character Set         
STANDARD SET OF CHARACTERS DEFINED BY THE INTERNATIONAL STANDARD ISO/IEC 10646
UCS-16; UCS-2; ISO 10646; ISO/IEC 10646; IEC 10646; ISO10646; Universal code (typography); ISO-10646; Universal Code (Typography); ISO/IEC 10646-1; List of Unicode entities; Iso 10646-1; ISO/IEC 10646-1:2000; Universal character set; ISO/IEC JTC1/SC2/WG2; ISO/IEC 10646-1:2000(E); Universal Character Set; 10646-1:1993; ISO/CEI 10646; ISO/IEC 10646-2; ISO/CEI 10646-1; ISO/CEI 10646-2; ISO/IEC 10646-1:1993; ISO/IEC 10646-2:2001; ISO/CEI 10646-2:2001; ISO/CEI 10646-1:1993; ISO/CEI 10646-1:2000; ISO/IEC 10646:1993; ISO/IEC 10646:2000; ISO/IEC 10646:2001; ISO/IEC 10646:2003; ISO/IEC 10646:2011; ISO/IEC 10646:2012; ISO/IEC 10646:2014; ISO/CEI 10646:1993; ISO/CEI 10646:2000; ISO/CEI 10646:2001; ISO/CEI 10646:2003; ISO/CEI 10646:2011; ISO/CEI 10646:2012; ISO/CEI 10646:2014
The Universal Coded Character Set (UCS, Unicode) is a standard set of characters defined by the international standard ISO/IEC 10646, Information technology — Universal Coded Character Set (UCS) (plus amendments to that standard), which is the basis of many character encodings, improving as characters from previously unrepresented typing systems are added.
ISO 10646         
STANDARD SET OF CHARACTERS DEFINED BY THE INTERNATIONAL STANDARD ISO/IEC 10646
UCS-16; UCS-2; ISO 10646; ISO/IEC 10646; IEC 10646; ISO10646; Universal code (typography); ISO-10646; Universal Code (Typography); ISO/IEC 10646-1; List of Unicode entities; Iso 10646-1; ISO/IEC 10646-1:2000; Universal character set; ISO/IEC JTC1/SC2/WG2; ISO/IEC 10646-1:2000(E); Universal Character Set; 10646-1:1993; ISO/CEI 10646; ISO/IEC 10646-2; ISO/CEI 10646-1; ISO/CEI 10646-2; ISO/IEC 10646-1:1993; ISO/IEC 10646-2:2001; ISO/CEI 10646-2:2001; ISO/CEI 10646-1:1993; ISO/CEI 10646-1:2000; ISO/IEC 10646:1993; ISO/IEC 10646:2000; ISO/IEC 10646:2001; ISO/IEC 10646:2003; ISO/IEC 10646:2011; ISO/IEC 10646:2012; ISO/IEC 10646:2014; ISO/CEI 10646:1993; ISO/CEI 10646:2000; ISO/CEI 10646:2001; ISO/CEI 10646:2003; ISO/CEI 10646:2011; ISO/CEI 10646:2012; ISO/CEI 10646:2014

Wikipedia

Variable-width encoding

A variable-width encoding is a type of character encoding scheme in which codes of differing lengths are used to encode a character set (a repertoire of symbols) for representation, usually in a computer. Most common variable-width encodings are multibyte encodings, which use varying numbers of bytes (octets) to encode different characters. (Some authors, notably in Microsoft documentation, use the term multibyte character set, which is a misnomer, because representation size is an attribute of the encoding, not of the character set.)

Early variable width encodings using less than a byte per character were sometimes used to pack English text into fewer bytes in adventure games for early microcomputers. However disks (which unlike tapes allowed random access allowing text to be loaded on demand), increases in computer memory and general purpose compression algorithms have rendered such tricks largely obsolete.

Multibyte encodings are usually the result of a need to increase the number of characters which can be encoded without breaking backward compatibility with an existing constraint. For example, with one byte (8 bits) per character, one can encode 256 possible characters; in order to encode more than 256 characters, the obvious choice would be to use two or more bytes per encoding unit, two bytes (16 bits) would allow 65,536 possible characters, but such a change would break compatibility with existing systems and therefore might not be feasible at all.