SBCS, or single-byte character set, is used to refer to character encodings that use exactly one byte for each graphic character. An SBCS can accommodate a maximum of 256 symbols, and is useful for scripts that do not have many symbols or accented letters such as the Latin, Greek and Cyrillic scripts used mainly for European languages. Examples of SBCS encodings include ISO/IEC 646, the various ISO 8859 encodings, and the various Microsoft/IBM code pages.[1][2]

The term SBCS is commonly contrasted against the terms DBCS (double-byte character set) and TBCS (triple-byte character set), as well as MBCS (multi-byte character set). The multi-byte character sets are used to accommodate languages with scripts that have large numbers of characters and symbols, predominantly Asian languages such as Chinese, Japanese, and Korean. These are sometimes referred to by the acronym CJK. In these computing systems, SBCSs are traditionally associated with half-width characters, so-called because such SBCS characters would traditionally occupy half the width of a DBCS character on a fixed-width computer terminal or text screen.

In modern times

edit

Though single-byte character sets have largely been supplanted by UTF-8 and its variants on modern systems,[3] they have found a niche in code golfing, where the smaller byte size of characters allows participants to gain an edge if they use SBCSs with specially-designed programming languages such as Vyxal[4] and GolfScript.

See also

edit

References

edit
  1. ^ "Single-byte Character Sets". Microsoft. Retrieved 2017-04-17.
  2. ^ "SBCS, DBCS, and MBCS data conversion considerations". IBM. 14 September 2015. Retrieved 2017-04-17.
  3. ^ "Usage statistics of UTF-8 for websites". w3techs.com. Retrieved 2024-09-09.
  4. ^ "Vyxal language README". Github. 4 September 2024. Retrieved 2024-09-09.