surrogate pair
two-code-unit character.
Code points beyond the BMP (also called astral code points) are transformed into two UTF-16 code units.
The two parts of the pair must be in 0xD800 ~ 0xDFFF. (2^11 = 0x800)
high-surrogate code unit : 0xD800 ~ 0xDBFF. (2^10 = 0x400)
low-surrogate code unit : 0xDC00 ~ 0xDFFF. (2^10 = 0x400)
Code units in this range are not used to encode single-code-unit characters; one that appears without its partner is called a "lone surrogate".
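
The arithmetic behind the two ranges can be sketched as follows. This is a minimal illustration, not taken from any particular library: the function names `to_surrogate_pair` and `from_surrogate_pair` are hypothetical and chosen only for this example. It assumes the standard UTF-16 rule that 0x10000 is subtracted from the code point and the remaining 20 bits are split into two 10-bit halves, which is why each half has 2^10 = 0x400 possible values.

```python
def to_surrogate_pair(code_point: int) -> tuple:
    """Split an astral code point (0x10000..0x10FFFF) into (high, low) code units."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("code point is not beyond the BMP")
    offset = code_point - 0x10000      # 20-bit value, 0x00000..0xFFFFF
    high = 0xD800 + (offset >> 10)     # top 10 bits    -> 0xD800 ~ 0xDBFF
    low = 0xDC00 + (offset & 0x3FF)    # bottom 10 bits -> 0xDC00 ~ 0xDFFF
    return high, low


def from_surrogate_pair(high: int, low: int) -> int:
    """Combine a high and a low surrogate back into one code point."""
    if not 0xD800 <= high <= 0xDBFF:
        raise ValueError("not a high-surrogate code unit")
    if not 0xDC00 <= low <= 0xDFFF:
        raise ValueError("not a low-surrogate code unit")
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


# U+1F600 is an astral code point, so it needs two code units.
high, low = to_surrogate_pair(0x1F600)
print(hex(high), hex(low))                   # 0xd83d 0xde00
print(hex(from_surrogate_pair(high, low)))   # 0x1f600
```

Passing a value that is outside the expected range, such as a low surrogate in the `high` position, raises `ValueError` in this sketch; real decoders differ in how they treat lone surrogates.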