Unicode
🚧 under construction -> tidy this page
every Unicode character is assigned a code point.
code points are divided into 17 planes.
one or more code points can be combined into a single grapheme cluster.
UTF (Unicode Transformation Format) transforms code points into code units.
most JavaScript engines use UTF-16 encoding.
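A minimal sketch of how code points, planes, and UTF-16 code units show up in JavaScript, assuming strings are exposed as sequences of UTF-16 code units. The helper name `describeChar` is hypothetical and exists only for illustration.

```javascript
// Hypothetical helper: summarize the first character of a string.
function describeChar(str) {
  const codePoint = str.codePointAt(0);           // full code point, even above U+FFFF
  const char = String.fromCodePoint(codePoint);
  return {
    codePoint: 'U+' + codePoint.toString(16).toUpperCase().padStart(4, '0'),
    plane: codePoint >> 16,                       // each plane holds 0x10000 code points
    codeUnits: char.length,                       // .length counts UTF-16 code units
  };
}

console.log(describeChar('A'));
// { codePoint: 'U+0041', plane: 0, codeUnits: 1 }

console.log(describeChar('\u{1F600}'));           // an emoji outside plane 0
// { codePoint: 'U+1F600', plane: 1, codeUnits: 2 }
```

A character outside plane 0 takes two code units (a surrogate pair), which is why `.length` can be larger than the number of characters a reader sees.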
Letter L: lowercase Ll, modifier Lm, titlecase Lt, uppercase Lu, other Lo.
Number N: decimal digit Nd, letter number Nl, other No.
Punctuation P: connector Pc, dash Pd, initial quote Pi, final quote Pf, open Ps, close Pe, other Po.
Mark M (accents etc.): spacing combining Mc, enclosing Me, non-spacing Mn.
Symbol S: currency Sc, modifier Sk, math Sm, other So.
Separator Z: line Zl, paragraph Zp, space Zs.
Other C: control Cc, format Cf, not assigned Cn, private use Co, surrogate Cs.
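The one- and two-letter codes above are Unicode General_Category values, and JavaScript regular expressions can match on them with property escapes. The sketch below assumes an engine with ES2018 support for `\p{...}` under the `u` flag; the variable names are illustrative.

```javascript
// Unicode property escapes only work with the `u` (or `v`) flag.
const isUppercaseLetter = /^\p{Lu}$/u;   // short for \p{General_Category=Lu}
const isDecimalDigit = /^\p{Nd}$/u;
const isCurrencySymbol = /^\p{Sc}$/u;

console.log(isUppercaseLetter.test('A'));     // true
console.log(isUppercaseLetter.test('a'));     // false
console.log(isDecimalDigit.test('\u0663'));   // true  (ARABIC-INDIC DIGIT THREE)
console.log(isCurrencySymbol.test('\u20AC')); // true  (EURO SIGN)

// A one-letter group such as L matches any of its subcategories.
const words = 'A\u00F1o 2024'.match(/\p{L}+/gu);
console.log(words);                           // [ 'Año' ]
```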
- list all properties by a character
- list all characters by a property
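Both lookups can be roughly approximated in plain JavaScript with property escapes. This is only a sketch: it covers General_Category rather than every Unicode property, and the names `CATEGORIES`, `categoryOf`, and `charsWithProperty` are hypothetical.

```javascript
// Two-letter General_Category codes checked by this sketch (Cn and Cs omitted).
const CATEGORIES = [
  'Lu', 'Ll', 'Lt', 'Lm', 'Lo', 'Mn', 'Mc', 'Me', 'Nd', 'Nl', 'No',
  'Pc', 'Pd', 'Ps', 'Pe', 'Pi', 'Pf', 'Po', 'Sm', 'Sc', 'Sk', 'So',
  'Zs', 'Zl', 'Zp', 'Cc', 'Cf', 'Co',
];

// Hypothetical helper: property by character (general category only).
function categoryOf(char) {
  return CATEGORIES.find((cat) => new RegExp(`^\\p{${cat}}$`, 'u').test(char));
}

// Hypothetical helper: characters by property, within the range [from, to].
function charsWithProperty(property, from, to) {
  const re = new RegExp(`^\\p{${property}}$`, 'u');
  const result = [];
  for (let cp = from; cp <= to; cp++) {
    const char = String.fromCodePoint(cp);
    if (re.test(char)) result.push(char);
  }
  return result;
}

console.log(categoryOf('a'));                      // Ll
console.log(categoryOf('_'));                      // Pc
console.log(charsWithProperty('Sc', 0x20, 0x7e));  // [ '$' ]
```

Scanning a bounded range keeps the sketch fast; walking all 17 planes works the same way but takes noticeably longer.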
ExploringJS
• Script (a writing system) = Cyrillic, Greek, Arabic, Han (Chinese) ...
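Scripts can be matched with the `Script` property in a regular expression. A small sketch, again assuming ES2018 property escapes; the variable names are illustrative.

```javascript
// \p{Script=...} matches characters that belong to one writing system.
const greek = /\p{Script=Greek}+/gu;      // \p{sc=Greek} is the short form
const cyrillic = /\p{Script=Cyrillic}/u;
const han = /\p{Script=Han}/u;

const greekRuns = 'alpha = \u03B1\u03BB\u03C6\u03B1'.match(greek);
console.log(greekRuns);                   // [ 'αλφα' ]

console.log(cyrillic.test('\u0416'));     // true  (CYRILLIC CAPITAL LETTER ZHE)
console.log(han.test('\u4E2D'));          // true  (a CJK ideograph)
console.log(han.test('A'));               // false (Latin)
```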
codepen