when working with text, it’s best to split at the boundaries of grapheme clusters, not at the boundaries of individual Unicode code points.
• unless you have very specific requirements or are able to query the font, use an API that segments strings into grapheme clusters wherever you need to deal with the notion of “character”.
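
As an illustration of the advice above, here is a minimal TypeScript sketch that segments a string into grapheme clusters with the standard `Intl.Segmenter` API. It assumes a runtime that ships `Intl.Segmenter` (for example Node.js 16+ or a current browser) and a TypeScript configuration that includes the ES2022 Intl typings. The helper names `graphemes` and `truncateGraphemes` are hypothetical and chosen only for this example.

```typescript
// Sketch: treat "characters" as grapheme clusters rather than code units or code points.
// Assumption: the runtime provides Intl.Segmenter.

// Hypothetical helper: split text into user-perceived characters.
function graphemes(text: string, locale: string = "en"): string[] {
  const segmenter = new Intl.Segmenter(locale, { granularity: "grapheme" });
  // Each item yielded by segment() carries one cluster in its `segment` field.
  return Array.from(segmenter.segment(text), (s) => s.segment);
}

// Hypothetical helper: truncate without cutting a cluster in half.
function truncateGraphemes(text: string, max: number): string {
  return graphemes(text).slice(0, max).join("");
}

// Facepalm + skin tone + ZWJ + male sign + variation selector, written as escapes.
const facepalm = "\u{1F926}\u{1F3FC}\u200D\u2642\uFE0F";
// "e" followed by a combining acute accent.
const accented = "e\u0301";

console.log(facepalm.length);             // 7 (UTF-16 code units)
console.log(Array.from(facepalm).length); // 5 (code points)
console.log(graphemes(facepalm).length);  // 1 (grapheme cluster)
console.log(graphemes(accented).length);  // 1
```

Splitting `facepalm` by code unit or by code point would break the emoji into fragments, while the segmenter keeps it whole. The same applies to `truncateGraphemes("a" + accented + "b", 2)`, which keeps the accent attached to its base letter.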