O conjunto de caracteres Unicode contém o conceito de Combinação de caracteres :
combining characters are characters that are intended to modify other characters
Ao copiar o texto acima e exibi-lo com o notepad ++, obtém-se:
Astringdetextocontémnomínimo38caracteres,ondeamaioriacombinaoscaracteres.Porexemplo,pode-seencontraropersonagem
Essescaracterescombinados,destinadosprincipalmenteparausoemidiomasasiáticoscomplexos,tambémpodeserusadocriativamentecomodecoraçãoparacaractereslatinos,oquefoifeitoaqui.
De
All combining characters can be applied to any base character and can, in principle, be used with any script. As with other characters, the allocation of a combining character to one block or another identifies only its primary usage; it is not intended to define or limit the range of characters to which it may be applied. In the Unicode Standard, all sequences of character codes are permitted.
This does not create an obligation on implementations to support all possible combinations equally well. Thus, while application of an Arabic annotation mark to a Han character or a Devanagari consonant is permitted, it is unlikely to be supported well in rendering or to make much sense.