Closed
Description
by jjc.jclark.com:
The spec defines an identifier like this:

identifier = letter { letter | unicode_digit }

where letter is _ or a character in general category Lu, Ll, Lt, Lm, or Lo. This doesn't work for languages that use combining characters (e.g. South and South-East Asian languages). For example, in Thai some vowels have general category Lo and some have general category Mn, so words containing the latter cannot be written as identifiers.

There are lots of details in http://www.unicode.org/reports/tr31/ but I would actually recommend using http://www.unicode.org/reports/tr31/#Alternative_Identifier_Syntax instead. This keeps things simple and ensures that the definition of an identifier is independent of the Unicode version.
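
To make the problem concrete, here is a minimal sketch in Go. The function names isSpecIdentifier and isAltIdentifier are hypothetical and exist only for this illustration. The first follows the rule quoted above, assuming unicode_digit means category Nd. The second is a simplified reading of the TR31 alternative syntax (reject any Pattern_White_Space or Pattern_Syntax character); the report itself may list further exclusions, so treat it as an approximation rather than a conforming implementation.

```go
package main

import (
	"fmt"
	"unicode"
)

// isSpecIdentifier reports whether s matches the rule quoted in this issue:
//   identifier = letter { letter | unicode_digit }
// where letter is '_' or a character in Lu, Ll, Lt, Lm, or Lo.
// Assumption: unicode_digit is general category Nd.
func isSpecIdentifier(s string) bool {
	if s == "" {
		return false
	}
	for i, r := range s {
		if r == '_' || unicode.IsLetter(r) {
			continue
		}
		// Digits are allowed anywhere except the first position.
		if i > 0 && unicode.IsDigit(r) {
			continue
		}
		return false
	}
	return true
}

// isAltIdentifier is a simplified sketch of the TR31 alternative syntax:
// any non-empty string with no Pattern_White_Space or Pattern_Syntax
// characters. This is an approximation, not the full text of the report.
func isAltIdentifier(s string) bool {
	if s == "" {
		return false
	}
	for _, r := range s {
		if unicode.Is(unicode.Pattern_White_Space, r) ||
			unicode.Is(unicode.Pattern_Syntax, r) {
			return false
		}
	}
	return true
}

func main() {
	// U+0E01 KO KAI (Lo) followed by U+0E32 SARA AA (Lo).
	loOnly := "\u0e01\u0e32"
	// U+0E01 KO KAI (Lo), U+0E34 SARA I (Mn), U+0E19 NO NU (Lo).
	withMn := "\u0e01\u0e34\u0e19"

	fmt.Println(isSpecIdentifier(loOnly), isSpecIdentifier(withMn))
	fmt.Println(isAltIdentifier(loOnly), isAltIdentifier(withMn))
}
```

Under the current rule the second Thai word is rejected only because one of its vowels happens to be a nonspacing mark, whereas the exclusion-based check accepts both words and does not depend on which letters a given Unicode version defines.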