This question is based on this question.
I use \P{M}\p{M}* to match all letters (in both German and French).
I chose this regex to avoid spelling out every Unicode range explicitly, as in:
^[a-zA-Z[\\u00c0-\\u01ff]]+[\\']?(([-]?[a-zA-Z[\\u00c0-\\u01ff]]*[\\s]?)|([\\s]?[a-zA-Z[\\u00c0-\\u01ff]]*[-]?)){1,2}[a-zA-Z[\\u00c0-\\u01ff]]+$
However, despite using the Unicode format defined in the previous question, characters such as à or è do not match the regular expression.
I am using JDK 6.
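To narrow it down, here is a minimal sketch of the kind of check I have in mind. The class name `GraphemeCheck`, the method `matchesAll`, and the grouping `(?:...)+` around the pattern are made up for illustration and are not my production code. The test characters are written as `\u` escapes so that the encoding of the source file cannot affect them, and the input is also tried in decomposed (NFD) form via `java.text.Normalizer`, which is available since Java 6.

```java
import java.text.Normalizer;
import java.util.regex.Pattern;

public class GraphemeCheck {
    // One or more units of "a non-mark code point followed by any combining marks".
    // Note: \P{M} also accepts digits, spaces and punctuation, not only letters.
    private static final Pattern GRAPHEMES = Pattern.compile("(?:\\P{M}\\p{M}*)+");

    // Hypothetical helper: true if the whole input consists of such units.
    public static boolean matchesAll(String input) {
        return GRAPHEMES.matcher(input).matches();
    }

    public static void main(String[] args) {
        // Precomposed a-grave and e-grave, written as escapes to rule out
        // source-file encoding problems.
        String composed = "\u00e0\u00e8";
        // The same text as base letters plus combining grave accents.
        String decomposed = Normalizer.normalize(composed, Normalizer.Form.NFD);

        System.out.println(matchesAll(composed));
        System.out.println(matchesAll(decomposed));
    }
}
```

If both lines print true, the pattern itself is probably not what rejects the accented characters, and the encoding used when the source file or the input is read might be worth checking instead.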
What am I missing? Thanks!
Ionut