As a user, I wish NLP Primitives had the ability to handle unicode text. Currently, Unicode text is not correctly handled by regexes in `nlp_primitives`. For example, `Àbc` is not recognized as a title word by `TitleWordCount` (`Abc` is).