Support Unicode

As a user, I wish NLP Primitives had the ability to handle unicode text. 

Currently, Unicode text is not correctly handled by regexes in `nlp_primitives`. 

For example, `Àbc` is not recognized as a title word by `TitleWordCount` (`Abc` is).