You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello Julia,
I have just started learning Text Mining with R and came across this regular expression " regex("^chapter [\divxlc]", Would you kindly explain what " \divxlc " is searching for? I understand the ^chapter part, however, I dont understand the last part.
Thank you in advance.
That's a great question @waragamwangi! That is to identify roman numerals, like to find "chapter iv". It doesn't look like it's necessary in this example, but can be good for other datasets.
Thank you @juliasilge . Its clear now. I can see the regular expression was able to capture chapters in Mansfield Park and Emma books which are written in roman numbers.
That's a great question @waragamwangi! That is to identify roman numerals, like to find "chapter iv". It doesn't look like it's necessary in this example, but can be good for other datasets.
The Life-Changing Magic of Tidying Text | Julia Silge
An R package for text mining using tidy data principles
https://juliasilge.com/blog/life-changing-magic/
The text was updated successfully, but these errors were encountered: