The Life-Changing Magic of Tidying Text | Julia Silge #98

utterances-bot · 2024-01-23T09:02:00Z

The Life-Changing Magic of Tidying Text | Julia Silge

An R package for text mining using tidy data principles

https://juliasilge.com/blog/life-changing-magic/

waragamwangi · 2024-01-23T09:02:01Z

Hello Julia,
I have just started learning Text Mining with R and came across this regular expression " regex("^chapter [\divxlc]", Would you kindly explain what " \divxlc " is searching for? I understand the ^chapter part, however, I dont understand the last part.
Thank you in advance.

juliasilge · 2024-01-23T18:50:33Z

That's a great question @waragamwangi! That is to identify roman numerals, like to find "chapter iv". It doesn't look like it's necessary in this example, but can be good for other datasets.

waragamwangi · 2024-01-24T06:14:14Z

Thank you @juliasilge . Its clear now. I can see the regular expression was able to capture chapters in Mansfield Park and Emma books which are written in roman numbers.

That's a great question @waragamwangi! That is to identify roman numerals, like to find "chapter iv". It doesn't look like it's necessary in this example, but can be good for other datasets.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The Life-Changing Magic of Tidying Text | Julia Silge #98

The Life-Changing Magic of Tidying Text | Julia Silge #98

utterances-bot commented Jan 23, 2024

waragamwangi commented Jan 23, 2024

juliasilge commented Jan 23, 2024

waragamwangi commented Jan 24, 2024

The Life-Changing Magic of Tidying Text | Julia Silge #98

The Life-Changing Magic of Tidying Text | Julia Silge #98

Comments

utterances-bot commented Jan 23, 2024

The Life-Changing Magic of Tidying Text | Julia Silge

waragamwangi commented Jan 23, 2024

juliasilge commented Jan 23, 2024

waragamwangi commented Jan 24, 2024