-
Notifications
You must be signed in to change notification settings - Fork 100
Description
Hello, for several subsets of my data, my code runs fine, but for one subset (looking at newspapers from the UK), I keep getting this error:
Warning: There were 10 warnings in mutate()
.
The first warning was:
ℹ In argument: topic_model = future_map(K, ~stm(dfm_uk, K = ., verbose = FALSE, seed = 2461))
.
Caused by warning in dfm2stm()
:
! Dropped 21,185 empty document(s)
ℹ Run dplyr::last_dplyr_warnings() to see the 9 remaining warnings.
Here is my code:
conflicts_prefer(stopwords::stopwords)
toks <-
tokens(text_corp2, remove_numbers = TRUE, remove_punct = TRUE, remove_symbols = TRUE) %>%
tokens_wordstem %>%
tokens_remove(c(stopwords("en", source = "stopwords-iso")))
dfm_uk <- dfm(toks) %>%
dfm_trim(min_docfreq = 0.01, docfreq_type = "prop")
Error occurs here:
many_models <- data_frame(K = c(20, 30, 40, 50, 60)) %>%
mutate(topic_model = future_map(K, ~stm(dfm_uk, K = .,
verbose = FALSE, seed=2461)))
dplyr::last_dplyr_warnings()
[[1]]
<warning/rlang_warning>
Warning inmutate()
:
ℹ In argument:topic_model = future_map(K, ~stm(dfm_uk, K = ., verbose = FALSE, seed = 2461))
.
Caused by warning indfm2stm()
:
! Dropped 21,185 empty document(s)
Backtrace:
▆
- ├─data_frame(K = c(20, 30, 40, 50, 60)) %>% ...
- ├─dplyr::mutate(...)
- └─dplyr:::mutate.data.frame(...)
[[2]]
<warning/rlang_warning>
Warning in mutate()
:
ℹ In argument: topic_model = future_map(K, ~stm(dfm_uk, K = ., verbose = FALSE, seed = 2461))
.
Caused by warning:
! UNRELIABLE VALUE: Future (‘’) unexpectedly generated random numbers without specifying argument 'seed'. There is a risk that those random numbers are not statistically sound and the overall results might be invalid. To fix this, specify 'seed=TRUE'. This ensures that proper, parallel-safe random numbers are produced via the L'Ecuyer-CMRG method. To disable this check, use 'seed=NULL', or set option 'future.rng.onMisuse' to "ignore".
Backtrace:
▆
- ├─data_frame(K = c(20, 30, 40, 50, 60)) %>% ...
- ├─dplyr::mutate(...)
- └─dplyr:::mutate.data.frame(...)
[[3]]
<warning/rlang_warning>
Warning in mutate()
:
ℹ In argument: topic_model = future_map(K, ~stm(dfm_uk, K = ., verbose = FALSE, seed = 2461))
.
Caused by warning in dfm2stm()
:
! Dropped 21,185 empty document(s)
Backtrace:
▆
- ├─data_frame(K = c(20, 30, 40, 50, 60)) %>% ...
- ├─dplyr::mutate(...)
- └─dplyr:::mutate.data.frame(...)
[[4]]
<warning/rlang_warning>
Warning in mutate()
:
ℹ In argument: topic_model = future_map(K, ~stm(dfm_uk, K = ., verbose = FALSE, seed = 2461))
.
Caused by warning:
! UNRELIABLE VALUE: Future (‘’) unexpectedly generated random numbers without specifying argument 'seed'. There is a risk that those random numbers are not statistically sound and the overall results might be invalid. To fix this, specify 'seed=TRUE'. This ensures that proper, parallel-safe random numbers are produced via the L'Ecuyer-CMRG method. To disable this check, use 'seed=NULL', or set option 'future.rng.onMisuse' to "ignore".
Backtrace:
▆
- ├─data_frame(K = c(20, 30, 40, 50, 60)) %>% ...
- ├─dplyr::mutate(...)
- └─dplyr:::mutate.data.frame(...)
[[5]]
<warning/rlang_warning>
Warning in mutate()
:
ℹ In argument: topic_model = future_map(K, ~stm(dfm_uk, K = ., verbose = FALSE, seed = 2461))
.
Caused by warning in dfm2stm()
:
! Dropped 21,185 empty document(s)
Backtrace:
▆
- ├─data_frame(K = c(20, 30, 40, 50, 60)) %>% ...
- ├─dplyr::mutate(...)
- └─dplyr:::mutate.data.frame(...)
... with 5 more warnings.