Skip to content

Commit 5a339c1

Browse files
committed
doc fixes and then re-build and re-check
1 parent a199574 commit 5a339c1

File tree

82 files changed

+1214
-1067
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

82 files changed

+1214
-1067
lines changed

.Rbuildignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -21,3 +21,4 @@
2121
^paper.md$
2222
^paper.bib$
2323
^CONTRIBUTING.md$
24+
^Examples$

DESCRIPTION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ Package: vtreat
22
Type: Package
33
Title: A Statistically Sound 'data.frame' Processor/Conditioner
44
Version: 1.4.5
5-
Date: 2019-07-31
5+
Date: 2019-09-11
66
Authors@R: c(
77
person("John", "Mount", email = "[email protected]", role = c("aut", "cre")),
88
person("Nina", "Zumel", email = "[email protected]", role = c("aut")),

NEWS.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11

2-
# vtreat 1.4.5 2019/07/31
2+
# vtreat 1.4.5 2019/09/11
33

44
* Some doc updates.
55

README.Rmd

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -349,7 +349,8 @@ dTrainN %.>%
349349

350350
Related work:
351351

352-
* ["A Transformation for Simplifying the Interpretation of Coefficients of Binary Variables in Regression Analysis"](https://www.jstor.org/stable/2683780), Robert E. Sweeney and Edwin F. Ulveling; The American Statistician, vol. 26, no. 5, pp. 30-32, 1972.
352+
* Cohen J, Cohen P (1983). Applied Multiple Regression/Correlation Analysis For The Behav-
353+
ioral Sciences. 2 edition. Lawrence Erlbaum Associates, Inc. ISBN 0-89859-268-2.
353354
* ["A preprocessing scheme for high-cardinality categorical attributes in classification and prediction problems"](http://dl.acm.org/citation.cfm?id=507538) Daniele Micci-Barreca; ACM SIGKDD Explorations, Volume 3 Issue 1, July 2001 Pages 27-32.
354355
* ["Modeling Trick: Impact Coding of Categorical Variables with Many Levels"](http://www.win-vector.com/blog/2012/07/modeling-trick-impact-coding-of-categorical-variables-with-many-levels/) Nina Zumel; Win-Vector blog, 2012.
355356
* "Big Learning Made Easy – with Counts!", Misha Bilenko, Cortana Intelligence and Machine Learning Blog, 2015.

README.md

Lines changed: 14 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -304,14 +304,14 @@ dTestC <- data.frame(x=c('a', 'b', 'c', NA), z=c(10, 20, 30, NA))
304304
treatmentsC <- designTreatmentsC(dTrainC, colnames(dTrainC), 'y', TRUE,
305305
verbose=FALSE)
306306
print(treatmentsC$scoreFrame[, c('origName', 'varName', 'code', 'rsq', 'sig', 'extraModelDegrees')])
307-
# origName varName code rsq sig extraModelDegrees
308-
# 1 x x_catP catP 0.130498074 0.26400089 2
309-
# 2 x x_catB catB 0.030345745 0.59013918 2
310-
# 3 z z clean 0.237601767 0.13176020 0
311-
# 4 z z_isBAD isBAD 0.296065432 0.09248399 0
312-
# 5 x x_lev_NA lev 0.296065432 0.09248399 0
313-
# 6 x x_lev_x_a lev 0.130005705 0.26490379 0
314-
# 7 x x_lev_x_b lev 0.006067337 0.80967242 0
307+
# origName varName code rsq sig extraModelDegrees
308+
# 1 x x_catP catP 1.559780e-01 0.22202097 2
309+
# 2 x x_catB catB 1.142159e-05 0.99166241 2
310+
# 3 z z clean 2.376018e-01 0.13176020 0
311+
# 4 z z_isBAD isBAD 2.960654e-01 0.09248399 0
312+
# 5 x x_lev_NA lev 2.960654e-01 0.09248399 0
313+
# 6 x x_lev_x_a lev 1.300057e-01 0.26490379 0
314+
# 7 x x_lev_x_b lev 6.067337e-03 0.80967242 0
315315

316316
# help("prepare")
317317

@@ -349,9 +349,9 @@ treatmentsN = designTreatmentsN(dTrainN, colnames(dTrainN), 'y',
349349
verbose=FALSE)
350350
print(treatmentsN$scoreFrame[, c('origName', 'varName', 'code', 'rsq', 'sig', 'extraModelDegrees')])
351351
# origName varName code rsq sig extraModelDegrees
352-
# 1 x x_catP catP 3.700306e-01 0.1095637 2
353-
# 2 x x_catN catN 2.606061e-01 0.1961166 2
354-
# 3 x x_catD catD 6.666667e-02 0.5369633 2
352+
# 1 x x_catP catP 3.558824e-01 0.1184999 2
353+
# 2 x x_catN catN 2.663337e-02 0.6993947 2
354+
# 3 x x_catD catD 4.226472e-02 0.6252538 2
355355
# 4 z z clean 2.880952e-01 0.1701892 0
356356
# 5 z z_isBAD isBAD 3.333333e-01 0.1339746 0
357357
# 6 x x_lev_NA lev 3.333333e-01 0.1339746 0
@@ -407,11 +407,9 @@ dTrainN %.>%
407407

408408
Related work:
409409

410-
- [“A Transformation for Simplifying the Interpretation of
411-
Coefficients of Binary Variables in Regression
412-
Analysis”](https://www.jstor.org/stable/2683780), Robert E.
413-
Sweeney and Edwin F. Ulveling; The American Statistician, vol. 26,
414-
no. 5, pp. 30-32, 1972.
410+
- Cohen J, Cohen P (1983). Applied Multiple Regression/Correlation
411+
Analysis For The Behav- ioral Sciences. 2 edition. Lawrence Erlbaum
412+
Associates, Inc. ISBN 0-89859-268-2.
415413
- [“A preprocessing scheme for high-cardinality categorical attributes
416414
in classification and prediction
417415
problems”](http://dl.acm.org/citation.cfm?id=507538) Daniele

cran-comments.md

Lines changed: 17 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -6,38 +6,35 @@
66

77
### OSX
88

9-
R CMD check --as-cran vtreat_1.4.4.tar.gz
9+
R CMD check --as-cran vtreat_1.4.5.tar.gz
1010
* using R version 3.6.0 (2019-04-26)
1111
* using platform: x86_64-apple-darwin15.6.0 (64-bit)
1212
* using session charset: UTF-8
1313
* using option ‘--as-cran’
1414
* checking for file ‘vtreat/DESCRIPTION’ ... OK
1515
* checking extension type ... Package
16-
* this is package ‘vtreat’ version ‘1.4.4
17-
* checking CRAN incoming feasibility ... NOTE
16+
* this is package ‘vtreat’ version ‘1.4.5
17+
* checking CRAN incoming feasibility ... Note_to_CRAN_maintainers
1818
Maintainer: ‘John Mount <[email protected]>’
19-
Found the following (possibly) invalid URLs:
20-
URL: https://www.jstor.org/stable/2683780
21-
From: inst/doc/vtreat.html
22-
Status: 403
23-
Message: Forbidden
24-
* checking top-level files ... WARNING
25-
Conversion of ‘README.md’ failed:
26-
pandoc: Could not fetch https://www.r-pkg.org/badges/version/vtreat
27-
TlsException (HandshakeFailed (Error_Protocol ("expecting server hello, got alert : [(AlertLevel_Fatal,HandshakeFailure)]",True,HandshakeFailure)))
28-
Status: 1 WARNING, 1 NOTE
29-
WARNING and NOTE are spurious, URLs https://www.jstor.org/stable/2683780 and https://www.jstor.org/stable/2683780 are correct and working.
19+
Status: OK
3020

3121
### Windows
3222

3323
rhub::check_for_cran()
34-
Build ID: vtreat_1.4.4.tar.gz-d878c7bb020e4c0abb135d9e32ceee14
35-
Platform: Windows Server 2008 R2 SP1, R-devel, 32/64 bit
36-
NOTES:
37-
* checking sizes of PDF files under 'inst/doc' ... NOTE
38-
Unable to find GhostScript executable to run checks on size reduction
24+
595#> * using R Under development (unstable) (2019-08-30 r77101)
25+
596#> * using platform: x86_64-w64-mingw32 (64-bit)
26+
597#> * using session charset: ISO8859-1
27+
598#> * using option '--as-cran'
28+
599#> * checking for file 'vtreat/DESCRIPTION' ... OK
29+
600#> * checking extension type ... Package
30+
601#> * this is package 'vtreat' version '1.4.5'
31+
602#> * checking CRAN incoming feasibility ... Note_to_CRAN_maintainers
32+
603#> Maintainer: 'John Mount '
33+
645#> * checking sizes of PDF files under 'inst/doc' ... NOTE
34+
646#> Unable to find GhostScript executable to run checks on size reduction
35+
660#> * DONE
36+
661#> Status: 1 NOTE
3937
GhostScript NOTE is a property of the test environment, not the package.
40-
4138

4239
## Downstream dependencies
4340

docs/CONTRIBUTING.html

Lines changed: 4 additions & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

docs/LICENSE-text.html

Lines changed: 4 additions & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

docs/articles/MultiClassVtreat.html

Lines changed: 2 additions & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

docs/articles/SavingTreamentPlans.html

Lines changed: 2 additions & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)