Inclusion of Area Under the Precision Recall Curves as the measure to evaluate cross-validation #24
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Dear Jared,
I coded the Area Under the Precision-Recall Curves (AUPRC) as a measure to evaluate cross-validation. I ran several checks and it is working correctly for me. The only limitation is that we could not account for sampling weights when calculating the AUPRC, however, I included a warning to highlight this limitation. I send below a few references:
Fu, G. H., Yi, L. Z., & Pan, J. (2019). Tuning model parameters in class‐imbalanced learning with precision‐recall curve. Biometrical Journal, 61(3), 652-664.
Fu, G. H., Xu, F., Zhang, B. Y., & Yi, L. Z. (2017). Stable variable selection of class-imbalanced data with precision-recall criterion. Chemometrics and Intelligent Laboratory Systems, 171, 241-250.
Kind regards,
Pedro