-
Notifications
You must be signed in to change notification settings - Fork 8
4. Interpreting results
ALLSorts outputs, currently, four key results:
Probabilities distribution of samples per subtype. Black dots are samples that are the negative label for that subtype, red is the positive. The green lines are probability thresholds which are calculated through cross validation based on F1 score or maximal distance between highest negative label and lowest positive label.
Waterfall plot (made that up!) of samples. The X-axis depicts the predicted class, the Y axis is the probability of belonging to a subtype. The colours within the plot represent the true label, white represents previously unknown samples. Note: This does not show subtypes that have multiple labels associated.
Simply a list of predictions made by ALLSorts.
A matrix of subtypes (columns) vs. samples (rows), with each value being the probability of the subtype. The final column is the prediction made (predictions + truth, is labels are supplied). There are two things to note here.
- The probabilities need not sum to 1. This will be explained in a methods section (to-do).
- Child subtypes of parents (Ph/Ph-like are children of Ph Group) have had their probabilities multiplied, i.e. Ph Group probability * Original Ph-like/Ph.