Table 1 Training, validation and held-out test sets used in our study.

All three sets were sampled from the train or validation set of ChestX-ray14 dataset⁹. The training set was used to train data Shapley algorithm and compute Shapley values, the validation set was used to compute the predictor performance score during training, and the held-out test set was used to report the final results. Because the distribution of pneumonia labels in the ChestX-ray14 dataset is highly imbalanced, we sampled a larger proportion of pneumonia cases in the training set and sampled balanced validation and held-out test sets in this study.

Quick links

Search