Restriction for multiple dependent and independent variables
rROC.data.frame.RdRestriction for multiple dependent and independent variables. Traverses all dependent variables and within all independent variables. Then calculates the rROC in the sense of dependent ~ independent.
Can save intermediate results to disk, to avoid re-calculating for crash-reasons or to save time when re-running the same analysis.
Can return plot_density_rROC_empirical for every combination.
Usage
# S3 method for class 'data.frame'
rROC(
x,
independent_vars,
dependent_vars = NULL,
y = NULL,
save_path = NULL,
parallel_permutations = TRUE,
n_permutations = 10000,
save_intermediate = TRUE,
load_existing_intermediate = TRUE,
positive_label = 1,
verbose = TRUE,
do_plots = FALSE,
fix_seed = 0,
...
)Arguments
- x
A data.frame containing all dependent and independent variables as columns.
- independent_vars
A character vector of independent variable column names. If NULL, all columns except dependent_vars are used.
- dependent_vars
A character vector of dependent variable column names. If NULL,
ymust be given.- y
Either a vector of dependent variable values or a list of length 1 of a vector of dependent variable values. If NULL, dependent_vars must be given.
- save_path
Path to save the results to. Intermediate results are saved into the directory file.path(save_path, "_partial_directory").
- parallel_permutations
boolean: If TRUE, the permutation will be done via
future.apply::future_lapply, otherwise bybase::lapply- n_permutations
How many permutations should be done
- save_intermediate
Should intermediate results be saved to disk? If TRUE, every combination by itself is saved into file.path(save_path, "_partial_directory").
- load_existing_intermediate
Should the earlier saved intermediate results in the folder file.path(save_path, "_partial_directory") be loaded?
- positive_label
Label for the positive class. All other values of
responseare regarded as negative cases.- verbose
Should progress be printed?
- do_plots
Should the plot_density_rROC_empirical be calculated and returned?
- fix_seed
boolean: If not FALSE, the seed for each permutation will be set by set.seed(fix_seed + permutation_i)
- ...
Arguments passed on to
simple_rROC_permutationreturn_procShould pROC::roc() be returned for the full dataset? 2) Should pROC::roc() be returned on each of the part datasets? Only works with
get_all_aucs_fun=get_all_aucsafterget_all_aucs_norecalculation()does not calculate the ROC curves for each restriction separately.
Value
A list of lists of simple_rROC_permutation and plot results. It is structured as follows:
dependent variable:
independent variable:
- "plots"
plot_density_rROC_empiricalresult- "permutation"
simple_rROC_permutationresult