RDA speedup #633

antagomir · 2024-08-27T20:45:08Z

Running mia::runRDA can be very slow for large data sets. This is a problem in particular when we want to fit alternative RDA models with different formulas (e.g. assay ~ BMI + AGE vs. assay ~ BMI vs. assay ~ AGE), as in:

mia::runRDA(tse,
            assay.type = "relabundance",
            formula = assay ~ BL_AGE + MEN,
            distance = "bray",
            na.action = na.exclude)

One problem is that the beta diversity (dissimilarity) matrix is recalculated for every formula.

A speedup could be obtained by calculating the beta diversity matrix once, storing it in the TreeSE object, and letting runRDA use the stored matrix instead of recalculating it, e.g. something like:

mia::runRDA(metadata(tse)$betadiv,
            assay.type = "relabundance",
            formula = assay ~ BL_AGE + MEN,
            distance = "bray",
            na.action = na.exclude)

Implementation details can be discussed but this would be a substantial improvement.
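
To illustrate the idea outside of mia, here is a minimal sketch that uses vegan directly. It assumes that the distance-based RDA step behaves like vegan::dbrda, which accepts a precomputed dissimilarity object on the left-hand side of the formula. The toy data and the object names (counts, relab, sample_data, bray) are made up for illustration; only the covariate names BL_AGE and MEN come from the example above. This is not how runRDA is implemented, just one way the reuse could look.

```r
library(vegan)

set.seed(1)

# Toy data (hypothetical): 30 samples in rows, 8 taxa in columns
counts <- matrix(rpois(30 * 8, lambda = 20), nrow = 30)
sample_data <- data.frame(
  BL_AGE = rnorm(30, mean = 50, sd = 10),
  MEN    = factor(rep(c("no", "yes"), 15))
)

# Relative abundances per sample
relab <- counts / rowSums(counts)

# Calculate the Bray-Curtis dissimilarity matrix ONCE
bray <- vegdist(relab, method = "bray")

# Reuse the same dissimilarity object for every alternative model
formulas <- list(
  full = bray ~ BL_AGE + MEN,
  age  = bray ~ BL_AGE,
  men  = bray ~ MEN
)
models <- lapply(formulas, function(f) dbrda(f, data = sample_data))

# Inspect one of the fitted models
print(models$full)
```

Note that vegdist expects samples in rows, whereas assays in a TreeSE store taxa in rows, so an assay taken from a TreeSE would need to be transposed first. Handling of missing covariate values (na.action) is left out of this sketch.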

antagomir added the enhancement New feature or request label Aug 27, 2024

antagomir assigned TuomasBorman Aug 27, 2024
