Loading…
This event has ended. Visit the official site or create your own event on Sched.
Welcome to the Rocky 2021 Conference. Please click on the links below to access the Rocky website and the list of posters:

CONFERENCE RESOURCES
Rocky Website
Poster Presentation List without abstracts
Poster Presentation List with abstracts
Back To Schedule
Friday, December 3 • 4:40pm - 4:50pm
OP 28 - Improving the interpretability of random forest models of genetic association in the presence of non-additive interactions

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

OP-28
Improving the interpretability of random forest models of genetic association in the presence of non-additive interactions

Presenting Author: Alena Orlenko, University of Pennsylvania

Co-Author(s):
Jason Moore, University of Pennsylvania

Abstract: Non-additive interactions among genes are frequently associated with a number of phenotypes, including known complex diseases such as Alzheimer’s, diabetes, and cardiovascular disease. Detecting interactions requires careful selection of analytical methods, and some machine learning algorithms are unable or underpowered to detect or model feature interactions that exhibit non-additivity. The Random Forest (RF) method is often employed in these efforts due to its ability to detect and model non-additive interactions. RF has the built-in ability to estimate feature importance scores, a characteristic that allows the model to be interpreted with the order and effect size of the feature association with the outcome. This characteristic is very important for epidemiological and clinical studies where results of predictive modeling could be used to define the future direction of the research efforts. An alternative way to interpret the model is with a permutation feature importance metric which employs a permutation approach and with the Shapely additive explanations which employ cooperative game theory approach. Currently, it is unclear which RF feature importance metric provides a superior estimation of the true informative contribution of features in genetic association analysis.<br><br>To address this issue, and to improve interpretability of RF predictions, we compared different methods for feature importance estimation in real and simulated datasets with non-additive interactions. As a result, we detected a discrepancy between the metrics for the real-world datasets and further established that the permutation feature importance metric provides more precise feature importance rank estimation for the simulated datasets with non-additive interactions.


Presenters
AO

Alena Orlenko

University of Pennsylvania


Friday December 3, 2021 4:40pm - 4:50pm MST
Ballroom Salon 1