Assessment of weighted KNN imputation and multiple imputation techniques using colorectal cancer miRNA data

Document Type


Publication Date


Faculty Mentor

John Stevens


Microarray data often suffer from missing values due to various experimental and technical reasons. The statistical analyses of missing data may lose power and have biased inference. In this presentation, we demonstrate the strengths and weaknesses of the weighted KNN imputation and multiple imputation techniques over the case deletion technique using a large colorectal cancer (CRC) dataset. This CRC dataset contains extensive lifestyle, genetic, survival, and tumor marker data collected from the study participants. Differential expression tests of miRNAs are performed using various statistical methods while considering the correlation structure in the imputed data and controlling for additional risk factors by including them as covariates.

This document is currently not available here.