Bayesian identifiability for log-linear models
University of St Andrews
About the Project
Log-linear modelling is the standard approach for investigating the full joint dependence structure between categorical variables. Many applications exist. For instance, phenotypes and SNPs. Complex dependence structures can be easily discerned using graphical log-linear models (Papathomas and Richardson, 2016). This can potentially lead to identifying functionally important pathways. Another application is discerning the size of hidden populations, such as victims of modern slavery (Cruyff, M., Overstall, Papathomas, McCrea (2020)). The number of cells in the associated contingency table increases rapidly with the number of variables, creating sparse contingency tables with a number of zero cell counts, even for a large number of subjects. The presence of zero cell counts can potentially make some model parameters non-estimable, also referred to as non-identifiable (Sharifi Far, Papathomas, King, 2022). Non-identifiability is a major impediment to evaluating how risk factors interact, and understanding important biological or other mechanisms. Problems associated with identifiability are currently not sufficiently understood, and have not been addressed in a systematic manner. The aim of this project is to develop methods that will utilize information pertaining to the Bayesian identifiability of interaction parameters, towards choosing the best log-linear model given the data.
To help us track our recruitment effort, please indicate in your email – cover/motivation letter where (jobs-near-me.eu) you saw this job posting.