Boosting Copulas - Multivariate Distrubution Regression in Digital Medicine

Traditional regression models often provide an overly simplistic view on complex associations and relationships to contemporary data problems in the area of biomedicine. In particular, capturing relevant associations between multiple clinical endpoints correctly is of high relevance to avoid model misspecifications, which can lead to biased results and even wrong or misleading conclusions and treatments. As such, methodological development of statistical methods tailored for such problems in biomedicine are of considerable interest. It is the aim of this project to develop novel conditional copula regression models for high-dimensional biomedical data structures by bringing together efficient statistical learning tools for high-dimensional data and established methods from economics for multivariate data structures that allow to capture complex dependence structures between variables. These methods will allow us to model the entire joint distribution of multiple endpoints simultaneously and to automatically determine the relevant influential covariates and risk factors via algorithms originally proposed in the area of statistical and machine learning. The resulting models can then be used both for the interpretation and analysis of complex association-structures as well as for prediction inference (simultaneous prediction intervals for multiple endpoints). Additional implementation in open software and its application in various studies (e.g., eating disorders or schizophrenia) highlight the potentials of this project’s methodological developments in the area of digital medicine.

Principal Investigators
Klein, Nadja Prof. Dr. (Details) (Applied Statistics (J))

Participating external organizations

DFG: Sachbeihilfe

Duration of Project
Start date: 09/2020
End date: 08/2023

Research Areas
Life Sciences

Research Areas
Mathematische Statistik

Last updated on 2020-05-06 at 00:05