Boosting Copulas - Multivariate Distrubution Regression in Digital Medicine

Traditional regression models often provide an overly simplistic view on complex associations and relationships to contemporary data problems in the area of biomedicine. In particular, capturing relevant associations between multiple clinical endpoints correctly is of high relevance to avoid model misspecifications, which can lead to biased results and even wrong or misleading conclusions and treatments. As such, methodological development of statistical methods tailored for such problems in biomedicine are of considerable interest. It is the aim of this project to develop novel conditional copula regression models for high-dimensional biomedical data structures by bringing together efficient statistical learning tools for high-dimensional data and established methods from economics for multivariate data structures that allow to capture complex dependence structures between variables. These methods will allow us to model the entire joint distribution of multiple endpoints simultaneously and to automatically determine the relevant influential covariates and risk factors via algorithms originally proposed in the area of statistical and machine learning. The resulting models can then be used both for the interpretation and analysis of complex association-structures as well as for prediction inference (simultaneous prediction intervals for multiple endpoints). Additional implementation in open software and its application in various studies (e.g., eating disorders or schizophrenia) highlight the potentials of this project’s methodological developments in the area of digital medicine.

Principal investigators

Klein, Nadja Prof. Dr. (Details) (Statistik and Data Science)

Participating external organisations

University of Bonn

Financer

DFG: Sachbeihilfe

Duration of project

Start date: 09/2020
End date: 12/2023

Research Areas

Life Sciences

Research Areas

Mathematische Statistik