TY - JOUR

T1 - Estimating net transition probabilities from cross-sectional data wit application to risk factors in chronic disease modeling

AU - van de Kassteele, J.

AU - Hoogenveen, R.T.

AU - Engelfriet, P.M.

AU - van Baal, P.H.

AU - Boshuizen, H.C.

PY - 2012

Y1 - 2012

N2 - A problem occurring in chronic disease modeling is the estimation of transition probabilities of moving from one state of a categorical risk factor to another. Transitions could be obtained from a cohort study, but often such data may not be available. However, under the assumption that transitions remain stable over time, age specific cross-sectional prevalence data could be used instead. Problems that then arise are parameter identifiability and the fact that age dependent cross-sectional data are often noisy or are given in age intervals. In this paper we propose a method to estimate so-called net annual transition probabilities from cross-sectional data, including their uncertainties. Net transitions only describe the net inflow or outflow into a certain risk factor state at a certain age. Our approach consists of two steps: first, smooth the data using multinomial P-splines, second, from these data estimate net transition probabilities. This second step can be formulated as a transportation problem, which is solved using the simplex algorithm from linear programming theory. A sensible specification of the cost matrix is crucial to get meaningful results. Uncertainties are assessed by parametric bootstrapping. We illustrate our method using data on body mass index. We conclude that this method provides a flexible way of estimating net transitions and that the use of net transitions has implications for model dynamics, for example when modeling interventions

AB - A problem occurring in chronic disease modeling is the estimation of transition probabilities of moving from one state of a categorical risk factor to another. Transitions could be obtained from a cohort study, but often such data may not be available. However, under the assumption that transitions remain stable over time, age specific cross-sectional prevalence data could be used instead. Problems that then arise are parameter identifiability and the fact that age dependent cross-sectional data are often noisy or are given in age intervals. In this paper we propose a method to estimate so-called net annual transition probabilities from cross-sectional data, including their uncertainties. Net transitions only describe the net inflow or outflow into a certain risk factor state at a certain age. Our approach consists of two steps: first, smooth the data using multinomial P-splines, second, from these data estimate net transition probabilities. This second step can be formulated as a transportation problem, which is solved using the simplex algorithm from linear programming theory. A sensible specification of the cost matrix is crucial to get meaningful results. Uncertainties are assessed by parametric bootstrapping. We illustrate our method using data on body mass index. We conclude that this method provides a flexible way of estimating net transitions and that the use of net transitions has implications for model dynamics, for example when modeling interventions

KW - health

U2 - 10.1002/sim.4423

DO - 10.1002/sim.4423

M3 - Article

SN - 0277-6715

VL - 31

SP - 533

EP - 543

JO - Statistics in Medicine

JF - Statistics in Medicine

IS - 6

ER -