
Two-phase methodology for the combined analysis of secondary data sources

Description

Administrative data from statutory health insurance providers are a valuable source for pharmacoepidemiological studies and health services research. The validity of studies based on these data may be compromised by missing information on potentially important variables such as Body Mass Index (BMI) or smoking behaviour. If additional or more precise information can be obtained from other data sources for a subsample of persons, two-phase methodology can be used to analyze the complete information in the subsample (phase 2) in combination with the partial information available for all persons (phase 1). Whether the resulting risk estimates are unbiased and efficient depends on how the phase 1 information is incorporated.
The aim of this project is to investigate the applicability of two-phase methods for the combined analysis of secondary data and to develop recommendations for the planning and conduct of such studies. In addition, software will be provided to simplify the application of two-phase methodology.
A study using insurance claims data and additional data from the disease management program for diabetes mellitus will serve as an example. Methodological challenges arise from the multiplicity of the phase 1 data and from the selectivity of persons included in the disease management program.
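
The two-phase idea described in this section can be illustrated with a minimal sketch on simulated data. Everything in it is an assumption made for illustration: the variable names, the effect sizes, the sampling probabilities, and the choice of inverse probability weighting, which is only one of several ways to include phase 1 information and is not necessarily the method developed in this project. Phase 1 holds exposure and outcome for all persons; the confounder (here: smoking) is observed only in a phase 2 subsample drawn with known probabilities that depend on exposure and outcome.

```python
import numpy as np


def fit_weighted_logit(X, y, w, n_iter=25):
    """Weighted logistic regression fitted by Newton-Raphson; returns coefficients."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        grad = X.T @ (w * (y - p))                      # weighted score
        hess = (X * (w * p * (1 - p))[:, None]).T @ X   # weighted information
        beta = beta + np.linalg.solve(hess, grad)
    return beta


rng = np.random.default_rng(0)
n = 200_000

# Simulated cohort (hypothetical values): smoking confounds exposure and outcome.
smoking = rng.binomial(1, 0.3, n)
exposure = rng.binomial(1, np.where(smoking == 1, 0.5, 0.2))
linear_pred = -3.0 + 0.5 * exposure + 1.0 * smoking    # true log odds ratio of exposure: 0.5
outcome = rng.binomial(1, 1.0 / (1.0 + np.exp(-linear_pred)))

# Phase 2: subsample stratified on (exposure, outcome) with known sampling probabilities.
strata = 2 * exposure + outcome
sampling_prob = np.array([0.02, 0.5, 0.05, 0.5])[strata]
in_phase2 = rng.random(n) < sampling_prob

# Phase 1 only: smoking is unavailable, so the exposure effect is confounded.
X1 = np.column_stack([np.ones(n), exposure])
beta_naive = fit_weighted_logit(X1, outcome, np.ones(n))

# Phase 2 with inverse probability weights: smoking is available in the subsample,
# and the weights account for the outcome-dependent selection into phase 2.
X2 = np.column_stack([np.ones(n), exposure, smoking])[in_phase2]
weights = 1.0 / sampling_prob[in_phase2]
beta_ipw = fit_weighted_logit(X2, outcome[in_phase2], weights)

print("log OR of exposure, phase 1 only:    ", round(beta_naive[1], 3))
print("log OR of exposure, weighted phase 2:", round(beta_ipw[1], 3))
```

In this simulation the phase 1 estimate is biased away from the true value of 0.5 because smoking is omitted, while the weighted phase 2 estimate should lie close to it. The weighted estimator uses phase 1 information only through the sampling probabilities; other two-phase estimators use it more fully and are generally more efficient.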

Funding period

Begin: May 2012
End: June 2015

Contact

Sigrid Behr

Sponsor

  • Deutsche Forschungsgemeinschaft (DFG)