Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration provides a timely solution by leveraging multiple data sources to provide more robust and efficient inference than using any single data source alone. The technique for data integration varies depending on types of samples and available information to be combined. This article provides a systematic review of data integration techniques for combining probability samples, probability and non-probability samples, and probability and big data samples. We discuss a wide range of integration methods such as generalized least squares, calibration weighting, inverse probability weighting, mass imputation, and doubly robust methods. Finally, we highlight important questions for future research.
机构:
Westat Corp, 1600 Res Blvd, Rockville, MD 20850 USAWestat Corp, 1600 Res Blvd, Rockville, MD 20850 USA
Lohr, Sharon L.
Raghunathan, Trivellore E.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Michigan, Sch Publ Hlth, Survey Res Ctr, Inst Social Res, Ann Arbor, MI 48106 USA
Univ Michigan, Sch Publ Hlth, Biostat, Ann Arbor, MI 48106 USAWestat Corp, 1600 Res Blvd, Rockville, MD 20850 USA
机构:
Univ Michigan, Ann Arbor, MI USA
Univ Maryland, College Pk, MD USA
Univ Michigan, 4620 N Pk Ave,Apt 1406W, Chevy Chase, MD 20815 USA
Univ Maryland, 4620 NPark Ave,Apt 1406W, Chevy Chase, MD 20815 USAUniv Michigan, Ann Arbor, MI USA