Common Data Model for Healthcare data

被引:0
作者
Khan, Umair M. [1 ]
Kothari, Huzaifa [1 ]
Kuchekar, Aditya [1 ]
Koshy, Reeta [1 ]
机构
[1] Bharatiya Vidya Bhavans Sardar Patel Inst Technol, Dept Comp Engn, Bombay, Maharashtra, India
来源
2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT) | 2018年
关键词
Common Data Model; ETL Extract Transform Load; Data Warehouse; Columnar database; Mapping file;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
There is a huge volume of data made openly available to the masses on Government data sites such as the www.data.gov.in. However, it is very difficult to make sense between data from two different data sets or different domains. As a result a lot of time is spent in just collecting and linking this data from various sources before it can be put to use. A potential solution to this problem is using a Common Data Model. A Common Data Model is a data silo or warehouse which can accommodate any data from any source in one pre-designed format. In this paper we discuss an ETL process that allows users to remodel and store data, and a proposed schema for a Common Data Model which will house this transformed data under one roof. We also perform our own analysis on the proposed data model for a particular use-case; state-wise statistics of data related to pregnancy in India. Thus we prove that the common data model is able to integrate data from different sources and facilitate cross domain analysis and linking of data.
引用
收藏
页数:5
相关论文
共 11 条
  • [1] Column-oriented Database Systems
    Abadi, Daniel J.
    Boncz, Peter A.
    Harizopoulos, Stavros
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (02): : 1664 - 1665
  • [2] Towards a Semantic Extract-Transform-Load (ETL) framework for Big Data Integration
    Bansal, Srividya K.
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 521 - 528
  • [3] Data Wrangling: Making data useful again
    Ender, Florian
    Piringer, Harald
    [J]. IFAC PAPERSONLINE, 2015, 48 (01): : 111 - +
  • [4] Furche Tim, 2016, EDBT, V16, P473
  • [5] Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers
    Hripcsak, George
    Duke, Jon D.
    Shah, Nigam H.
    Reich, Christian G.
    Huser, Vojtech
    Schuemie, Martijn J.
    Suchard, Marc A.
    Park, Rae Woong
    Wong, Ian Chi Kei
    Rijnbeek, Peter R.
    van der Lei, Johan
    Pratt, Nicole
    Noren, G. Niklas
    Li, Yu-Chuan
    Stang, Paul E.
    Madigan, David
    Ryan, Patrick B.
    [J]. MEDINFO 2015: EHEALTH-ENABLED HEALTH, 2015, 216 : 574 - 578
  • [6] Research directions in data wrangling: Visualizations and transformations for usable and credible data
    Kandel, Sean
    Heer, Jeffrey
    Plaisant, Catherine
    Kennedy, Jessie
    van Ham, Frank
    Riche, Nathalie Henry
    Weaver, Chris
    Lee, Bongshin
    Brodbeck, Dominique
    Buono, Paolo
    [J]. INFORMATION VISUALIZATION, 2011, 10 (04) : 271 - 288
  • [7] Dynamic-ETL: a hybrid approach for health data extraction, transformation and loading
    Ong, Toan C.
    Kahn, Michael G.
    Kwan, Bethany M.
    Yamashita, Traci
    Brandt, Elias
    Hosokawa, Patrick
    Uhrich, Chris
    Schilling, Lisa M.
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2017, 17
  • [8] Research on Extract, Transform and Load(ETL) in Land and Resources Star Schema Data Warehouse
    Qin Hanlin
    Jin Xianzhen
    Zhang Xianrong
    [J]. 2012 FIFTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2012), VOL 1, 2012, : 120 - 123
  • [9] Shawver Matthew A., 2007, 2007 IEEE Aerospace Conference, P1, DOI 10.1109/AERO.2007.352920
  • [10] Stonebraker Mike., 2005, VLDB'05