Large-Scale Uncertainty Management Systems: Learning and Exploiting Your Data

被引:0
作者
Babu, Shivnath [1 ]
Guha, Sudipto
Munagala, Kamesh [1 ]
机构
[1] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
来源
ACM SIGMOD/PODS 2009 CONFERENCE | 2009年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The database community has made rapid strides in capturing, representing, and querying uncertain data. Probabilistic databases capture the inherent uncertainty in derived tuples as probability estimates. Data acquisition and stream systems can produce succinct summaries of very large and time-varying datasets. This tutorial addresses the natural next step in harnessing uncertain data: How can we efficiently and quantifiably determine what, how, and how much to learn in order to make good decisions based on the imprecise information available. The material in this tutorial is drawn from a range of fields including database systems, control and information theory, operations research, convex optimization, and statistical learning. The focus of the tutorial is on the natural constraints that are imposed in a database context and the demands of imprecise information from an optimization point of view. We look both into the past as well as into the future; to discuss general tools and techniques that can serve as a guide to database researchers and practitioners, and to enumerate the challenges that lie ahead.
引用
收藏
页码:995 / 998
页数:4
相关论文
共 50 条
  • [41] Scalable management - Technologies for management of large-scale, distributed systems
    Adamst, R
    Brettt, P
    Lyer, S
    Milojicic, D
    Rafaeli, S
    Talwar, V
    [J]. ICAC 2005: SECOND INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING, PROCEEDINGS, 2005, : 159 - 170
  • [42] Electronic document management systems and distributed large-scale systems
    Orlov, V. L.
    Kurako, E. A.
    [J]. 2017 TENTH INTERNATIONAL CONFERENCE MANAGEMENT OF LARGE-SCALE SYSTEM DEVELOPMENT (MLSD), 2017,
  • [43] A Case Study of Data Management Challenges Presented in Large-Scale Machine Learning Workflows
    Lee, Claire Songhyun
    Hewes, V.
    Cerati, Giuseppe
    Kowalkowski, Jim
    Aurisano, Adam
    Agrawal, Ankit
    Choudhary, Alok
    Liao, Wei-keng
    [J]. 2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 71 - 81
  • [44] SubCollaboration: large-scale group management in collaborative learning
    Pardo, Abelardo
    Delgado Kloos, Carlos
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2011, 41 (04) : 449 - 465
  • [45] Deep learning for the large-scale cancer data analysis
    Tsuji, Shingo
    Aburatani, Hiroyuki
    [J]. CANCER RESEARCH, 2015, 75 (22)
  • [46] Large-Scale Embedding Learning in Heterogeneous Event Data
    Gui, Huan
    Liu, Jialu
    Tao, Fangbo
    Jiang, Meng
    Norick, Brandon
    Han, Jiawei
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 907 - 912
  • [47] Effective interpretable learning for large-scale categorical data
    Zhang, Yishuo
    Zaidi, Nayyar
    Zhou, Jiahui
    Wang, Tao
    Li, Gang
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 2223 - 2251
  • [48] Uncertainty-Aware Multiple Instance Learning from Large-Scale Long Time Series Data
    Zhu, Yuansheng
    Shi, Weishi
    Pandey, Deep Shankar
    Liu, Yang
    Que, Xiaofan
    Krutz, Daniel E.
    Yu, Qi
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 1772 - 1778
  • [49] Large-scale transfer learning for data-driven modelling of hot water systems
    Kazmi, Hussain
    Suykens, Johan
    Driesen, Johan
    [J]. PROCEEDINGS OF BUILDING SIMULATION 2019: 16TH CONFERENCE OF IBPSA, 2020, : 2611 - 2618
  • [50] Mesh data management in large-scale scientific computing
    Chen, Hong
    Zheng, Winmin
    [J]. PROCEEDINGS OF THE THIRD CHINAGRID ANNUAL CONFERENCE, 2008, : 144 - 152