Machine learning and the politics of synthetic data

被引:29
|
作者
Jacobsen, Benjamin N. [1 ]
机构
[1] Univ Durham, Dept Geog, South Rd, Durham DH1 3LE, England
基金
欧洲研究理事会;
关键词
Machine learning; data; algorithms; risk; ethics; variability;
D O I
10.1177/20539517221145372
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
Machine-learning algorithms have become deeply embedded in contemporary society. As such, ample attention has been paid to the contents, biases, and underlying assumptions of the training datasets that many algorithmic models are trained on. Yet, what happens when algorithms are trained on data that are not real, but instead data that are 'synthetic', not referring to real persons, objects, or events? Increasingly, synthetic data are being incorporated into the training of machine-learning algorithms for use in various societal domains. There is currently little understanding, however, of the role played by and the ethicopolitical implications of synthetic training data for machine-learning algorithms. In this article, I explore the politics of synthetic data through two central aspects: first, synthetic data promise to emerge as a rich source of exposure to variability for the algorithm. Second, the paper explores how synthetic data promise to place algorithms beyond the realm of risk. I propose that an analysis of these two areas will help us better understand the ways in which machine-learning algorithms are envisioned in the light of synthetic data, but also how synthetic training data actively reconfigure the conditions of possibility for machine learning in contemporary society.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Machine learning-based blood pressure estimation using impedance cardiography data
    Bothe, T. L.
    Patzak, A.
    Opatz, O. S.
    Heinz, V.
    Pilz, N.
    ACTA PHYSIOLOGICA, 2025, 241 (02)
  • [42] An improvement of snow/cloud discrimination from machine learning using geostationary satellite data
    Jin, Donghyun
    Lee, Kyeong-Sang
    Choi, Sungwon
    Seong, Noh-Hun
    Jung, Daeseong
    Sim, Suyoung
    Woo, Jongho
    Jeon, Uujin
    Byeon, Yugyeong
    Han, Kyung-Soo
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2022, 15 (01) : 2355 - 2375
  • [43] A review of synthetic and augmented training data for machine learning in ultrasonic non-destructive evaluation
    Sebastian, Uhlig
    Ilkin, Alkhasli
    Frank, Schubert
    Constanze, Tschoepe
    Matthias, Wolff
    ULTRASONICS, 2023, 134
  • [44] Generating Synthetic Sensor Data to Facilitate Machine Learning Paradigm for Prediction of Building Fire Hazard
    Wai Cheong Tam
    Eugene Yujun Fu
    Richard Peacock
    Paul Reneke
    Jun Wang
    Jiajia Li
    Thomas Cleary
    Fire Technology, 2023, 59 : 3027 - 3048
  • [45] Machine Learning Methods for Disease Prediction with Claims Data
    Christensen, Tanner
    Frandsen, Abraham
    Glazier, Seth
    Humpherys, Jeffrey
    Kartchner, David
    2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2018, : 467 - 471
  • [46] Big Data Platform Configuration Using Machine Learning
    Yeh, Chao-Chun
    Lu, Han-Lin
    Zhou, Jiazheng
    Chang, Sheng-An
    Lin, Xuan-Yi
    Sun, Yi-Chiao
    Huang, Shih-Kun
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (03) : 469 - 493
  • [47] Big data and machine learning to tackle diabetes management
    Pina, Ana F.
    Meneses, Maria Joao
    Sousa-Lima, Ines
    Henriques, Roberto
    Raposo, Joao F.
    Macedo, Maria Paula
    EUROPEAN JOURNAL OF CLINICAL INVESTIGATION, 2023, 53 (01)
  • [48] Weighted Machine Learning for Spatial-Temporal Data
    Hashemi, Mahdi
    Karimi, Hassan A.
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 3066 - 3082
  • [49] Big data, machine learning and uncertainty in foresight studies
    Muraro, Vinicius
    Salles-Filho, Sergio
    FORESIGHT, 2024, 26 (03): : 436 - 452
  • [50] Machine learning with multimodal data for COVID-19
    Chen, Weijie
    Sa, Rui C.
    Bai, Yuntong
    Napel, Sandy
    Gevaert, Olivier
    Lauderdale, Diane S.
    Giger, Maryellen L.
    HELIYON, 2023, 9 (07)