Politics of data reuse in machine learning systems: Theorizing reuse entanglements

Cited by: 25
Authors
Thylstrup, Nanna Bonde [1]
Hansen, Kristian Bondo [1]
Flyverbom, Mikkel [1]
Amoore, Louise [2]
Affiliations
[1] Copenhagen Business School, Department of Management, Society and Communication, Frederiksberg, Denmark
[2] Durham University, Department of Geography, Durham, England
Keywords
Data reuse; machine learning; ethics; entanglements; datasets; algorithms; knowledge; matter
DOI
10.1177/20539517221139785
Chinese Library Classification
C [Social Sciences, General]
Discipline classification code
03; 0303
Abstract
Policy discussions and corporate strategies on machine learning are increasingly championing data reuse as a key element in digital transformations. These aspirations are often coupled with a focus on responsibility, ethics and transparency, as well as emergent forms of regulation that seek to set demands for corporate conduct and the protection of civic rights. Protective measures include methods of traceability and assessments of 'good' and 'bad' datasets and algorithms that are considered to be traceable, stable and contained. However, these ways of thinking about both technology and ethics obscure a fundamental issue, namely that machine learning systems entangle data, algorithms and more-than-human environments in ways that challenge a well-defined separation. This article investigates the fundamental fallacy of most data reuse strategies, as well as of their regulation and mitigation strategies: that data can somehow be followed, contained and controlled in machine learning processes. Instead, the article argues that we need to understand the reuse of data as an inherently entangled phenomenon. To examine this tension between the discursive regimes and the realities of data reuse, we advance the notion of reuse entanglements as an analytical lens. The main contribution of the article is the conceptualization of reuse that places entanglements at its core and the articulation of its relevance using empirical illustrations. This is important, we argue, for our understanding of the nature of data and algorithms, for the practical uses of data and algorithms, and for our attitudes regarding ethics, responsibility and regulation.
Pages: 10