Developing computer vision and machine learning strategies to unlock government-created records

Cited by: 0

Authors:

Jansen, Greg [1]

Marciano, Richard [1]

Affiliation:

[1] University of Maryland, College Park, MD 20742, USA

Source:

AI & SOCIETY | 2025

Keywords:

Computer vision; Machine learning; Artificial intelligence; 1950 US Census records; Sacramento; WWII Japanese American incarceration

DOI:

10.1007/s00146-025-02231-y

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper outlines the development of a proof-of-concept workflow using machine learning and computer vision techniques to unlock the data within digitized handwritten US Census forms from 1950. The 1950 US Census includes over 6.5 million page images and was only recently made available to the public on April 1, 2022, following a 72-year access restriction period. Our project uses computational treatments to assist researchers in their efforts to recover and preserve the history of the erased Sacramento Japantown. Sacramento once housed the fourth largest Japantown in the United States before experiencing WWII Japanese American Incarceration and the 1950s US Government program of urban renewal. The goal is to augment a researcher's work in selecting a subset of Census pages for further transcription and analysis. We demonstrate a workflow for extracting demographic information using computer vision for image segmentation, and machine learning for handwritten character recognition. The workflow consists of a computational filtering process for Census records and a user interface for page review. These computational techniques are suitable for other cities, states, and communities, and demonstrate new strategies to unlock vital demographic information. The approach highlights the potential benefits of computational techniques for the analysis of form-based historical records of the twentieth century that can have an impact on social justice.

Pages: 17
