Developing computer vision and machine learning strategies to unlock government-created records

被引:0
|
作者
Jansen, Greg [1 ]
Marciano, Richard [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
关键词
Computer vision; Machine learning; Artificial intelligence; 1950 US Census records; Sacramento; WWII Japanese American incarceration;
D O I
10.1007/s00146-025-02231-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper outlines the development of a proof-of-concept workflow using machine learning and computer vision techniques to unlock the data within digitized handwritten US Census forms from the 1950s. The 1950s US Census includes over 6.5 million page images and was only recently made available to the public on April 1, 2022, following a 72-year access restriction period. Our project uses computational treatments to assist researchers in their efforts to recover and preserve the history of the erased Sacramento Japantown. Sacramento once housed the fourth largest Japantown in the United States before experiencing WWII Japanese American Incarceration and the 1950s US Government program of urban renewal. The goal is to augment a researcher's work in selecting a subset of Census pages for further transcription and analysis. We demonstrate a workflow for extracting demographic information using computer vision for image segmentation, and machine learning for handwritten character recognition. The workflow consists of a computational filtering process for Census records and a user interface for page review. These computational techniques are suitable for other cities, states, and communities, and demonstrate new strategies to unlock vital demographic information. The approach highlights the potential benefits of computational techniques for the analysis of form-based historical records of the twentieth century that can have an impact on social justice.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Computer vision and machine learning for the detection and classification of pavement cracks
    Tello-Cifuentes, Lizette
    Marulanda, Johannio
    Thomson, Peter
    INGENIERIA Y COMPETITIVIDAD, 2023, 25 (02):
  • [22] Blood type classification using computer vision and machine learning
    Ferraz, Ana
    Brito, Jose Henrique
    Carvalho, Vitor
    Machado, Jose
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 (08): : 2029 - 2040
  • [23] Computer Vision and Machine Learning for Tuna and Salmon Meat Classification
    Medeiros, Erika Carlos
    Almeida, Leandro Maciel
    Teixeira Filho, Jose Gilson de Almeida
    INFORMATICS-BASEL, 2021, 8 (04):
  • [24] Overview: Computer Vision and Machine Learning for Microstructural Characterization and Analysis
    Elizabeth A. Holm
    Ryan Cohn
    Nan Gao
    Andrew R. Kitahara
    Thomas P. Matson
    Bo Lei
    Srujana Rao Yarasi
    Metallurgical and Materials Transactions A, 2020, 51 : 5985 - 5999
  • [25] Recognition of Explosive Objects Using Computer Vision and Machine Learning
    Mordyk, Oleksandr
    2022 IEEE OPEN CONFERENCE OF ELECTRICAL, ELECTRONIC AND INFORMATION SCIENCES (ESTREAM), 2022,
  • [27] Computer vision by unsupervised machine learning in seed drying process
    Pinheiro, Romario de Mesquita
    Gadotti, Gizele Ingrid
    Bernardy, Ruan
    Tim, Rafael Rico
    Pinto, Karine Von Ahn
    Buck, Graciela
    CIENCIA E AGROTECNOLOGIA, 2023, 47
  • [28] Discriminating rapeseed varieties using computer vision and machine learning
    Kurtulmus, F.
    Unal, H.
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (04) : 1880 - 1891
  • [29] Stress Monitoring with Computer Vision and Machine Learning for Software Employees
    Manikandan, N. K.
    Manivannan, D.
    Kavitha, M.
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE COMPUTING AND SMART SYSTEMS, ICSCSS 2024, 2024, : 1016 - 1021
  • [30] Ergonomic risk assessment based on computer vision and machine learning
    Massiris Fernandez, Manlio
    Alvaro Fernandez, J.
    Bajo, Juan M.
    Delrieux, Claudio A.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2020, 149