Developing computer vision and machine learning strategies to unlock government-created records

Authors
Jansen, Greg [1]
Marciano, Richard [1]
Affiliations
[1] Univ Maryland, College Pk, MD 20742 USA
Keywords
Computer vision; Machine learning; Artificial intelligence; 1950 US Census records; Sacramento; WWII Japanese American incarceration
DOI
10.1007/s00146-025-02231-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper outlines the development of a proof-of-concept workflow that uses machine learning and computer vision techniques to unlock the data within digitized handwritten forms from the 1950 US Census. The 1950 US Census includes over 6.5 million page images and was made available to the public only on April 1, 2022, following a 72-year access restriction period. Our project uses computational treatments to assist researchers in their efforts to recover and preserve the history of the erased Sacramento Japantown. Sacramento once housed the fourth largest Japantown in the United States, before WWII Japanese American incarceration and the US Government's urban renewal program of the 1950s. The goal is to augment a researcher's work in selecting a subset of Census pages for further transcription and analysis. We demonstrate a workflow for extracting demographic information using computer vision for image segmentation and machine learning for handwritten character recognition. The workflow consists of a computational filtering process for Census records and a user interface for page review. These computational techniques are suitable for other cities, states, and communities, and demonstrate new strategies to unlock vital demographic information. The approach highlights the potential benefits of computational techniques for analyzing form-based historical records of the twentieth century, work that can have an impact on social justice.
Pages: 17