Developing computer vision and machine learning strategies to unlock government-created records

被引：0

作者：

Jansen, Greg ^{[1
]}

Marciano, Richard ^{[1
]}

机构：

[1] Univ Maryland, College Pk, MD 20742 USA

来源：

AI & SOCIETY | 2025年

关键词：

Computer vision; Machine learning; Artificial intelligence; 1950 US Census records; Sacramento; WWII Japanese American incarceration;

D O I：

10.1007/s00146-025-02231-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper outlines the development of a proof-of-concept workflow using machine learning and computer vision techniques to unlock the data within digitized handwritten US Census forms from the 1950s. The 1950s US Census includes over 6.5 million page images and was only recently made available to the public on April 1, 2022, following a 72-year access restriction period. Our project uses computational treatments to assist researchers in their efforts to recover and preserve the history of the erased Sacramento Japantown. Sacramento once housed the fourth largest Japantown in the United States before experiencing WWII Japanese American Incarceration and the 1950s US Government program of urban renewal. The goal is to augment a researcher's work in selecting a subset of Census pages for further transcription and analysis. We demonstrate a workflow for extracting demographic information using computer vision for image segmentation, and machine learning for handwritten character recognition. The workflow consists of a computational filtering process for Census records and a user interface for page review. These computational techniques are suitable for other cities, states, and communities, and demonstrate new strategies to unlock vital demographic information. The approach highlights the potential benefits of computational techniques for the analysis of form-based historical records of the twentieth century that can have an impact on social justice.

引用

页数：17

共 50 条

[1] Sifting US Census Records with Computer Vision and Machine Learning
Jansen, Gregory N.
Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024, 2024, : 2431 - 2439
[2] Machine Learning in Computer Vision
Khan, Asharul Islam
Al-Habsi, Salim
INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1444 - 1451
[3] Machine learning in computer vision
Esposito, F
Malerba, D
APPLIED ARTIFICIAL INTELLIGENCE, 2001, 15 (08) : 693 - 705
[4] Efficient object tracking through machine learning and optimization strategies in computer vision
Thakur, Uma
Thakur, Reena
Somkuwar, Sahil
Shendare, Abhay
Ingole, Rita
Verma, Achal
Tidke, Swejal
JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2024, 27 (02) : 327 - 336
[5] Machine Learning in Computer Vision: A Review
Khan, Abdullah Ayub
Laghari, Asif Ali
Awan, Shafique Ahmed
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2021, 8 (32): : 1 - 11
[6] Computer vision and machine learning for phenotyping
Steibel, J.
JOURNAL OF DAIRY SCIENCE, 2022, 105 : 98 - 99
[7] International conference on computer vision and machine learning
Rambabu, Sri Kommareddy
Journal of Physics: Conference Series, 2019, 1228 (01)
[8] Computer vision and machine learning to quantify microstructure
Holm, Elizabeth A.
Cohn, Ryan
Gao, Nan
Kitahara, Andrew R.
Lei, Bo
Yarasi, Srujana Rao
Matson, Thomas P.
Advanced Materials and Processes, 2021, 179 (02): : 13 - 18
[9] ADVANCED MACHINE LEARNING TECHNIQUES FOR COMPUTER VISION
MOSCATELLI, S
KODRATOFF, Y
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE, 1992, 617 : 161 - 197
[10] Computer vision and machine learning in science fiction
Murphy, Robin R.
SCIENCE ROBOTICS, 2019, 4 (30)

← 1 2 3 4 5 →