Natural Sciences Meet Social Sciences: Census Data Analytics for Detecting Home Language Shifts

被引:11
作者
Choy, Christian M. [1 ]
Co, M. Kiefer [1 ]
Fogel, Matthew J. [1 ]
Garrioch, Clarke D. [1 ]
Leung, Carson K. [1 ]
Martchenko, Ekaterina [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
来源
PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021) | 2021年
基金
加拿大自然科学与工程研究理事会;
关键词
information management; data science; data analytics; data mining; census data; language cohorts; allophones; mother tongue; language persistence; BIG DATA;
D O I
10.1109/IMCOM51814.2021.9377412
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As we are living in a global environment, it is not unusual to have more than one languages or dialects used in a country. Examples include Canada in the Americas, Singapore in Asia, and Switzerland in Europe. With the initiatives of globalization, many people immigrate or live in a country other than their birthplace. As a result, different people in the same country may have different home language (i.e., first language). For instance, as a nation composed of a highly diverse language population, Canada provides a unique opportunity to study the factors causing certain languages (or families of language) to be lost over subsequent generations among allophones (i.e., people whose mother tongue is neither English or French). In this paper, we focus on census data analytics. Specifically, we analyze census microdata by exploring machine learning and data mining techniques-such as decision tree induction, random forest, and categorical naive Bayes-to study the influence of various social and economic factors on the probability that allophones adopt official languages as their language spoken at home. This study is a showcase where natural sciences and engineering (NSE) meet social sciences, in which NSE solutions (e.g., census data analytics) are applicable for the study of social science related phenomena (e.g., successful detection of shifts in home languages).
引用
收藏
页数:8
相关论文
共 33 条
[11]   Finding Popular Friends in Social Networks [J].
Jiang, Fan ;
Leung, Carson Kai-Sang ;
Tanbeer, Syed K. .
SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, :501-508
[12]  
Joo Y, IMCOM 2020, P27
[13]  
Klosgen W., 2003, Intelligent Data Analysis, V7, P521
[14]   Emerging trends, issues and challenges in Internet of Things, Big Data and cloud computing [J].
Kobusinska, Anna ;
Leung, Carson ;
Hsu, Ching-Hsien ;
Raghavendra, S. ;
Chang, Victor .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 :416-419
[15]   The effect of native-language retention and insecurity on Asian Indian fertility in the United States [J].
Kohli, V .
JOURNAL OF SOCIAL PSYCHOLOGY, 1998, 138 (03) :358-367
[16]  
Leung Carson K., 2021, Big Data Analyses, Services, and Smart Data. Advances in Intelligent Systems and Computing (AISC 899), P28, DOI 10.1007/978-981-15-8731-3_3
[17]  
Leung C.K., IEEE SOCIALCOM 2010, P419
[18]  
Leung C.K, 2014, Frequent Pattern Mining, P417
[19]  
Leung CK, 2018, ENCYCLOPEDIA OF INFORMATION SCIENCE AND TECHNOLOGY, 4TH EDITION, P338, DOI 10.4018/978-1-5225-2255-3.ch030
[20]   A Machine Learning Approach for Stock Price Prediction [J].
Leung, Carson Kai-Sang ;
MacKinnon, Richard Kyle ;
Wang, Yang .
PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14), 2014, :274-277