Evaluating a programming topic using GitHub data: what we can learn about machine learning

被引:1
作者
Dello Vicario, Paolo [1 ]
Tortolini, Valentina [1 ]
机构
[1] Univ Tuscia, Dipartimento Econ Ingn Soc & Impresa, Viterbo, Italy
关键词
Web mining; Web search and information extraction; Applications of web mining and searching; GitHub; Machine learning; Network analysis; Mining software repositories; Software engineering; Open source;
D O I
10.1108/IJWIS-11-2020-0072
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose The purpose of this paper is to define a methodology to analyze links between programming topics and libraries starting from GitHub data. Design/methodology/approach This paper developed an analysis over machine learning repositories on GitHub, finding communities of repositories and studying the anatomy of collaboration around a popular topic such as machine learning. Findings This analysis indicates the significant importance of programming languages and technologies such as Python and Jupyter Notebook. It also shows the rise of deep learning and of specific libraries such as Tensorflow from Google. Originality/value There exists no survey or analysis based on how developers influence each other for specific topics. Other researchers focused their analysis on the collaborative structure and social impact instead of topic impact. Using this methodology to analyze programming topics is important not just for machine learning but also for other topics.
引用
收藏
页码:54 / 64
页数:11
相关论文
共 50 条
  • [41] You are what you click: using machine learning to model trace data for psychometric measurement
    Landers, Richard. N.
    Auer, Elena. M.
    Mersy, Gabriel
    Marin, Sebastian
    Blaik, Jason
    INTERNATIONAL JOURNAL OF TESTING, 2022, 22 (3-4) : 243 - 263
  • [42] Can machine learning make naturalism about health truly naturalistic? A reflection on a data-driven concept of health
    Guersenzvaig, Ariel
    ETHICS AND INFORMATION TECHNOLOGY, 2024, 26 (01)
  • [43] Evaluating Foehn Occurrence in a Changing Climate Based on Reanalysis and Climate Model Data Using Machine Learning
    Mony, Christoph
    Jansing, Lukas
    Sprenger, Michael
    WEATHER AND FORECASTING, 2021, 36 (06) : 2039 - 2055
  • [44] A Physics-Informed, Machine Learning Emulator of a 2D Surface Water Model: What Temporal Networks and Simulation-Based Inference Can Help Us Learn about Hydrologic Processes
    Maxwell, Reed M.
    Condon, Laura E.
    Melchior, Peter
    WATER, 2021, 13 (24)
  • [45] Comparisons of Machine Learning Algorithms for Driving Behavior Recognition Using In-Vehicle CAN Bus Data
    Chen, Wen-Hui
    Lin, Yu-Chen
    Chen, Wei-Hao
    2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, : 268 - 273
  • [46] Evaluating the sales potential of new products using machine learning techniques and data collected from mobile applications
    Sleiman, Rita
    Nguyen, Quoc-Thong
    Lacaze, Sandra
    Tran, Kim-Phuc
    Thomassey, Sebastien
    INTERNATIONAL JOURNAL OF CLOTHING SCIENCE AND TECHNOLOGY, 2024, 36 (03) : 454 - 473
  • [47] What We Are Missing: Using Machine Learning Models to Predict Vitamin C Deficiency in Patients with Metabolic and Bariatric Surgery
    Julie M. Parrott
    Austen J. Parrott
    Armaun D. Rouhi
    J. Scott Parrott
    Kristoffel R. Dumon
    Obesity Surgery, 2023, 33 : 1710 - 1719
  • [48] Evaluating soil moisture content under maize coverage using UAV multimodal data by machine learning algorithms
    Zhang, Yu
    Han, Wenting
    Zhang, Huihui
    Niu, Xiaotao
    Shao, Guomin
    JOURNAL OF HYDROLOGY, 2023, 617
  • [49] What We Are Missing: Using Machine Learning Models to Predict Vitamin C Deficiency in Patients with Metabolic and Bariatric Surgery
    Parrott, Julie M. M.
    Parrott, Austen J. J.
    Rouhi, Armaun D. D.
    Parrott, J. Scott
    Dumon, Kristoffel R. R.
    OBESITY SURGERY, 2023, 33 (06) : 1710 - 1719
  • [50] Accuracy or novelty: what can we gain from target-specific machine-learning-based scoring functions in virtual screening?
    Shen, Chao
    Weng, Gaoqi
    Zhang, Xujun
    Leung, Elaine Lai-Han
    Yao, Xiaojun
    Pang, Jinping
    Chai, Xin
    Li, Dan
    Wang, Ercheng
    Cao, Dongsheng
    Hou, Tingjun
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)