Proactive Institutional Repository Collection Development Techniques: Archiving Gold Open Access Articles and Metadata Retrieved with Web Scraping

被引:1
|
作者
Clark, Brian [1 ]
机构
[1] Univ Lib, Univ Alabama, Syst & Tech Proc Librarian, Tuscaloosa, AL 35487 USA
基金
美国国家卫生研究院;
关键词
Collection development; institutional repositories; open access; !text type='Python']Python[!/text; scholarly communication; web scraping; MOTIVATIONS;
D O I
10.1080/01930826.2023.2240190
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Many institutions face low deposit rates with their institutional repositories despite investing substantial resources in implementing and supporting these systems. Deposit rates are higher in IRs that offer mediated deposits; however, this can be a time and labor intensive process. This article describes a method for copying open access articles and corresponding descriptive metadata from open repositories for archiving in an institutional repository using Beautiful Soup and Selenium as web scraping tools. This method quickly added hundreds of articles to an IR without relying on faculty participation or consulting publisher policies, increasing repository downloads and usage.
引用
收藏
页码:743 / 765
页数:23
相关论文
empty
未找到相关数据