Venue Classification of Research Papers in Scholarly Digital Libraries

被引:1
作者
Caragea, Cornelia [1 ]
Florescu, Corina [2 ]
机构
[1] Kansas State Univ, Comp Sci, Manhattan, KS 66502 USA
[2] Univ North Texas, Comp Sci & Engn, Denton, TX 76207 USA
来源
DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2018 | 2018年 / 11057卷
基金
美国国家科学基金会;
关键词
Text classification; Digital libraries; Venue classification;
D O I
10.1007/978-3-030-00066-0_11
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Open-access scholarly digital libraries crawl periodically a list of URLs in order to obtain appropriate collections of freely-available research papers. The metadata of the crawled papers, e.g., title, authors, and references, are automatically extracted before the papers are indexed in a digital library. The venue of publication is another important aspect about a scientific paper, which reflects its authoritativeness. However, the venue is not always readily available for a paper. Instead, it needs to be extracted from the references lists of other papers that cite the target paper. We explore a supervised learning approach to automatically classifying the venue of a research paper using information solely available from the content of the paper and show experimentally on a dataset of approximately 44,000 papers that this approach outperforms several baselines and prior work.
引用
收藏
页码:129 / 136
页数:8
相关论文
共 16 条
  • [1] [Anonymous], 2003, ICML
  • [2] [Anonymous], 2008, COLING 2008 P WORKSH, DOI DOI 10.3115/1613172.1613178
  • [3] Caragea Cornelia, 2014, Advances in Information Retrieval. 36th European Conference on IR Research, ECIR 2014. Proceedings: LNCS 8416, P311, DOI 10.1007/978-3-319-06028-6_26
  • [4] Caragea C., 2014, P 2014 C EMP METH NA, P1435, DOI [DOI 10.3115/V1/D14-1150, 10.3115/v1/D14-1150, 10.3115/v1/d14-1150]
  • [5] Caragea Cornelia., 2015, P 2015 C EMP METH AN, P2357
  • [6] Councill IG, 2008, SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, P661
  • [7] Das Gollapalli S, 2014, AAAI CONF ARTIF INTE, P1629
  • [8] PositionRank: An Unsupervised Approach to Keyphrase Extraction from Scholarly Documents
    Florescu, Corina
    Caragea, Cornelia
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1105 - 1115
  • [9] Giles C. L., 1998, Digital 98 Libraries. Third ACM Conference on Digital Libraries, P89, DOI 10.1145/276675.276685
  • [10] HaCohen-Kerner Yaakov, 2013, Advanced Data Mining and Applications. 9th International Conference, ADMA 2013. Proceedings: LNCS 8346, P529, DOI 10.1007/978-3-642-53914-5_45