NLP and Large-Scale Information Retrieval on Mathematical Texts

Cited by: 1
Authors
Dong, Yihe [1]
Affiliation
[1] Wolfram Res, Champaign, IL 61820 USA
Source
MATHEMATICAL SOFTWARE - ICMS 2018 | 2018 / Vol. 10931
Keywords
NLP; Information retrieval;
DOI
10.1007/978-3-319-96418-8_19
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835
Abstract
We present a recommender system covering math and math physics papers from the arXiv, to assist researchers to quickly retrieve theorems and discover similar results from this vast corpus. The retrieval aims to discover not just syntactic, but also semantic similarity. We will discuss the challenges encountered and the experimental methodologies used.
Pages: 156-164
Page count: 9
Related papers
50 in total
  • [21] Self-adaptive approximate queries for large-scale information aggregation
    Brunner, Rene
    Freitag, Felix
    Navarro, Leandro
    Rana, Omer F.
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2012, 8 (03) : 225 - 247
  • [22] A cloud-based framework for large-scale traditional Chinese medical record retrieval
    Liu, Lijun
    Liu, Li
    Fu, Xiaodong
    Huang, Qingsong
    Zhang, Xianwen
    Zhang, Yin
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 : 21 - 33
  • [23] Efficient Large Scale NLP Feature Engineering with Apache Spark
    Esmaeilzadeh, Armin
    Heidari, Maryam
    Abdolazimi, Reyhaneh
    Hajibabaee, Parisa
    Malekzadeh, Masoud
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 274 - 280
  • [24] RELEVANCE JUDGMENTS EXCLUSIVE OF HUMAN ASSESSORS IN LARGE SCALE INFORMATION RETRIEVAL EVALUATION EXPERIMENTATION
    Rajagopal, Prabha
    Ravana, Devi
    Ismail, Maizatul Akmar
    MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2014, 27 (02) : 80 - 94
  • [25] A study on emotion based information retrieval system for Korean texts
    Kim, MG
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL I AND II, 1999, : 617 - 622
  • [26] Large-scale Heterogeneous Program Retrieval through Frequent Pattern Discovery and Feature Correlation Analysis
    Liu, Bo
    Wu, Liang
    Dong, Qiuxiang
    Zhou, Yuanchun
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 780 - +
  • [27] TwinBERT: Distilling Knowledge to Twin-Structured Compressed BERT Models for Large-Scale Retrieval
    Lu, Wenhao
    Jiao, Jian
    Zhang, Ruofei
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2645 - 2652
  • [28] Transformer-Encoder-Based Mathematical Information Retrieval
    Reusch, Anja
    Thiele, Maik
    Lehner, Wolfgang
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION (CLEF 2022), 2022, 13390 : 175 - 189
  • [29] Signal Phrase Extraction: A Gateway to Information Retrieval Improvement in Law Texts
    Van Der Veen, Michael
    Sidorova, Natalia
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 346 : 127 - 130
  • [30] Retrieval of Mathematical Information with Syntactic and Semantic Structure over Web
    Hussain, Sharaf
    Khoja, Shakeel
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (01) : 75 - 89