NLP and Large-Scale Information Retrieval on Mathematical Texts

被引:1
|
作者
Dong, Yihe [1 ]
机构
[1] Wolfram Res, Champaign, IL 61820 USA
来源
MATHEMATICAL SOFTWARE - ICMS 2018 | 2018年 / 10931卷
关键词
NLP; Information retrieval;
D O I
10.1007/978-3-319-96418-8_19
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present a recommender system covering math and math physics papers from the arXiv, to assist researchers to quickly retrieve theorems and discover similar results from this vast corpus. The retrieval aims to discover not just syntactic, but also semantic similarity. We will discuss the challenges encountered and the experimental methodologies used.
引用
收藏
页码:156 / 164
页数:9
相关论文
共 50 条
  • [41] MathUSE: Mathematical information retrieval system using universal sentence encoder model
    Dadure, Pankaj
    Pakray, Partha
    Bandyopadhyay, Sivaji
    JOURNAL OF INFORMATION SCIENCE, 2024, 50 (01) : 66 - 84
  • [42] Large-scale Bayesian logistic regression for text categorization
    Genkin, Alexander
    Lewis, David D.
    Madigan, David
    TECHNOMETRICS, 2007, 49 (03) : 291 - 304
  • [43] Utility of Large-Scale Recipe Data in Food Computing
    Kale, Maija
    Agbozo, Ebenezer
    BALTIC JOURNAL OF MODERN COMPUTING, 2021, 9 (02): : 155 - 165
  • [44] LASH: Large-Scale Academic Deep Semantic Hashing
    Guo, Jia-Nan
    Mao, Xian-Ling
    Lan, Tian
    Tu, Rong-Xin
    Wei, Wei
    Huang, Heyan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1734 - 1746
  • [45] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
  • [46] PIRAT: A Personalized Information Retrieval System in Arabic Texts Based on a Hybrid Representation of a User Profile
    Safi, Houssem
    Jaoua, Maher
    Belguith, Lamia Hadrich
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2016, 2016, 9612 : 326 - 334
  • [47] Improving large-scale search engines with semantic annotations
    Fuentes-Lorenzo, Damaris
    Fernandez, Norberto
    Fisteus, Jesus A.
    Sanchez, Luis
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (06) : 2287 - 2296
  • [48] Periscoping: Private Key Distribution for Large-Scale Mixnets
    Liu, Shuhao
    Chen, Li
    Fu, Yuanzhong
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 681 - 690
  • [49] Problems on large-scale speech corpus and the applications in TTS
    Zhang S.
    Liu L.
    Diao L.-H.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (04): : 687 - 696
  • [50] Multi-Style Language Model for Web Scale Information Retrieval
    Wang, Kuansan
    Li, Xiaolong
    Gao, Jianfeng
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 467 - 474