Towards Layered Events and Schema Representations in Long Documents

Authors
Hatzel, Hans Ole [1]
Biemann, Chris [1]
Affiliations
[1] Univ Hamburg, Language Technol Grp, Hamburg, Germany
Keywords
EXTRACTION
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this thesis proposal, we explore event extraction and event representation on literary texts. Due to its variety of genres and varying document length, literature is a challenging domain, yet the representation of literary content has received relatively little attention. As most individual events contribute little to the overall semantics of literary documents, we model events at different granularities. On the conceptual level, we adapt the previous definition of schemas as sequences of events, all describing a single process connected through shared participants, and extend the notion to allow modeling a document's content using sequences of schemas. Technically, the segmentation of event sequences into schemas is approached by modeling such sequences, making use of the narrative cloze task, which is the prediction of masked events in event sequence contexts. We propose building on sequences of event embeddings to form schema representations, thereby summarizing sections of documents using a fixed-size representation. This approach will give rise to comparisons of sections such as chapters up to the comparison of entire literary works on the level of their schema structure, paving the way to a computational approach to quantitative literary research.
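The idea of summarizing a document section as a fixed-size vector built from a sequence of event embeddings can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say how the embeddings are combined, so simple mean pooling stands in for that step, the three-dimensional vectors are toy values rather than learned event embeddings, and the names `mean_pool` and `cosine` are hypothetical.

```python
from math import sqrt


def mean_pool(event_vectors):
    """Average a non-empty list of equal-length event vectors into one vector.

    Mean pooling is an assumed stand-in; the abstract does not specify
    how event embeddings are aggregated into a schema representation.
    """
    n = len(event_vectors)
    dim = len(event_vectors[0])
    return [sum(vec[i] for vec in event_vectors) / n for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy event embeddings for three sections (illustrative values only).
chapter_a = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
chapter_b = [[1.0, 1.0, 0.0]]
chapter_c = [[0.0, 0.0, 1.0]]

# Each section is reduced to one vector of the same size,
# regardless of how many events it contains.
rep_a = mean_pool(chapter_a)
rep_b = mean_pool(chapter_b)
rep_c = mean_pool(chapter_c)

# Sections can then be compared directly at the representation level.
sim_ab = cosine(rep_a, rep_b)
sim_ac = cosine(rep_a, rep_c)
```

Because every section maps to a vector of the same dimensionality, sections of different lengths become directly comparable, which is the property the abstract relies on for comparing chapters or entire works.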
Pages: 32-39
Page count: 8