AUTOMATIC EXTRACTION OF QURANIC LEXIS REPRESENTING TWO DIFFERENT NOTIONS OF LINGUISTIC SALIENCE: KEYNESS AND PROSODIC PROMINENCE

被引:2
作者
Brierley, Claire [1 ]
Sawalha, Majdi [2 ]
Islam, Tajul [1 ]
Dickins, James [1 ]
Atwell, Eric [1 ]
机构
[1] Univ Leeds, Leeds, W Yorkshire, England
[2] Univ Jordan, Amman, Jordan
基金
英国工程与自然科学研究理事会;
关键词
STRESS; ENGLISH; INTONATION; DURATION;
D O I
10.1093/jss/fgy005
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This paper presents two sets of lexical items automatically extracted from the Arabic Qur'an, and denoting two different notions of linguistic salience: keyness and prosodic prominence. Our novel hypothesis investigates a possible correlation between them. Our novel findings discover distributionally significant keywords that also occur strategically in phrase-final position so as to maximise their prominence, and thus meaningfulness, for reader, reciter, and aural recipient. Our methodology first computes Quranic keywords via the Corpus Linguistics technique of Keyword Extraction, and maps them to major Quranic themes in Islamic scholarship. Next, we implement a bespoke algorithm for rule-based capture of words annotated with madd or prolongation, a specific type of prosodic highlighting in Quranic recitation rules or tajwid. We find it especially interesting that the concept of final syllable lengthening (madd before pause) is encoded in tajwid and effectively demarcates phrase boundaries in the Qur'an. We concentrate on nominal keywords (i.e. nouns and adjectives) since these are more likely to be aligned with phrase edges and to bear the hallmarks of pre-boundary lengthening. This correlation between keyness and prominence occurs 43.29% of the time in our data, since 526 keywords appear in our extracted subset of nominal types tagged with madd before pause: ((526/1215)*100). Finally, we identify which Quranic keywords are most likely to be annotated with enhanced prolongation in the final syllable before pause, using an easy-to-interpret, single value metric: the Laplace Point Estimate. Keywords that emerge as semantically weighted in terms of both distributional and prosodic significance are most likely to reflect the Quranic themes of God, Nature, and Eschatology.
引用
收藏
页码:407 / 456
页数:50
相关论文
共 67 条
  • [1] Abdul-Fattah A., 1989, TAJWID UL QURAN NEW
  • [2] Al-Dabba A., 1997, MINHAT DHI L JALAL F
  • [3] Al-Jilani H. A., 1992, SECRET SECRETS
  • [4] Al-Lahham M. S., 2007, AL MUJAM AL MUFAHRAS
  • [5] Al-Masiri K., 2002, AL JAMIFI QIRAAT AL
  • [6] Al-Sharif M. M., 2006, ASMA ALLAH AL HUSNA
  • [7] Alrabiah M., 2013, Proceedings of WACL'2 Second Workshop on Arabic Corpus Linguistics, P5
  • [8] [Anonymous], 2000, THE HOLY QURAN
  • [9] [Anonymous], 1997, SYSTEM
  • [10] [Anonymous], THESIS