Human Versus Machine Intelligence: Assessing Natural Language Generation Models Through Complex Systems Theory

Cited by: 1
Authors
De Santis, Enrico [1]
Martino, Alessio [2]
Rizzi, Antonello [1]
Affiliations
[1] Univ Roma La Sapienza, Dept Informat Engn Elect & Telecommun, I-00184 Rome, Italy
[2] LUISS Univ, Dept Business & Management, Rome, Italy
Keywords
Complexity theory; Writing; Correlation; Time series analysis; Fractals; Complex systems; Task analysis; Natural language generation; GPT models; multifractal analysis; recurrence quantification analysis; Zipf's law; quantitative linguistics; complexity science; text classification; LONG-RANGE CORRELATIONS; RECURRENCE PLOTS; NETWORKS; ORIGIN
DOI
10.1109/TPAMI.2024.3358168
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The introduction of Transformer architectures - with the self-attention mechanism - in automatic Natural Language Generation (NLG) is a breakthrough in solving general task-oriented problems, such as the simple production of long text excerpts that resemble ones written by humans. While the performance of GPT-X architectures is there for all to see, many efforts are underway to understand how these black boxes process information such that their output statistical distributions resemble those of natural language. In this work, through the complexity science framework, we offer a comparative study of the stochastic processes underlying texts produced by the English version of GPT-2 and texts produced by human beings, notably novels in English and programming code. The investigation, which is methodological in nature, consists first of an analysis phase in which Multifractal Detrended Fluctuation Analysis and Recurrence Quantification Analysis - together with Zipf's law and approximate entropy - are adopted to characterize long-term correlations, regularities and recurrences in human and machine-produced texts. Results show several peculiarities and trends in terms of long-range correlations and recurrences in the latter case. The synthesis phase, on the other hand, uses the complexity measures to build synthetic text descriptors - hence a suitable text embedding - which serve as the features fed to a machine learning system designed to perform feature selection through an evolutionary technique. Multivariate analysis then shows the grouping tendency of the three analyzed text types, which allows GPT-2 texts to be placed in between natural language texts and computer code. Similarly, the classification task demonstrates that, given the high accuracy obtained in the automatic discrimination of text classes, the proposed set of complexity measures is highly informative. These results add another piece to the theoretical understanding of the surprising results obtained by NLG systems based on deep learning and allow us to improve the design of new informetrics or text mining systems for text classification, fake news detection, or even plagiarism detection.
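As an illustration of two of the measures named in the abstract, the following minimal Python sketch estimates a Zipf exponent from word frequencies and computes approximate entropy on a numeric series. It is not the authors' implementation. The abstract does not state how text is mapped to a time series, so the word-length mapping used here, the function names, and the default parameters (m = 2, r = 0.2) are assumptions made purely for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase alphabetic words and apostrophes only.
    return re.findall(r"[a-z']+", text.lower())


def zipf_exponent(text):
    """Negated least-squares slope of log(frequency) against log(rank)."""
    freqs = sorted(Counter(tokenize(text)).values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct words")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return -cov / var


def word_length_series(text):
    # Assumed text-to-series mapping: one value per word, its length.
    return [len(w) for w in tokenize(text)]


def approximate_entropy(series, m=2, r=0.2):
    """Approximate entropy with tolerance r times the series' standard deviation."""
    n = len(series)
    if n < m + 2:
        raise ValueError("series too short for the chosen m")
    mean = sum(series) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in series) / n)
    tol = r * std

    def phi(k):
        windows = [series[i:i + k] for i in range(n - k + 1)]
        total = 0.0
        for w in windows:
            # Count windows within tol of w (Chebyshev distance), self included.
            matches = sum(
                1 for v in windows
                if max(abs(a - b) for a, b in zip(w, v)) <= tol
            )
            total += math.log(matches / len(windows))
        return total / len(windows)

    return phi(m) - phi(m + 1)
```

A strictly periodic series yields an approximate entropy close to zero, while irregular series yield larger values; a corpus whose word frequencies fall off as 1/rank yields a Zipf exponent close to 1. Values like these could serve as entries in a per-text feature vector, in the spirit of the descriptors the abstract mentions.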
Pages: 4812-4829
Number of pages: 18