TY - JOUR
T1 - Neural Generative Models and the Parallel Architecture of Language
T2 - A Critical Review and Outlook
AU - Rambelli, Giulia
AU - Chersoni, Emmanuele
AU - Testa, Davide
AU - Blache, Philippe
AU - Lenci, Alessandro
N1 - Publisher Copyright:
© 2024 The Authors. Topics in Cognitive Science published by Wiley Periodicals LLC on behalf of Cognitive Science Society.
PY - 2024/4/18
Y1 - 2024/4/18
N2 - According to the parallel architecture, syntactic and semantic information processing are two separate streams that interact selectively during language comprehension. While considerable effort is put into psycho- and neurolinguistics to understand the interchange of processing mechanisms in human comprehension, the nature of this interaction in recent neural Large Language Models remains elusive. In this article, we revisit influential linguistic and behavioral experiments and evaluate the ability of a large language model, GPT-3, to perform these tasks. The model can solve semantic tasks autonomously from syntactic realization in a manner that resembles human behavior. However, the outcomes present a complex and variegated picture, leaving open the question of how Language Models could learn structured conceptual representations.
AB - According to the parallel architecture, syntactic and semantic information processing are two separate streams that interact selectively during language comprehension. While considerable effort is put into psycho- and neurolinguistics to understand the interchange of processing mechanisms in human comprehension, the nature of this interaction in recent neural Large Language Models remains elusive. In this article, we revisit influential linguistic and behavioral experiments and evaluate the ability of a large language model, GPT-3, to perform these tasks. The model can solve semantic tasks autonomously from syntactic realization in a manner that resembles human behavior. However, the outcomes present a complex and variegated picture, leaving open the question of how Language Models could learn structured conceptual representations.
KW - Neural large language models
KW - Statistical learning
KW - Parallel architecture
KW - Syntax-semantics interface
KW - GPT-3 prompting
KW - Enriched composition
KW - Semantic composition
UR - http://www.scopus.com/inward/record.url?scp=85191004644&partnerID=8YFLogxK
U2 - 10.1111/tops.12733
DO - 10.1111/tops.12733
M3 - Journal article
SN - 1756-8757
JO - Topics in Cognitive Science
JF - Topics in Cognitive Science
ER -