A Processing-Oriented Investigation of Inflectional Complexity

Due to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we...

Full description

Bibliographic Details
Main Authors: Claudia Marzi, Marcello Ferro, Vito Pirrelli
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-09-01
Series:Frontiers in Communication
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fcomm.2019.00048/full
id doaj-bdf4bb0557db4f6a8dc0be5bb5535815
record_format Article
spelling doaj-bdf4bb0557db4f6a8dc0be5bb55358152020-11-25T02:36:29ZengFrontiers Media S.A.Frontiers in Communication2297-900X2019-09-01410.3389/fcomm.2019.00048441828A Processing-Oriented Investigation of Inflectional ComplexityClaudia MarziMarcello FerroVito PirrelliDue to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we propose to investigate the issue from a processing-oriented standpoint, using data processed by a type of recurrent neural network to quantitatively model the dynamic of word processing and learning in different input conditions. We evaluate the relative complexity of a set of typologically different inflectional systems (Greek, Italian, Spanish, German, English and Standard Modern Arabic) by training a Temporal Self-Organizing Map (TSOM), a recurrent variant of Kohonen's Self-Organizing Maps, on a fixed set of verb forms from top-frequency verb paradigms, with no information about the morphosemantic and morphosyntactic content conveyed by the forms. After training, the behavior of each language-specific TSOM is assessed on different tasks, looking at self-organizing patterns of temporal connectivity and functional responses. Our simulations show that word processing is facilitated by maximally contrastive inflectional systems, where verb forms exhibit the earliest possible point of lexical discrimination. Conversely, word learning is favored by a maximally generalizable system, where forms are inferred from the smallest possible number of their paradigm companions. Based on evidence from the literature and our own data, we conjecture that the resulting balance is the outcome of the interaction between form frequency and morphological regularity. Big families of stem-sharing, regularly inflected forms are the productive core of an inflectional system. Such a core is easier to learn but slower to discriminate. In contrast, less predictable verb forms, based on alternating and possibly suppletive stems, are easier to process but are learned by rote. Inflection systems thus strike a balance between these conflicting processing and communicative requirements, while staying within tight learnability bounds, in line with Ackermann and Malouf's Low Conditional Entropy Conjecture. Our quantitative investigation supports a discriminative view of morphological inflection as a collective, emergent system, whose global self-organization rests on a surprisingly small handful of language-independent principles of word coactivation and competition.https://www.frontiersin.org/article/10.3389/fcomm.2019.00048/fullmorphological complexitydiscriminative learningrecurrent neural networksself-organizationemergenceprocessing uncertainty
collection DOAJ
language English
format Article
sources DOAJ
author Claudia Marzi
Marcello Ferro
Vito Pirrelli
spellingShingle Claudia Marzi
Marcello Ferro
Vito Pirrelli
A Processing-Oriented Investigation of Inflectional Complexity
Frontiers in Communication
morphological complexity
discriminative learning
recurrent neural networks
self-organization
emergence
processing uncertainty
author_facet Claudia Marzi
Marcello Ferro
Vito Pirrelli
author_sort Claudia Marzi
title A Processing-Oriented Investigation of Inflectional Complexity
title_short A Processing-Oriented Investigation of Inflectional Complexity
title_full A Processing-Oriented Investigation of Inflectional Complexity
title_fullStr A Processing-Oriented Investigation of Inflectional Complexity
title_full_unstemmed A Processing-Oriented Investigation of Inflectional Complexity
title_sort processing-oriented investigation of inflectional complexity
publisher Frontiers Media S.A.
series Frontiers in Communication
issn 2297-900X
publishDate 2019-09-01
description Due to the typological diversity of their inflectional processes, some languages are intuitively more difficult than other languages. Yet, finding a single measure to quantitatively assess the comparative complexity of an inflectional system proves an exceedingly difficult endeavor. In this paper we propose to investigate the issue from a processing-oriented standpoint, using data processed by a type of recurrent neural network to quantitatively model the dynamic of word processing and learning in different input conditions. We evaluate the relative complexity of a set of typologically different inflectional systems (Greek, Italian, Spanish, German, English and Standard Modern Arabic) by training a Temporal Self-Organizing Map (TSOM), a recurrent variant of Kohonen's Self-Organizing Maps, on a fixed set of verb forms from top-frequency verb paradigms, with no information about the morphosemantic and morphosyntactic content conveyed by the forms. After training, the behavior of each language-specific TSOM is assessed on different tasks, looking at self-organizing patterns of temporal connectivity and functional responses. Our simulations show that word processing is facilitated by maximally contrastive inflectional systems, where verb forms exhibit the earliest possible point of lexical discrimination. Conversely, word learning is favored by a maximally generalizable system, where forms are inferred from the smallest possible number of their paradigm companions. Based on evidence from the literature and our own data, we conjecture that the resulting balance is the outcome of the interaction between form frequency and morphological regularity. Big families of stem-sharing, regularly inflected forms are the productive core of an inflectional system. Such a core is easier to learn but slower to discriminate. In contrast, less predictable verb forms, based on alternating and possibly suppletive stems, are easier to process but are learned by rote. Inflection systems thus strike a balance between these conflicting processing and communicative requirements, while staying within tight learnability bounds, in line with Ackermann and Malouf's Low Conditional Entropy Conjecture. Our quantitative investigation supports a discriminative view of morphological inflection as a collective, emergent system, whose global self-organization rests on a surprisingly small handful of language-independent principles of word coactivation and competition.
topic morphological complexity
discriminative learning
recurrent neural networks
self-organization
emergence
processing uncertainty
url https://www.frontiersin.org/article/10.3389/fcomm.2019.00048/full
work_keys_str_mv AT claudiamarzi aprocessingorientedinvestigationofinflectionalcomplexity
AT marcelloferro aprocessingorientedinvestigationofinflectionalcomplexity
AT vitopirrelli aprocessingorientedinvestigationofinflectionalcomplexity
AT claudiamarzi processingorientedinvestigationofinflectionalcomplexity
AT marcelloferro processingorientedinvestigationofinflectionalcomplexity
AT vitopirrelli processingorientedinvestigationofinflectionalcomplexity
_version_ 1724799887849029632