Please use this identifier to cite or link to this item: http://hdl.handle.net/10553/43488
Title: Automatic syllabification for Spanish using lemmatization and derivation to solve the prefix's prominence issue
Authors: Hernández-Figueroa, Zenón 
Carreras-Riudavets, Francisco J. 
Rodríguez-Rodríguez, Gustavo
UNESCO Clasification: 570104 Lingüística informatizada
Keywords: Syllabification
Lemmatization
Derivation
Prefix
Issue Date: 2013
Journal: Expert Systems with Applications 
Abstract: The syllabification of Spanish's words follows a few basic rules, but the syllabification of some words deviates from the general rules according to a number of factors described in this paper. Prefixes are major cause of variations on syllabification. Since, in Spanish, prefixes tend to do not integrate into other syllables when they are prominent, the syllabification of words can vary depending on the prominence of the prefixes. This paper shows that, in many cases, the prominence of a prefix can be inferred by means of some morphological and lexical knowledge. This paper proposes a syllabification algorithm that implements the basic syllabification rules and combines them with morphological and lexical information obtained from three sources: a lemmatizer, a derivation database, and the Corpus de Referencia del Español Actual (CREA) of Royal Spanish Academy. Using this additional information, this paper attempts to provide a solution to the problem of taken into account the prefixes according to its prominence for a correct syllabification.
URI: http://hdl.handle.net/10553/43488
ISSN: 0957-4174
DOI: 10.1016/j.eswa.2013.06.056
Source: Expert Systems with Applications[ISSN 0957-4174],v. 40, p. 7122-7131
Appears in Collections:Artículos
Show full item record

SCOPUSTM   
Citations

9
checked on Nov 17, 2024

WEB OF SCIENCETM
Citations

4
checked on Nov 17, 2024

Page view(s)

190
checked on May 25, 2024

Google ScholarTM

Check

Altmetric


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.