Identificador persistente para citar o vincular este elemento: http://hdl.handle.net/10553/129014
Campo DC Valoridioma
dc.contributor.authorArnold, Tayloren_US
dc.contributor.authorBallier, Nicolasen_US
dc.contributor.authorLissón Hernández, Paula Joséen_US
dc.contributor.authorTilton, Laurenen_US
dc.date.accessioned2024-02-20T19:15:51Z-
dc.date.available2024-02-20T19:15:51Z-
dc.date.issued2019en_US
dc.identifier.issn1574-020Xen_US
dc.identifier.urihttp://hdl.handle.net/10553/129014-
dc.description.abstractThis paper presents a combination of R packages—user contributed toolkits written in a common core programming language—to facilitate the humanistic investigation of digitised, text-based corpora.Our survey of text analysis packages includes those of our own creation (cleanNLP and fasttextM) as well as packages built by other research groups (stringi, readtext, hyphenatr, quanteda, and hunspell). By operating on generic object types, these packages unite research innovations in corpus linguistics, natural language processing, machine learning, statistics, and digital humanities. We begin by extrapolating on the theoretical benefits of R as an elaborate gluing language for bringing together several areas of expertise and compare it to linguistic concordancers and other tool-based approaches to text analysis in the digital humanities. We then showcase the practical benefits of an ecosystem by illustrating how R packages have been integrated into a digital humanities project. Throughout, the focus is on moving beyond the bag-of-words, lexical frequency model by incorporating linguistically-driven analyses in research.en_US
dc.languageengen_US
dc.relation.ispartofLanguage Resources and Evaluationen_US
dc.sourceLanguage Resources and Evaluation [1574-020X], vol. 53, p. 707–733en_US
dc.subject5701 Lingüística aplicadaen_US
dc.subject.otherDigital humanitiesen_US
dc.subject.otherText miningen_US
dc.subject.otherRen_US
dc.subject.otherText interoperabilityen_US
dc.titleBeyond lexical frequencies: using R for text analysis in the digital humanitiesen_US
dc.typeArticleen_US
dc.identifier.doi10.1007/s10579-019-09456-6en_US
dc.identifier.scopus2-s2.0-85064342350-
dc.identifier.isiWOS:000501297700007-
dc.contributor.orcid0000-0003-0576-0669-
dc.contributor.orcid#NODATA#-
dc.contributor.orcid#NODATA#-
dc.contributor.orcid#NODATA#-
dc.description.lastpage733en_US
dc.identifier.issue4-
dc.description.firstpage707en_US
dc.investigacionArtes y Humanidadesen_US
dc.utils.revisionen_US
dc.identifier.ulpgcNoen_US
dc.contributor.buulpgcBU-HUMen_US
dc.description.sjr0,441
dc.description.jcr1,014
dc.description.sjrqQ1
dc.description.jcrqQ4
dc.description.scieSCIE
dc.description.erihplusERIH PLUS
item.grantfulltextopen-
item.fulltextCon texto completo-
crisitem.author.deptGIR IATEXT: Variación y Cambio Lingüístico-
crisitem.author.deptIU de Análisis y Aplicaciones Textuales-
crisitem.author.deptDepartamento de Filología Moderna, Traducción e Interpretación-
crisitem.author.parentorgIU de Análisis y Aplicaciones Textuales-
crisitem.author.fullNameLissón Hernández, Paula José-
Colección:Artículos
Adobe PDF (1,92 MB)
Vista resumida

Google ScholarTM

Verifica

Altmetric


Comparte



Exporta metadatos



Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor con todos los derechos reservados, a menos que se indique lo contrario.