Please use this identifier to cite or link to this item: http://hdl.handle.net/10553/53443
DC FieldValueLanguage
dc.contributor.authorRodríguez del Pino, Juan Carlosen_US
dc.contributor.authorRubio-Royo, E.en_US
dc.contributor.authorHernandez-Figueroa, Z.en_US
dc.contributor.otherHernandez Figueroa, Zenon Jose-
dc.date.accessioned2019-02-04T16:40:27Z-
dc.date.available2019-02-04T16:40:27Z-
dc.date.issued2011en_US
dc.identifier.issn2340-1117en_US
dc.identifier.urihttp://hdl.handle.net/10553/53443-
dc.description.abstractPlagiarism in academia is becoming a major problem; this can be seen in the increasing number of cases that come to light. Increasing use of information and communication technologies seems to be an important factor to facilitate plagiarism, although it can be part of the solution too. Assignments where students should develop computer programs to submit their source code are no strangers to this problem, being even one of the areas in which more cases of plagiarism are reported. The most common form of plagiarism is produced by making changes in an original source file to get a different version while maintaining the behavior of the new program as in the original. In this paper, we show a tool that uses different metrics and methods to look for and show the most similar source files in a set. Similarity among files is directly related to the possibility that they are the outcome of a process of plagiarism. The metrics used are three: two proposed by the authors and a third commonly used. The reason for using three metrics is that each one is sensitive to different forms of systematic changes in source code files, so the combination of them increases the capability of discover plagiarism attempts. Searching for the most similar files requires a preprocessing consisting of: a lexical analysis, a filtering and a normalization of expressions, to get a signature for each file. These signatures will then be compared using the proposed metrics. The search process is optimized to run using a minimum memory and little time. As result of the process, we get a list of the most similar pairs of files sorted from highest to lowest similarity, besides a list of clusters of the most similar files. Both lists are shown using a gradation of colours to express the similarity levels in a friendly manner; the numeric results of the applied metrics are shown as well. This interface is designed to facilitate taken appropriate decisions. The proposed tool is part of VPL, a Virtual Programming Lab module for Moodle, a popular Learning Management System distributed under GNU/GPL license. The anti-plagiarism tool offers a user-friendly interface allowing compare files from VPL activities among them or against external sources, with online response.en_US
dc.languageengen_US
dc.publisher2340-1117en_US
dc.sourceEdulearn11: 3Rd International Conference On Education And New Learning Technologies [ISSN 2340-1117], p. 4339-4346en_US
dc.subject120308 Código y sistemas de codificaciónen_US
dc.subject710202 Códigos de conducta éticaen_US
dc.subject1203 Ciencia de los ordenadoresen_US
dc.subject.otherPlagiarismen_US
dc.subject.otherVirtual programming laben_US
dc.subject.otherProgramming assignment managementen_US
dc.subject.othersimilarity metricsen_US
dc.titleFighting plagiarism: metrics and methods to measure and find similarities among source code of computer programs In VPLen_US
dc.typeinfo:eu-repo/semantics/conferenceObjecten_US
dc.typeConferenceObjecten_US
dc.relation.conference3rd International Conference on Education and New Learning Technologies (EDULEARN)
dc.identifier.isi000326292904054-
dcterms.isPartOfEdulearn11: 3Rd International Conference On Education And New Learning Technologies-
dcterms.sourceEdulearn11: 3Rd International Conference On Education And New Learning Technologies[ISSN 2340-1117], p. 4339-4346-
dc.description.lastpage4346-
dc.description.firstpage4339-
dc.investigacionIngeniería y Arquitecturaen_US
dc.type2Actas de congresosen_US
dc.identifier.wosWOS:000326292904054-
dc.contributor.daisngid8096003-
dc.contributor.daisngid2706797-
dc.contributor.daisngid7868966
dc.contributor.daisngid5640231-
dc.identifier.investigatorRIDH-9706-2015-
dc.utils.revisionen_US
dc.contributor.wosstandardWOS:Rodriguez-del-Pino, JC
dc.contributor.wosstandardWOS:Rubio-Royo, E
dc.contributor.wosstandardWOS:Hernandez-Figueroa, Z
dc.date.coverdate2011
dc.identifier.conferenceidevents120846
dc.identifier.ulpgces
item.grantfulltextnone-
item.fulltextSin texto completo-
crisitem.event.eventsstartdate04-07-2011-
crisitem.event.eventsenddate06-07-2011-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.deptGIR IATEXT: Cognition, linguistic, text and information processing-
crisitem.author.deptIU de Análisis y Aplicaciones Textuales-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.orcid0000-0001-7126-0406-
crisitem.author.orcid0000-0002-1657-4020-
crisitem.author.parentorgIU de Análisis y Aplicaciones Textuales-
crisitem.author.fullNameRodríguez Del Pino, Juan Carlos-
crisitem.author.fullNameHernández Figueroa, Zenón José-
Appears in Collections:Actas de congresos
Show simple item record

Google ScholarTM

Check


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.