Fighting plagiarism: metrics and methods to measure and find similarities among source code of computer programs In VPL

Rodríguez del Pino, Juan Carlos; Rubio-Royo, E.; Hernandez-Figueroa, Z.

Please use this identifier to cite or link to this item: https://accedacris.ulpgc.es/jspui/handle/10553/53443

DC Field	Value	Language
dc.contributor.author	Rodríguez del Pino, Juan Carlos	en_US
dc.contributor.author	Rubio-Royo, E.	en_US
dc.contributor.author	Hernandez-Figueroa, Z.	en_US
dc.contributor.other	Hernandez Figueroa, Zenon Jose	-
dc.date.accessioned	2019-02-04T16:40:27Z	-
dc.date.available	2019-02-04T16:40:27Z	-
dc.date.issued	2011	en_US
dc.identifier.issn	2340-1117	en_US
dc.identifier.uri	https://accedacris.ulpgc.es/handle/10553/53443	-
dc.description.abstract	Plagiarism in academia is becoming a major problem; this can be seen in the increasing number of cases that come to light. Increasing use of information and communication technologies seems to be an important factor to facilitate plagiarism, although it can be part of the solution too. Assignments where students should develop computer programs to submit their source code are no strangers to this problem, being even one of the areas in which more cases of plagiarism are reported. The most common form of plagiarism is produced by making changes in an original source file to get a different version while maintaining the behavior of the new program as in the original. In this paper, we show a tool that uses different metrics and methods to look for and show the most similar source files in a set. Similarity among files is directly related to the possibility that they are the outcome of a process of plagiarism. The metrics used are three: two proposed by the authors and a third commonly used. The reason for using three metrics is that each one is sensitive to different forms of systematic changes in source code files, so the combination of them increases the capability of discover plagiarism attempts. Searching for the most similar files requires a preprocessing consisting of: a lexical analysis, a filtering and a normalization of expressions, to get a signature for each file. These signatures will then be compared using the proposed metrics. The search process is optimized to run using a minimum memory and little time. As result of the process, we get a list of the most similar pairs of files sorted from highest to lowest similarity, besides a list of clusters of the most similar files. Both lists are shown using a gradation of colours to express the similarity levels in a friendly manner; the numeric results of the applied metrics are shown as well. This interface is designed to facilitate taken appropriate decisions. The proposed tool is part of VPL, a Virtual Programming Lab module for Moodle, a popular Learning Management System distributed under GNU/GPL license. The anti-plagiarism tool offers a user-friendly interface allowing compare files from VPL activities among them or against external sources, with online response.	en_US
dc.language	eng	en_US
dc.publisher	2340-1117	en_US
dc.source	Edulearn11: 3Rd International Conference On Education And New Learning Technologies [ISSN 2340-1117], p. 4339-4346	en_US
dc.subject	120308 Código y sistemas de codificación	en_US
dc.subject	710202 Códigos de conducta ética	en_US
dc.subject	1203 Ciencia de los ordenadores	en_US
dc.subject.other	Plagiarism	en_US
dc.subject.other	Virtual programming lab	en_US
dc.subject.other	Programming assignment management	en_US
dc.subject.other	similarity metrics	en_US
dc.title	Fighting plagiarism: metrics and methods to measure and find similarities among source code of computer programs In VPL	en_US
dc.type	info:eu-repo/semantics/conferenceObject	en_US
dc.type	ConferenceObject	en_US
dc.relation.conference	3rd International Conference on Education and New Learning Technologies (EDULEARN)
dc.identifier.isi	000326292904054	-
dcterms.isPartOf	Edulearn11: 3Rd International Conference On Education And New Learning Technologies	-
dcterms.source	Edulearn11: 3Rd International Conference On Education And New Learning Technologies[ISSN 2340-1117], p. 4339-4346	-
dc.description.lastpage	4346	-
dc.description.firstpage	4339	-
dc.investigacion	Ingeniería y Arquitectura	en_US
dc.type2	Actas de congresos	en_US
dc.identifier.wos	WOS:000326292904054	-
dc.contributor.daisngid	8096003	-
dc.contributor.daisngid	2706797	-
dc.contributor.daisngid	5640231	-
dc.contributor.daisngid	7868966
dc.identifier.investigatorRID	H-9706-2015	-
dc.utils.revision	Sí	en_US
dc.contributor.wosstandard	WOS:Rodriguez-del-Pino, JC
dc.contributor.wosstandard	WOS:Rubio-Royo, E
dc.contributor.wosstandard	WOS:Hernandez-Figueroa, Z
dc.date.coverdate	2011
dc.identifier.conferenceid	events120846
dc.identifier.ulpgc	Sí	es
item.fulltext	Sin texto completo	-
item.grantfulltext	none	-
crisitem.event.eventsstartdate	04-07-2011	-
crisitem.event.eventsenddate	06-07-2011	-
crisitem.author.dept	Departamento de Informática y Sistemas	-
crisitem.author.dept	GIR IATEXT: Cognición, Lingüística, Texto y Procesamiento de la Información	-
crisitem.author.dept	IU de Análisis y Aplicaciones Textuales	-
crisitem.author.dept	Departamento de Informática y Sistemas	-
crisitem.author.orcid	0000-0001-7126-0406	-
crisitem.author.orcid	0000-0002-1657-4020	-
crisitem.author.parentorg	IU de Análisis y Aplicaciones Textuales	-
crisitem.author.fullName	Rodríguez Del Pino, Juan Carlos	-
crisitem.author.fullName	Hernández Figueroa, Zenón José	-
Appears in Collections:	Actas de congresos

Show simple item record

WEB OF SCIENCE^TM
Citations

1

checked on May 31, 2020

Page view(s)

129

checked on Oct 26, 2024

Google Scholar^TM

Check

WEB OF SCIENCE^TM
Citations

Page view(s)

Google Scholar^TM

Share

Export metadata

Dirección

Contacto

Legal

De interés

WEB OF SCIENCETM Citations

Page view(s)

Google ScholarTM

Share

Export metadata

Dirección

WEB OF SCIENCE^TM
Citations

Google Scholar^TM