Identificador persistente para citar o vincular este elemento: https://accedacris.ulpgc.es/jspui/handle/10553/156556
Título: Combining Synthetic Patient Data Generation with Machine Learning Methods for Diabetes Prediction
Autores/as: Rodríguez Almeida, Antonio José 
Castro Fernández, María 
Déniz García, Alejandro 
Fabelo Gómez, Himar Antonio 
Ortega Sarmiento, Samuel 
Quevedo Gutiérrez, Eduardo Gregorio 
Soguero Ruiz, Cristina
Wägner, Anna Maria Claudia 
Granjad, Conceiçao
Marrero Callicó, Gustavo Iván 
Clasificación UNESCO: 32 Ciencias médicas
320502 Endocrinología
Palabras clave: Machine Learning
Synthetic Data Generation
Diabetes Mellitus
Classification
Fecha de publicación: 2021
Conferencia: Digitalized methods and tools for industry and healthcare
Resumen: Diabetes Mellitus (DM) is a chronic disease caused by different disorders in the insulin production or use. Its prevalence has not stopped increasing during the last years, becoming a major public health concern. Thus, tools for its prediction and early diagnosis are needed. In this context, Machine Learning (ML) could be a suitable choice due to its capability of extracting useful information from medical records. However, the lack of available and reliable datasets makes this a complex task. Synthetic data generation is emerging as a solution for this issue, as it takes a real dataset as the basis to generate similar instances. In this work, a framework based on ML and synthetic data generation methods is presented to evaluate whether classification performance between presence or absence of DM could be improved. The obtained results show that ADASYN and Borderline SMOTE algorithms fairly keep the underlying structure of the original data. They also prove that the ML models trained with mixed synthetic and original data perform as well as those trained with original data.
URI: https://accedacris.ulpgc.es/jspui/handle/10553/156556
Colección:Actas de congresos
Adobe PDF (420,59 kB)
Vista completa

Google ScholarTM

Verifica


Comparte



Exporta metadatos



Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor con todos los derechos reservados, a menos que se indique lo contrario.