Please use this identifier to cite or link to this item:
https://accedacris.ulpgc.es/jspui/handle/10553/156556
| Title: | Combining Synthetic Patient Data Generation with Machine Learning Methods for Diabetes Prediction | Authors: | Rodríguez Almeida, Antonio José Castro Fernández, María Déniz García, Alejandro Fabelo Gómez, Himar Antonio Ortega Sarmiento, Samuel Quevedo Gutiérrez, Eduardo Gregorio Soguero Ruiz, Cristina Wägner, Anna Maria Claudia Granjad, Conceiçao Marrero Callicó, Gustavo Iván |
UNESCO Clasification: | 32 Ciencias médicas 320502 Endocrinología |
Keywords: | Machine Learning Synthetic Data Generation Diabetes Mellitus Classification |
Issue Date: | 2021 | Conference: | Digitalized methods and tools for industry and healthcare | Abstract: | Diabetes Mellitus (DM) is a chronic disease caused by different disorders in the insulin production or use. Its prevalence has not stopped increasing during the last years, becoming a major public health concern. Thus, tools for its prediction and early diagnosis are needed. In this context, Machine Learning (ML) could be a suitable choice due to its capability of extracting useful information from medical records. However, the lack of available and reliable datasets makes this a complex task. Synthetic data generation is emerging as a solution for this issue, as it takes a real dataset as the basis to generate similar instances. In this work, a framework based on ML and synthetic data generation methods is presented to evaluate whether classification performance between presence or absence of DM could be improved. The obtained results show that ADASYN and Borderline SMOTE algorithms fairly keep the underlying structure of the original data. They also prove that the ML models trained with mixed synthetic and original data perform as well as those trained with original data. | URI: | https://accedacris.ulpgc.es/jspui/handle/10553/156556 |
| Appears in Collections: | Actas de congresos |
Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.