Please use this identifier to cite or link to this item: https://accedacris.ulpgc.es/jspui/handle/10553/151480
Title: Addressing class imbalance in handwritten script identification using sampling techniques
Authors: Djaffal, Souhaila
Benmabrouk, Yasmina
Djeddi, Chawki
Diaz, Moises 
UNESCO Clasification: 330405 Sistemas de reconocimiento de caracteres
Keywords: Handwritten page level script identification
Hybrid-sampling
Mdiw-13 dataset
Over-sampling
Under-sampling
Issue Date: 2025
Journal: Lecture Notes in Networks and Systems 
Conference: 6th Mediterranean Conference on Pattern Recognition and Artificial Intelligence (MedPRAI 2024) 
Abstract: In real-world datasets, class imbalance is common, where certain classes are underrepresented, leading to skewed distributions that negatively impact classifier performance, particularly for minority classes. This issue is prevalent in script identification tasks, where underrepresented scripts lead to biased models that struggle to predict minority classes accurately. To address this problem, we explored the effectiveness of various resampling techniques, grouped into under-sampling, over-sampling, and hybrid-sampling methods. Our study evaluates these techniques by testing multiple classifiers on a subset of the MDIW-13 dataset, focusing on handwritten page level script identification. The results demonstrate significant improvements in various performance metrics when applying resampling techniques, emphasizing the crucial role of hybrid sampling in mitigating class imbalance in script identification tasks.
URI: https://accedacris.ulpgc.es/jspui/handle/10553/151480
ISBN: 9783031908927
ISSN: 2367-3370
DOI: 10.1007/978-3-031-90893-4_30
Source: Lecture Notes in Networks and Systems [ISSN 2367-3370], v. 1393 LNNS, p. 455-470, (Enero 2026)
Appears in Collections:Actas de congresos
Show full item record

Google ScholarTM

Check

Altmetric


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.