Please use this identifier to cite or link to this item: https://accedacris.ulpgc.es/jspui/handle/10553/151478
Title: Breaking boundaries: enhancing script identification using a learnable MULLER resizer
Authors: Djaffal, Souhaila
Benmabrouk, Yasmina
Djeddi, Chawki
Diaz, Moises 
UNESCO Clasification: 330405 Sistemas de reconocimiento de caracteres
Keywords: Machine-printed text
Hybrid word level script identification
Muller resizer
Conventional resizers
Mdiw-13 dataset, et al
Issue Date: 2025
Journal: Lecture Notes in Computer Science 
Conference: 27th International Conference on Pattern Recognition (ICPR 2024) 
Abstract: Effective script identification is pivotal in document analysis and recognition, especially when dealing with hybrid multiscript documents at fine-grained levels. Recent advancements in deep learning models have revolutionized document analysis and recognition tasks, leveraging script document images. However, these images often require resizing, which can result in data loss. This paper explores the impact of MULLER, a learnable resizer on a hybrid word-level script identification task using the Multi-lingual and Multi-script Documents In the Wild (MDIW-13) dataset. Our approach integrates MULLER resizer with a pre-module that employs k-means clustering to determine the optimal target size. When jointly trained with MobileNet, it achieves an impressive average accuracy of 98.16%. In summary, our findings underscore the potential of the MULLER resizer to outperform conventional resizers, thereby evaluating script classification performance.
URI: https://accedacris.ulpgc.es/jspui/handle/10553/151478
ISSN: 0302-9743
DOI: 10.1007/978-3-031-78119-3_16
Source: Pattern Recognition. ICPR 2024. Lecture Notes in Computer Science [ISSN 0302-9743],v. 15331, p. 222-236, (2025)
Appears in Collections:Actas de congresos
Show full item record

Google ScholarTM

Check

Altmetric


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.