Please use this identifier to cite or link to this item:
https://accedacris.ulpgc.es/jspui/handle/10553/151478
| Title: | Breaking boundaries: enhancing script identification using a learnable MULLER resizer | Authors: | Djaffal, Souhaila Benmabrouk, Yasmina Djeddi, Chawki Diaz, Moises |
UNESCO Clasification: | 330405 Sistemas de reconocimiento de caracteres | Keywords: | Machine-printed text Hybrid word level script identification Muller resizer Conventional resizers Mdiw-13 dataset, et al |
Issue Date: | 2025 | Journal: | Lecture Notes in Computer Science | Conference: | 27th International Conference on Pattern Recognition (ICPR 2024) | Abstract: | Effective script identification is pivotal in document analysis and recognition, especially when dealing with hybrid multiscript documents at fine-grained levels. Recent advancements in deep learning models have revolutionized document analysis and recognition tasks, leveraging script document images. However, these images often require resizing, which can result in data loss. This paper explores the impact of MULLER, a learnable resizer on a hybrid word-level script identification task using the Multi-lingual and Multi-script Documents In the Wild (MDIW-13) dataset. Our approach integrates MULLER resizer with a pre-module that employs k-means clustering to determine the optimal target size. When jointly trained with MobileNet, it achieves an impressive average accuracy of 98.16%. In summary, our findings underscore the potential of the MULLER resizer to outperform conventional resizers, thereby evaluating script classification performance. | URI: | https://accedacris.ulpgc.es/jspui/handle/10553/151478 | ISSN: | 0302-9743 | DOI: | 10.1007/978-3-031-78119-3_16 | Source: | Pattern Recognition. ICPR 2024. Lecture Notes in Computer Science [ISSN 0302-9743],v. 15331, p. 222-236, (2025) |
| Appears in Collections: | Actas de congresos |
Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.