Please use this identifier to cite or link to this item:
http://hdl.handle.net/10553/128848
Title: | Information extraction from electricity invoices through named entity recognition with transformers | Authors: | Salgado De La Nuez, Agustín Sánchez Pérez, Javier |
UNESCO Clasification: | 120304 Inteligencia artificial | Keywords: | Machine learning Natural Language Processing (NLP) Named entity recognition Transformer Electricity invoices |
Issue Date: | 2023 | Publisher: | International Frequency Sensor Association (IFSA) Publishing, S. L. | Conference: | 5th International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI 2023) | Abstract: | This article describes a method for automatically extracting information from electricity invoices. This type of documents contains rich information about the billing of each supply point and data about the customer, the contract, or the electricity company. In this work, we train a neural network to classify the input data among eighty-six different labels. We use the IDSEM dataset that contains 75.000 electricity invoices of the Spanish electricity market in PDF format. Each document is converted into text format and the classification is carried out through a named entity recognition (NER) process. The underlying neural network used in the process is a Transformer. The results demonstrate that the proposed method correctly classifies the majority of the labels with high accuracy. Furthermore, the method exhibits robustness in handling invoices with different layouts and contents, highlighting its versatility and reliability. | URI: | http://hdl.handle.net/10553/128848 | ISBN: | 978-84-09-48561-1 | ISSN: | 2938-5350 | DOI: | 10.13140/RG.2.2.27945.77924 | Source: | Advances in Signal Processing and Artificial Intelligence. Proceedings of the 5th International Conference on Advances in Signal Processing and Artificial Intelligence (ASPAI' 2023), p. 140-145. 7-9 June 2023, Tenerife |
Appears in Collections: | Actas de congresos |
Page view(s)
108
checked on Oct 5, 2024
Download(s)
198
checked on Oct 5, 2024
Google ScholarTM
Check
Altmetric
Share
Export metadata
Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.