Please use this identifier to cite or link to this item:
http://hdl.handle.net/10553/41437
Title: Who is really talking? A visual-based speaker diarization strategy
Authors: Marín-Reyes, Pedro A.; Lorenzo-Navarro, Javier; Castrillón-Santana, Modesto; Sánchez-Nielsen, Elena
UNESCO Classification: 120304 Artificial intelligence
Keywords: Visual diarization strategies; Local descriptors; Histogram distances; F-reid
Issue Date: 2018
Publisher: Springer
Journal: Lecture Notes in Computer Science
Conference: 16th International Conference on Computer Aided Systems Theory (EUROCAST 2017)
Abstract: The speaker activity at the Canary Islands Parliament is recorded and later manually annotated. This task can be modelled as a diarization problem, that is, automatically annotating who is speaking and when. In this paper, we propose using the visual cue to solve the diarization task. This approach requires detecting individuals, determining which one is speaking, and extracting features for matching. To test the performance of our proposal, we evaluate four different strategies based on the visual shot features.
URI: http://hdl.handle.net/10553/41437
ISBN: 978-3-319-74726-2
ISSN: 0302-9743
DOI: 10.1007/978-3-319-74727-9_38
Source: Computer Aided Systems Theory – EUROCAST 2017. Lecture Notes in Computer Science, v. 10672 LNCS, p. 322-329
Appears in Collections: Book chapter
Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.