Please use this identifier to cite or link to this item: http://hdl.handle.net/10553/41437
Title: Who is really talking? a visual-based speaker diarization strategy
Authors: Marín-Reyes, Pedro A. 
Lorenzo-Navarro, Javier 
Castrillón-Santana, Modesto 
Sánchez-Nielsen, Elena
UNESCO Clasification: 120304 Inteligencia artificial
Keywords: Visual diarization strategies
Local descriptors
Histogram distances
F-reid
Issue Date: 2018
Publisher: Springer 
Journal: Lecture Notes in Computer Science 
Conference: 16th International Conference on Computer Aided Systems Theory, (EUROCAST 2017) 
Abstract: The speaker activity at the Canary Islands Parliament is recorded, and later manually annotated. This task can be modelled as a diarization problem, that is a way to automatically annotated who and when is speaking. In this paper, we propose the use of the visual cue to solve the diarization task. To perform this approach, it is mandatory to detect individuals, determine the one speaking, and extract features for matching. In order to test the performance of our proposal, we evaluate four different strategies based on the visual shot features.
URI: http://hdl.handle.net/10553/41437
ISBN: 978-3-319-74726-2
ISSN: 0302-9743
DOI: 10.1007/978-3-319-74727-9_38
Source: Computer Aided Systems Theory – EUROCAST 2017. EUROCAST 2017. Lecture Notes in Computer Science, v. 10672 LNCS, p. 322-329
Appears in Collections:Capítulo de libro
Thumbnail
pdf
Adobe PDF (1,66 MB)
Show full item record

SCOPUSTM   
Citations

2
checked on Apr 21, 2024

Page view(s)

157
checked on Jan 27, 2024

Download(s)

64
checked on Jan 27, 2024

Google ScholarTM

Check

Altmetric


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.