TY - CONF
T1 - On automatic annotation of meeting databases
AU - Gatica-Perez, Daniel
AU - McCowan, Iain
AU - Barnard, Mark
AU - Bengio, Samy
AU - Bourlard, Herve
N1 - Note: Published in: Proceedings 2003 International Conference on Image Processing. Piscataway, U.S. : Institute of Electrical and Electronics Engineers, Inc. Volume III, pp.629-632. ISSN 1522-4880 ISBN 0780377508
Organising Body: Institute of Electrical and Electronics Engineers
PY - 2003/9/14
Y1 - 2003/9/14
N2 - In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and multi-modal tasks, including speech recognition, people and action recognition, and information retrieval. We specifically focus on the task of semantic annotation of audio-visual (AV) events, where annotation consists of assigning labels (event names) to the data. In order to develop an automatic annotation system in a principled manner, it is essential to have a well-defined task, a standard corpus and an objective performance measure. In this work we address each of these issues to automatically annotate events based on participant interactions.
AB - In this paper, we discuss meetings as an application domain for multimedia content analysis. Meeting databases are a rich data source suitable for a variety of audio, visual and multi-modal tasks, including speech recognition, people and action recognition, and information retrieval. We specifically focus on the task of semantic annotation of audio-visual (AV) events, where annotation consists of assigning labels (event names) to the data. In order to develop an automatic annotation system in a principled manner, it is essential to have a well-defined task, a standard corpus and an objective performance measure. In this work we address each of these issues to automatically annotate events based on participant interactions.
KW - Computer science and informatics
U2 - 10.1109/ICIP.2003.1247323
DO - 10.1109/ICIP.2003.1247323
M3 - Paper
T2 - IEEE International Conference on Image Processing
Y2 - 14 September 2003 through 17 September 2003
ER -