Xavier Anguera, Ph.D. - List of publications
2010
- "MuViSync: Realtime Music Video Alignment",
Robert Macrae, Xavier Anguera and Nuria Oliver, to appear in Proc. ICME 2010 pdf
- "Enriching Music Mood Annotation by Semantic Association Reasoning",
Jun Wang, Xavier Anguera, Xiaoou Chen and Deshun Yang, to appear in Proc. AdMiRe Workshop, in ICME 2010 pdf
- "Partial Sequence Matching Using an Unbounded Dynamic Time Warping Algorithm",
Xavier Anguera, Robert Macrae and Nuria Oliver, to appear in Proc. ICASSP 2010 pdf
- "Unrestricted Voice Annotations and Search of Personal Photographs in a Mobile Phone",
Xavier Anguera, Mauro Cherubini and Nuria Oliver, to appear in Proc. Of Spoken Query 2010 Workshop on voice search, in ICASSP 2010 pdf
2009
- "Telefonica Research Content-Based Copy Detection TRECVID Submission",
Xavier Anguera, Pere Obrador, Tomasz Adamek, David Marimon and Nuria Oliver, NIST Trecvid 2009 Workshop notebook paper pdf
- "Revisiting the use of GMM-SVM for speaker verification",
Xavier Anguera, in Proc. Interspeech 2009 pdf
- "Multimodal video copy detection of social media",
Xavier Anguera, Pere Obrador and Nuria Oliver, in Proc. first SIGMM Workshop on Social Media (WSM2009) at ACM MM09 pdf
- "The role of tags and image aesthetics in social image search",
Pere Obrador, Xavier Anguera, Rodrigo de Oliveira and Nuria Oliver, in Proc. first SIGMM Workshop on Social Media (WSM2009) at ACM MM09 pdf
- "Text versus Speech: A Comparison of Tagging Input Modalities for Camera Phones",
M. Cherubini, X. Anguera, N. Oliver and R. de Oliveira, in Proc. MobileHCI, Bonn, Germany, September 2009, (best paper award nominee) pdf
- "Audio-Based Soccer Game Summarization", Helenca Duxans,
Xavier Anguera and David Conejero, in Proc. IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB09) pdf
- "Audio-Based Automatic Management of Audio Commercials",
H. Duxans, D. Conejero and X. Anguera, in Proc. ICASSP 2009, Taipei, Taiwan. April 2009 pdf
2008
- "TV advertisements detection and clustering based on acoustic
information",
D. Conejero and X. Anguera, in proc. International
Conference on Computational Intelligence for Modelling, Control and
Automation - CIMCA08, Viena, Austria, December 2008 pdf
- "MAMI: Multimodal Annotations on a Camera Phone",
X. Anguera
and N. Oliver, in Proc. MobileHCI, Amsterdam, September 2008 pdf
- "Sistema de Indexación Automática de Contenidos Multimedia",
U.Urdapilleta, D.Conejero, X. Anguera, D. Cacenabes and F.J. Caminero,
in Proc. XVIII Jornadas Telecom I+D, Bilbao, Spain pdf
- "Multimodal and Mobile Personal Image Retrieval: A User Study",
X.
Anguera, N.Oliver and M. Cherubini, in Proc. Workshop on Mobile
Information Retrieval, MOBIR'08, Singapore pdf
- "Multimodal Photo Annotation and Retrieval on a Mobile Phone",
X.
Anguera, J.Xu, N. Oliver, in Proc. ACM Intl. Conference on Multimedia
Information Retrieval, Vancouver, Canada. 2008 pdf
2007
- "Speaker Diarization For Multiple-Distant-Microphone Meetings Using
Several Sources of Information",
Jose M. Pardo, Xavier Anguera and
Check Wooters, IEEE Transactions on Computers, September 2007, volume
56, number 9, pp. 1189-1224. pdf
- "Acoustic beamforming for speaker diarization of meetings",
Xavier
Anguera, Chuck Wooters and Javier Hernando, IEEE Transactions on Audio,
Speech and Language Processing, September 2007, volume 15, number 7,
pp.2011-2023. pdf
- "Model Complexity Selection and Cross-Validation EM Training for
Robust Speaker Diarization",
Xavier Anguera, Takahiro Shinozaki, Chuck
Wooters and Javier Hernando, ICASSP, Hawaii, USA, April 2007. pdf
- "Automatic Weighting for the Combination of TDOA and Acoustic
Features in Speaker Diarization for Meetings",
Xavier Anguera, Chuck
Wooters, J.M Pardo and Javier Hernando, ICASSP, Hawaii, USA, April 2007.
pdf
- "Speaker Diarization for Conference Room: The UPC RT07s Evaluation
System",
Jordi Luque, Xavier Anguera, Andrey Temko, and Javier Hernando,
RT07s Rich Transcription evaluation workshop, Washington, May 2007 pdf
- "The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System",
Andreas Stolcke, Xavier Anguera, Kofi Boakye, Ozgur Çetin, Adam Janin,
Mathew Magimai-Doss, Chuck Wooters, and Jing Zheng, RT07s Rich
Transcriptionevaluation workshop, Washington, May 2007 pdf
2006
- "Purity Algorithms for Speaker Diarization of Meetings Data",
Xavier Anguera, Chuck Wooters and Javier Hernando. ICASSP 2006,
Toulouse, France, May 2006. pdf
- "Speaker Diarization for Multi-Microphone Meetings Using only
Between-Channel Differences",
Jose M. Pardo, Xavier Anguera, Chuck
Wooters, In S. Renals and S. Bengio, editors, Machine Learning for
Multimodal Interaction: Third InternationalWorkshop (MLMI 2006), Lecture
Notes in Computer Science. Springer pdf
- "Automatic Cluster Complexity and Quantity Selection: Towards
Robust Speaker Diarization",
Xavier Anguera, Chuck Wooters, Javier
Hernando, In S. Renals and S. Bengio, editors, Machine Learning for
Multimodal Interaction: Third International Workshop (MLMI 2006),
Lecture Notes in Computer Science. Springer pdf
- "Robust Speaker Diarization for Meetings: ICSI RT06s Meetings
Evaluation System",
Xavier Anguera, Chuck Wooters and Jose M. Pardo, In
S. Renals and S. Bengio, editors, Machine Learning for Multimodal
Interaction: Third International Workshop (MLMI 2006), Lecture Notes in
Computer Science. Springer pdf
- "Frame Purification for Cluster Comparison in Speaker Diarization",
Xavier Anguera, Chuck Wooters, Javier Hernando, MMUA 2006, Toulouse,
France, May 2006. pdf
- "Hybrid Speech/Non-Speech Detector Applied to Speaker Diarization
of Meetings",
Xavier Anguera, Mateu Aguilo, Chuck Wooters, Climent Nadeu
and Javier Hernando, Speaker Odyssey 2006, San Juan de Puerto Rico,
USA, June 2006. pdf
- "Friends and Enemies: A Novel Initialization for Speaker
Diarization",
Xavier Anguera, Chuck Wooters and Javier Hernando,
ICSLP06, Pittsburgh, Pensilvania, USA, September 2006. pdf
- "Robust Speaker Diarization for Meetings: ICSI RT06s evaluation
system",
Xavier Anguera, Chuck Wooters and Jose M. Pardo, ICSLP06,
Pittsburgh, Pensilvania, USA, September 2006. pdf
- "The ICSI-SRI Spring 2006 Meeting Recognition System",
A. Janin, A.
Stolcke, X. Anguera, K. Boakye, O. Cetin, J. Frankel, and J. Zheng, In
S. Renals and S. Bengio, editors, Machine Learning for Multimodal
Interaction: Third International Workshop (MLMI 2006), Lecture Notes in
Computer Science. Springer pdf
- "Multi-Stream Speaker Diarization Systems for the Meetings Domain",
Ascension Gallardo, Xavier Anguera and Chuck Wooters, ICSLP06,
Pittsburgh, Pensilvania, USA, September 2006. pdf
- "Speaker Diarization for Multiple Distant Microphone Meetings:
Mixing Acoustic Features And Inter-Channel Time Differences",
Jose M.
Pardo, Xavier Anguera and Chuck Wooters, ICSLP06, Pittsburgh,
Pensilvania, USA, September 2006. pdf
2005
- "XBIC: Real-Time Cross Probabilities Measure for Speaker
Segmentation",
Xavier Anguera. International Computer Science Institute
Technical Report TR-05-008. pdf
- "Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005
Diarization System",
Xavier Anguera, Chuck Wooters, Barbara Peskin and
Mateu Aguilo. In S. Renals and S. Bengio, editors, Machine Learning for
Multimodal Interaction: Third International Workshop (MLMI 2005),
Lecture Notes in Computer Science. Springer pdf
- "Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005
Speech-to-Text Evaluation System",
Andreas Stolcke, Xavier Anguera, Kofy
Boakye, Ozgur Cetin, Frantisek Grezl, Adam Janin, Arindam Mandal,
Barbara Peskin, Chuck Wooters and Jing Zheng. In S. Renals and S.
Bengio, editors, Machine Learning for Multimodal Interaction: Third
International Workshop (MLMI 2005), Lecture Notes in Computer Science.
Springer pdf
- "Speaker Diarization for Multi-Party Meetings Using Acoustic
Fusion",
Xavier Anguera, Chuck Wooters and Javier Hernando. Automatic
Speech Recognition and Understanding (ASRU). Puerto Rico, November 2005.
pdf
- "PETRA: Advanced Oral Interfaces for Unified Messaging
Applications",
David Hernando, Javier Hernando and Xavier Anguera. Buran
magazine, IEEE Barcelona student branch. Number 22, September 2005.
2004
- "Evolutive Speaker Segmentation using a Repository System",
Xavier
Anguera and Javier Hernando. ICSLP, Korea 2004. pdf
- "Segmentació de locutor per a la indexació automàtica de bases de
dades multimèdia en català",
Xavier Anguera, Mireia Farrús , Javier
Hernando and Alberto Abad. II Congrés d'enginyeria en llengua catalana,
Andorra 2004. pdf
- "Els sistemes de reconeixement de veu i traducció automàtica en
català: present i futur",
Mireia Farrús, Jan Anguita, Xavier Anguera,
Josep M. Crego, Adrià de Gispert, Javier Hernando, Climent Nadeu. II
Congrés d’enginyeria en llengua catalana, Andorra 2004.
- "XBIC: Nueva Medida para Segmentación de Locutor hacia el Indexado
Automático de la Señal de Voz",
Xavier Anguera, Javier Hernando and Jan
Anguita. III Jornadas en Tecnología del Habla, Valencia, 17-10 Nov 2004.
- "Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004
Diarization System",
Chuck Wooters, James Fung, Barbara Peskin and
Xavier Anguera. EARS Program RT-04 Workshop, nov 7-10 2004. pdf
The documents distributed in this page are provided as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that the works are offered here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be distributed without the explicit permission of the copyright holder.