Xavier Anguera, Ph.D. - List of publications
2008
- "TV advertisements detection and clustering based on acoustic information", D. Conejero and X. Anguera, in Proc. International Conference on Computational Intelligence for Modelling, Control and Automation - CIMCA08, Vienna, Austria, December 2008 pdf
- "MAMI: Multimodal Annotations on a Camera Phone", X. Anguera and N. Oliver, in Proc. MobileHCI, Amsterdam, September 2008 pdf
- "Sistema de Indexación Automática de Contenidos Multimedia", U. Urdapilleta, D. Conejero, X. Anguera, D. Cacenabes and F.J. Caminero, in Proc. XVIII Jornadas Telecom I+D, Bilbao, Spain pdf
- "Multimodal and Mobile Personal Image Retrieval: A User Study", X. Anguera, N. Oliver and M. Cherubini, in Proc. Workshop on Mobile Information Retrieval, MOBIR'08, Singapore pdf
- "Multimodal Photo Annotation and Retrieval on a Mobile Phone", X. Anguera, J. Xu, N. Oliver, in Proc. ACM Intl. Conference on Multimedia Information Retrieval, Vancouver, Canada. 2008 pdf
2007
- "Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information", Jose M. Pardo, Xavier Anguera and Chuck Wooters, IEEE Transactions on Computers, September 2007, volume 56, number 9, pp. 1189-1224. pdf
- "Acoustic beamforming for speaker diarization of meetings", Xavier Anguera, Chuck Wooters and Javier Hernando, IEEE Transactions on Audio, Speech and Language Processing, September 2007, volume 15, number 7, pp. 2011-2023. pdf
- "Model Complexity Selection and Cross-Validation EM Training for Robust Speaker Diarization", Xavier Anguera, Takahiro Shinozaki, Chuck Wooters and Javier Hernando, ICASSP, Hawaii, USA, April 2007. pdf
- "Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings", Xavier Anguera, Chuck Wooters, J.M. Pardo and Javier Hernando, ICASSP, Hawaii, USA, April 2007. pdf
- "Speaker Diarization for Conference Room: The UPC RT07s Evaluation System", Jordi Luque, Xavier Anguera, Andrey Temko, and Javier Hernando, RT07s Rich Transcription evaluation workshop, Washington, May 2007 pdf
- "The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System", Andreas Stolcke, Xavier Anguera, Kofi Boakye, Ozgur Çetin, Adam Janin, Mathew Magimai-Doss, Chuck Wooters, and Jing Zheng, RT07s Rich Transcription evaluation workshop, Washington, May 2007 pdf
2006
- "Purity Algorithms for Speaker Diarization of Meetings Data", Xavier Anguera, Chuck Wooters and Javier Hernando. ICASSP 2006, Toulouse, France, May 2006. pdf
- "Speaker Diarization for Multi-Microphone Meetings Using only Between-Channel Differences", Jose M. Pardo, Xavier Anguera, Chuck Wooters, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization", Xavier Anguera, Chuck Wooters, Javier Hernando, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Robust Speaker Diarization for Meetings: ICSI RT06s Meetings Evaluation System", Xavier Anguera, Chuck Wooters and Jose M. Pardo, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Frame Purification for Cluster Comparison in Speaker Diarization", Xavier Anguera, Chuck Wooters, Javier Hernando, MMUA 2006, Toulouse, France, May 2006. pdf
- "Hybrid Speech/Non-Speech Detector Applied to Speaker Diarization of Meetings", Xavier Anguera, Mateu Aguilo, Chuck Wooters, Climent Nadeu and Javier Hernando, Speaker Odyssey 2006, San Juan de Puerto Rico, USA, June 2006. pdf
- "Friends and Enemies: A Novel Initialization for Speaker Diarization", Xavier Anguera, Chuck Wooters and Javier Hernando, ICSLP’06, Pittsburgh, Pennsylvania, USA, September 2006. pdf
- "Robust Speaker Diarization for Meetings: ICSI RT06s evaluation system", Xavier Anguera, Chuck Wooters and Jose M. Pardo, ICSLP’06, Pittsburgh, Pennsylvania, USA, September 2006. pdf
- "The ICSI-SRI Spring 2006 Meeting Recognition System", A. Janin, A. Stolcke, X. Anguera, K. Boakye, O. Cetin, J. Frankel, and J. Zheng, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Multi-Stream Speaker Diarization Systems for the Meetings Domain", Ascension Gallardo, Xavier Anguera and Chuck Wooters, ICSLP’06, Pittsburgh, Pennsylvania, USA, September 2006. pdf
- "Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences", Jose M. Pardo, Xavier Anguera and Chuck Wooters, ICSLP’06, Pittsburgh, Pennsylvania, USA, September 2006. pdf
2005
- "XBIC: Real-Time Cross Probabilities Measure for Speaker Segmentation", Xavier Anguera. International Computer Science Institute Technical Report TR-05-008. pdf
- "Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System", Xavier Anguera, Chuck Wooters, Barbara Peskin and Mateu Aguilo. In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Second International Workshop (MLMI 2005), Lecture Notes in Computer Science. Springer pdf
- "Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System", Andreas Stolcke, Xavier Anguera, Kofi Boakye, Ozgur Cetin, Frantisek Grezl, Adam Janin, Arindam Mandal, Barbara Peskin, Chuck Wooters and Jing Zheng. In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Second International Workshop (MLMI 2005), Lecture Notes in Computer Science. Springer pdf
- "Speaker Diarization for Multi-Party Meetings Using Acoustic Fusion", Xavier Anguera, Chuck Wooters and Javier Hernando. Automatic Speech Recognition and Understanding (ASRU). Puerto Rico, November 2005. pdf
- "PETRA: Advanced Oral Interfaces for Unified Messaging Applications", David Hernando, Javier Hernando and Xavier Anguera. Buran magazine, IEEE Barcelona student branch. Number 22, September 2005.
2004
- "Evolutive Speaker Segmentation using a Repository System", Xavier Anguera and Javier Hernando. ICSLP, Korea 2004. pdf
- "Segmentació de locutor per a la indexació automàtica de bases de dades multimèdia en català", Xavier Anguera, Mireia Farrús, Javier Hernando and Alberto Abad. II Congrés d’enginyeria en llengua catalana, Andorra 2004. pdf
- "Els sistemes de reconeixement de veu i traducció automàtica en català: present i futur", Mireia Farrús, Jan Anguita, Xavier Anguera, Josep M. Crego, Adrià de Gispert, Javier Hernando, Climent Nadeu. II Congrés d’enginyeria en llengua catalana, Andorra 2004.
- "XBIC: Nueva Medida para Segmentación de Locutor hacia el Indexado Automático de la Señal de Voz", Xavier Anguera, Javier Hernando and Jan Anguita. III Jornadas en Tecnología del Habla, Valencia, 17-10 Nov 2004.
- "Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System", Chuck Wooters, James Fung, Barbara Peskin and Xavier Anguera. EARS Program RT-04 Workshop, Nov 7-10, 2004. pdf