Xavier Anguera, Ph.D. - List of publications
2014
- "Query by Example Search on Speech at Mediaeval 2014",
Xavier Anguera, Luis-Javier Rodriguez-Fuentes, Igor Szöke, Andi Buso and Florian Metze, in proc. Mediaeval 2014. pdf
- "Global Speaker Clustering towards Optimal Stopping Criterion in Binary Key Speaker Diarization",
Hector Delgado, Xavier Anguera, Corinne Fredouille and Javier Serrano, book chapter in Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science, Volume 8854, 2014, pp. 59-68. Presented at Iberspeech 2014, Las Palmas, Spain. pdf
- "Phoneme-Lattice to Phoneme-Sequence matching algorithm based on Dynamic Programming",
Ciro Gracia, Xavier Anguera, Jordi Luque and Ittai Artzi, book chapter in Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science, Volume 8854, 2014, pp. 99-108. Presented at Iberspeech 2014, Las Palmas, Spain. pdf
- "Flexible Stand-alone Keyword RecognitionApplication using Dynamic Time Warping",
Miquel Ferrarons, Xavier Anguera and Jordi Luque, book chapter in Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science, Volume 8854, 2014, pp. 158-167. Presented at Iberspeech 2014, Las Palmas, Spain. pdf
- "Audio-to-text Alignment for speech recognition with very limited resources",
Xavier Anguera, Jordi Luque and Ciro Gracia, in proc. Interspeech 2014, Singapore. pdf
- "Query-by-Example Spoken Term Detection on Multilingual Unconstrained Speech",
Xavier Anguera, Luis Javier Rodriguez-Fuentes, Igor Szoke, Andi Buzo, Florian Metze and Mikel Penagarikano, in proc. Interspeech 2014, Singapore. pdf
- "On the Modeling of Natural Vocal Emotion Expressions Through Binary Key",
Jordi Luque and Xavier Anguera, in proc. Eusipco 2014, Lisboa, Portugal. pdf
- "Combining Temporal and Spectral Information for Query-By-Example Spoken Term Detection",
Ciro Gracia, Xavier Anguera and Xavier Binefa, in proc. Eusipco 2014, Lisboa, Portugal. pdf
- "Query-by-Example Spoken Term Detection Evaluation o Low-Resource Languages",
Xavier Anguera, Luis J. Rodriguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, Mikel Penagarikano, in Proc. SLTU 2014, Saint Petersburg, Russia. pdf
- "Sentiment Retrieval on Web Reviews Using Spontaneous Natural Speech",
Jose Costa Pereira, Jordi Luque and Xavier Anguera, in Proc. ICASSP 2014, Florence, Italy. pdf
- "Inferring Social Relationships in a Phone Call from a Single Party’s Speech",
Sree Harsha Yella, Xavier Anguera and Jordi Luque, in Proc. ICASSP 2014, Florence, Italy. pdf
- "Language independent search in MediaEval’s Spoken Web Search task",
Florian Metze, Xavier Anguera, Etienne Barnard, Marelie Davel and Guillaume Gravier, Elsevier Journal on Computer, Speech and language, January 2014. pdf
2013
- "Query-by-Example Spoken Term Detection ALBAYZIN 2012 evaluation: overview, systems, results, and discussion",
J. Tejedor, D.T. Toledano, Xavier Anguera, A. Varona, L.F. Hurtado, A. Miguel and J. Colás, EURASIP Journal on Audio, Speech, and Music Processing 2013, 2013:23, September 2013. pdf
- "The Telefonica Research Spoken Web Search System for Mediaeval 2013",
Xavier Anguera, Miroslav Skázel, Volker Vorwerk and Jordi Luque, in Proc. Mediaeval 2013 evaluation Workshop, Barcelona, Spain. pdf
- "The Spoken Web Search Task",
Xavier Anguera, Florian Metze, Andi Buso, Ifor Szöke and Luis-Javier Rodriguez Fuentes, in Proc. Mediaeval 2013 evaluation Workshop, Barcelona, Spain. pdf
- "A Riemannian stopping criterion for unsupervised phonetic segmentation",
Ciro Gracia, Xavier Anguera and Xavier Binefa, in Proc. ICMLA 2013, Florida, USA. pdf
- "Two-level clustering towards unsupervised discovery of acoustic classes",
Ciro Gracia, Xavier Anguera and Xavier Binefa, in Proc. ICMLA 2013, Florida, USA. pdf
- "Activating the Crowd: Exploiting User-Item Reciprocity for Recommendation",
Martha Larson, Alan Said, Yue Shi, Paolo Cremonesi, Domonkos Tikk, Alexandros Karatzoglou, Linas Baltrunas, Jozef Geurts, Xavier Anguera and Frank Hopfgartner, in Proc. CrowdRec: Crowdsourcing and Human Computation for Recommender Systems Workshop, ACM RecSys 2013. pdf
- "Speed Improvements to Information Retrieval-Based Dynamic Time Warping Using Hierarchical K-means Clustering ",
Gautam Mantena and Xavier Anguera, in Proc. ICASSP 2013, Vancouver, Canada. pdf
- "Perceptually Inspired Features for Speaker Likability Classification ",
Sira Gonzalez and Xavier Anguera, in Proc. ICASSP 2013, Vancouver, Canada. pdf
- "The Spoken Web Search Task at Mediaeval 2012 ",
Florian Metze, Xavier Anguera, Etienne Barnard, Marelie Davel and Guillaume Gravier, in Proc. ICASSP 2013, Vancouver, Canada. pdf
- "Memory Efficient Subsequence DTW for Query-by-Example Spoken Term Detection ",
Xavier Anguera and Miquel Ferrarons, in Proc. ICME 2013, San Jose, CA, USA. pdf
- "Information Retrieval-based Dynamic Time Warping ",
Xavier Anguera, in Proc. Interspeech 2013, Lyon, France. pdf
2012
- "Multimodal Video Copy Detection using local features ",
Xavier Anguera and Tomasz Adamek, in IEEE COMSOC MMTC E-Letter. pdf
- "Telefonica Research system for the Query-by-example task at Albayzin 2012 ",
Xavier Anguera, in Proc. Iberspeech 2012, Madrid, Spain. pdf
- "Telefonica Research System for the Spoken Web Search task at Mediaeval 2012 ",
Xavier Anguera, in Proc. Mediaeval 2012 evaluation Workshop, Pisa, Italy. pdf
- "The Spoken Web Search Task ",
Florian Metze, Etienne Barnard, Marelie Davel, Charl van Heerden, Xavier Anguera, Guillaume Gravier and Nitendra Rajput, in Proc. Mediaeval 2012 evaluation Workshop, Pisa, Italy. pdf
- "Emotions recognition using binary fingerprints ",
Xavier Anguera, Esperança Movellan and Miquel Ferrarons, in Proc. Iberspeech 2012, Madrid, Spain. pdf
- "The Spoken Web search task at Mediaeval 2011 ",
Florian Metze, Nitendra Rajput, Xavier Anguera, Marelie Davel, Guillaume Gravier, Charl van Heerden, Gautam V. Mantena, Armando Muscariello, Kishore Prahallad, Igor Szoke, and Javier Tejedor, in Proc. ICASSP 2012, Kyoto, Japan. pdf
- "Speaker Independent Discriminant Feature Extraction for Acoustic Pattern-Matching ",
Xavier Anguera, in Proc. ICASSP 2012, Kyoto, Japan. pdf
- "MASK: Robust Local Features for Audio Fingerprinting ",
Xavier Anguera, Antonio Garzon and Tomasz Adamek, in Proc. ICME 2012, Melbourne, Australia. (BEST PAPER AWARD ICME 2012)pdf
2011
- "Multimodal fusion for video copy detection ",
Xavier Anguera, Juan Manuel Barrios, Tomasz Adamek and Nuria Oliver, in Proc. ACM Multimedia 2011. pdf
- "Speaker modeling using local binary decisions ",
Jean-Francois Bonastre, Xavier Anguera, Gabriel H. Sierra and Pierre-Michel Bousquet, in Proc. Interspeech 2011. pdf
- "Telefonica Research at TRECVID 2011 Content-Based Copy Detection ",
Xavier Anguera, Tomasz Adamek, Daru Xu and Juan Manuel Barrios, NIST-TRECVID workshop 2011. pdf
- "Combining Features at Search Time: PRISMA at Video Copy Detection Task ",
Juan Manuel Barrios, Benjamin Bustos and Xavier Anguera, NIST-TRECVID workshop 2011. pdf
- "Telefonica System for the Spoken Web Search Task at Mediaeval 2011 ",
Xavier Anguera, MediaEval Workshop, November 2011, Pisa, Italy. pdf
- "Speaker Diarization: a review of recent research",
Xavier Anguera, Simon Bozonnet, Nicholas Evans, Corinne Fredouille, Gerald Friedland, and Oriol Vinyals, accepted for publication in Transactions on Audio, Speech and Language Processing (TASLP), special issue on New Frontiers in Rich Transcription. pdf
- "The ICSI RT-09 Speaker Diarization System",
Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox and Oriol Vinyals, accepted for publication in Transactions on Audio, Speech and Language Processing (TASLP), special issue on New Frontiers in Rich Transcription, July 2011. pdf
- "Real-Time synchronization of multimedia streams in a mobile device",
Robert Macrae, Joachim Neumann, Xavier Anguera, Nuria Oliver and Simon Dixon, to appear in Proc. ADMIRE Workshop within ICME 2011, Barcelona, Spain. pdf
- "Automatic Synchronization of Electronic and Audio Books via TTS Alignment and Silence Filtering",
Xavier Anguera, Néstor Pérez, Andreu Urruela and Nuria Oliver, in Proc. Hot Topics in Multimedia within ICME 2011, Barcelona, Spain. pdf
- "Spoken Wordcloud: clustering recurrent patterns in speech",
Remy Flamary, Xavier Anguera and Nuria Oliver, in Proc. CBMI 2011, Madrid, Spain. pdf
- "Fast Speaker Diarization Based on Binary Keys",
Xavier Anguera and Jean-Francois Bonastre, in Proc. ICASSP 2011, Prague, Check Republic. pdf
- "Discriminant Binary Data Representation for Speaker Recognition",
Jean-Francois Bonastre, Xavier Anguera Miro, Pierre-Michel Bousquet, Driss Matrouf, in Proc. ICASSP 2011, Prague, Check Republic. pdf
- "Closed-Form Expressions vs. BIC: a Comparison for Speaker Clustering",
Themos Stafylakis, Xavier Anguera, Vassilis Katsouros, George Carayannis, in Proc. ICASSP 2011, Prague, Check Republic. pdf
2010
- "Telefonica Research at TRECVID 2010 Content-Based Copy Detection",
Ehsan Younessian, Xavier Anguera, Tomasz Adamek, Nuria Oliver and David Marimon, NIST Trecvid Workshop notebook paper.pdf
- "Novel binary key representation for biometric speaker recognition",
Xavier Anguera and Jean-François Bonastre, in Proc. Interspeech 2010, Makuhari, Japan.pdf
- "System output combination for improved speaker diarization",
Simon Bozonet, Nicholas Evans, Xavier Anguera, Oriol Vinyals, Gerald Friedland and Corinne Fredouille, in Proc. Interspeech 2010, Makuhari, Japan.
- "Improvements to the equal-parameter BIC for Speaker Diarization",
Themos Stafylakis, Xavier Anguera, in Proc. Interspeech 2010, Makuhari, Japan.pdf
- "MuViSync: Realtime Music Video Alignment",
Robert Macrae, Xavier Anguera and Nuria Oliver, in Proc. ICME 2010 pdf
- "Enriching Music Mood Annotation by Semantic Association Reasoning",
Jun Wang, Xavier Anguera, Xiaoou Chen and Deshun Yang, in Proc. AdMiRe Workshop, in ICME 2010 pdf
- "Partial Sequence Matching Using an Unbounded Dynamic Time Warping Algorithm",
Xavier Anguera, Robert Macrae and Nuria Oliver, in Proc. ICASSP 2010 pdf
- "Unrestricted Voice Annotations and Search of Personal Photographs in a Mobile Phone",
Xavier Anguera, Mauro Cherubini and Nuria Oliver, in Proc. Of Spoken Query 2010 Workshop on voice search, in ICASSP 2010 pdf
2009
- "Telefonica Research Content-Based Copy Detection TRECVID Submission",
Xavier Anguera, Pere Obrador, Tomasz Adamek, David Marimon and Nuria Oliver, NIST Trecvid 2009 Workshop notebook paper pdf
- "MiniVectors: an Improved GMM-SVM Approach for Speaker Verification",
Xavier Anguera, in Proc. Interspeech 2009 pdf
- "Multimodal video copy detection of social media",
Xavier Anguera, Pere Obrador and Nuria Oliver, in Proc. first SIGMM Workshop on Social Media (WSM2009) at ACM MM09 pdf
- "The role of tags and image aesthetics in social image search",
Pere Obrador, Xavier Anguera, Rodrigo de Oliveira and Nuria Oliver, in Proc. first SIGMM Workshop on Social Media (WSM2009) at ACM MM09 pdf
- "Text versus Speech: A Comparison of Tagging Input Modalities for Camera Phones",
M. Cherubini, X. Anguera, N. Oliver and R. de Oliveira, in Proc. MobileHCI, Bonn, Germany, September 2009, (best paper award nominee) pdf
- "Audio-Based Soccer Game Summarization", Helenca Duxans,
Xavier Anguera and David Conejero, in Proc. IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB09) pdf
- "Audio-Based Automatic Management of Audio Commercials",
H. Duxans, D. Conejero and X. Anguera, in Proc. ICASSP 2009, Taipei, Taiwan. April 2009 pdf
2008
- "TV advertisements detection and clustering based on acoustic information",
D. Conejero and X. Anguera, in proc. International Conference on Computational Intelligence for Modelling, Control and Automation - CIMCA08, Viena, Austria, December 2008 pdf
- "MAMI: Multimodal Annotations on a Camera Phone",
X. Anguera and N. Oliver, in Proc. MobileHCI, Amsterdam, September 2008 pdf
- "Sistema de Indexación Automática de Contenidos Multimedia",
U.Urdapilleta, D.Conejero, X. Anguera, D. Cacenabes and F.J. Caminero, in Proc. XVIII Jornadas Telecom I+D, Bilbao, Spain pdf
- "Multimodal and Mobile Personal Image Retrieval: A User Study",
X. Anguera, N.Oliver and M. Cherubini, in Proc. Workshop on Mobile Information Retrieval, MOBIR'08, Singapore pdf
- "Multimodal Photo Annotation and Retrieval on a Mobile Phone",
X. Anguera, J.Xu, N. Oliver, in Proc. ACM Intl. Conference on Multimedia Information Retrieval, Vancouver, Canada. 2008 pdf
2007
- "Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information",
Jose M. Pardo, Xavier Anguera and Chuck Wooters, IEEE Transactions on Computers, September 2007, volume 56, number 9, pp. 1189-1224. pdf
- "Acoustic beamforming for speaker diarization of meetings",
Xavier Anguera, Chuck Wooters and Javier Hernando, IEEE Transactions on Audio, Speech and Language Processing, September 2007, volume 15, number 7, pp.2011-2023. pdf
- "Model Complexity Selection and Cross-Validation EM Training for Robust Speaker Diarization",
Xavier Anguera, Takahiro Shinozaki, Chuck Wooters and Javier Hernando, ICASSP, Hawaii, USA, April 2007. pdf
- "Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings",
Xavier Anguera, Chuck Wooters, J.M Pardo and Javier Hernando, ICASSP, Hawaii, USA, April 2007. pdf
- "Speaker Diarization for Conference Room: The UPC RT07s Evaluation System",
Jordi Luque, Xavier Anguera, Andrey Temko, and Javier Hernando, RT07s Rich Transcription evaluation workshop, Washington, May 2007 pdf
- "The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System",
Andreas Stolcke, Xavier Anguera, Kofi Boakye, Ozgur Çetin, Adam Janin, Mathew Magimai-Doss, Chuck Wooters, and Jing Zheng, RT07s Rich Transcription evaluation workshop, Washington, May 2007 pdf
2006
- "Purity Algorithms for Speaker Diarization of Meetings Data",
Xavier Anguera, Chuck Wooters and Javier Hernando. ICASSP 2006, Toulouse, France, May 2006. pdf
- "Speaker Diarization for Multi-Microphone Meetings Using only Between-Channel Differences",
Jose M. Pardo, Xavier Anguera, Chuck Wooters, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third InternationalWorkshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization",
Xavier Anguera, Chuck Wooters, Javier Hernando, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Robust Speaker Diarization for Meetings: ICSI RT06s Meetings Evaluation System",
Xavier Anguera, Chuck Wooters and Jose M. Pardo, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Frame Purification for Cluster Comparison in Speaker Diarization",
Xavier Anguera, Chuck Wooters, Javier Hernando, MMUA 2006, Toulouse, France, May 2006. pdf
- "Hybrid Speech/Non-Speech Detector Applied to Speaker Diarization of Meetings",
Xavier Anguera, Mateu Aguilo, Chuck Wooters, Climent Nadeu and Javier Hernando, Speaker Odyssey 2006, San Juan de Puerto Rico, USA, June 2006. pdf
- "Friends and Enemies: A Novel Initialization for Speaker Diarization",
Xavier Anguera, Chuck Wooters and Javier Hernando, ICSLP06, Pittsburgh, Pensilvania, USA, September 2006. pdf
- "Robust Speaker Diarization for Meetings: ICSI RT06s evaluation system",
Xavier Anguera, Chuck Wooters and Jose M. Pardo, ICSLP06, Pittsburgh, Pensilvania, USA, September 2006. pdf
- "The ICSI-SRI Spring 2006 Meeting Recognition System",
A. Janin, A. Stolcke, X. Anguera, K. Boakye, O. Cetin, J. Frankel, and J. Zheng, In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer pdf
- "Multi-Stream Speaker Diarization Systems for the Meetings Domain",
Ascension Gallardo, Xavier Anguera and Chuck Wooters, ICSLP06, Pittsburgh, Pensilvania, USA, September 2006. pdf
- "Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences",
Jose M. Pardo, Xavier Anguera and Chuck Wooters, ICSLP06, Pittsburgh, Pensilvania, USA, September 2006. pdf
2005
- "XBIC: Real-Time Cross Probabilities Measure for Speaker Segmentation",
Xavier Anguera. International Computer Science Institute Technical Report TR-05-008. pdf
- "Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System",
Xavier Anguera, Chuck Wooters, Barbara Peskin and Mateu Aguilo. In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2005), Lecture Notes in Computer Science. Springer pdf
- "Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System",
Andreas Stolcke, Xavier Anguera, Kofy Boakye, Ozgur Cetin, Frantisek Grezl, Adam Janin, Arindam Mandal, Barbara Peskin, Chuck Wooters and Jing Zheng. In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2005), Lecture Notes in Computer Science. Springer pdf
- "Speaker Diarization for Multi-Party Meetings Using Acoustic Fusion",
Xavier Anguera, Chuck Wooters and Javier Hernando. Automatic Speech Recognition and Understanding (ASRU). Puerto Rico, November 2005. pdf
- "PETRA: Advanced Oral Interfaces for Unified Messaging Applications",
David Hernando, Javier Hernando and Xavier Anguera. Buran magazine, IEEE Barcelona student branch. Number 22, September 2005.
2004
- "Evolutive Speaker Segmentation using a Repository System",
Xavier Anguera and Javier Hernando. ICSLP, Korea 2004. pdf
- "Segmentació de locutor per a la indexació automàtica de bases de dades multimèdia en català",
Xavier Anguera, Mireia Farrús , Javier Hernando and Alberto Abad. II Congrés d'enginyeria en llengua catalana, Andorra 2004. pdf
- "Els sistemes de reconeixement de veu i traduccio automatica en catala: present i futur",
Mireia Farrus, Jan Anguita, Xavier Anguera, Josep M. Crego, Adria de Gispert, Javier Hernando, Climent Nadeu. II Congres d'enginyeria en llengua catalana, Andorra 2004. pdf
- "XBIC: Nueva Medida para Segmentación de Locutor hacia el Indexado Automático de la Señal de Voz",
Xavier Anguera, Javier Hernando and Jan Anguita. III Jornadas en Tecnología del Habla, Valencia, 17-10 Nov 2004.pdf
- "Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System",
Chuck Wooters, James Fung, Barbara Peskin and Xavier Anguera. EARS Program RT-04 Workshop, nov 7-10 2004. pdf
Copyright note:
The documents distributed in this page are provided as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that the works are offered here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be distributed without the explicit permission of the copyright holder.