Welcome to UCSD's Computer Audition Laboratory. Listen up!


Supported by NSF CAREER grant IIS-1054960, "An integrated framework for multimodal music search and discovery".


publications
2013
J.C. Pereira, E. Coviello, G. Doyle, N. Rasiwasia, G. Lanckriet, R. Levy, N. Vasconcelos - "On the role of Correlation and Abstraction in Cross-Modal Multimedia Retrieval". To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence.
K. Ellis, E. Coviello, A. Chan and G. Lanckriet - "A Bag of Systems Representation for Music Auto-tagging". To appear in IEEE Transactions on Audio, Speech and Language Processing.
T. Chuk, A.C.W. Ng, E. Coviello, A.B. Chan and J.H. Hsiao - "Understanding eye movements in face recognition with hidden Markov model". CogSci 2013, Berlin, Germany. 31 July - 3 Aug. 2013.
E. Coviello, A. Mumtaz, A.B. Chan and G. Lanckriet - "That was fast! Speeding up NN search of high dimensional distributions". ICML 2013, Atlanta, Georgia (USA). 16 - 21 June 2013.
D. Lim, B. McFee, G. Lanckriet - "Robust Structural Metric Learning". ICML 2013, Atlanta, Georgia (USA). 16 - 21 June 2013.
G. Surges - "PyOracle - Analysis of Musical Structure Using Python". PyCon 2013. Python Software Foundation. Santa Clara, CA. 17 March 2013

2012
E. Coviello, A.B. Chan & G. Lanckriet - The variational hierarchical EM algorithm for clustering hidden Markov models. NIPS 2012
S. Dubnov, G. Assayag - Music Design with Audio Oracle using Information Rate. MUME Workshop, AAAI 2012
E. Coviello, Y. Vaizman & G. Lanckriet - Multivariate Autoregressive Mixture Models for Music. ISMIR 2012
J. Urbano, S. Downie, B. McFee & M. Schedl - How significant is statistically significant? The case of audio music similarity and retrieval. ISMIR 2012
B. McFee & G. Lanckriet - Hypergraph models of playlist dialects. ISMIR 2012
E. Coviello, A. Mumtaz, A. Chan & G. Lanckriet - Growing a Bag of Systems Tree for Fast and Accurate Classification. IEEE CVPR 2012
L. Barrington, D. Turnbull & G. Lanckriet - Game-Powered Machine Learning. Proceedings of the National Academy of Sciences (2012), Vol. 109, pp. 6411-6416.
B. McFee, T. Bertin-Mahieux, D.P.W. Ellis & G. Lanckriet - The Million Song Dataset Challenge. 4th International Workshop on Advances in Music Information Research (AdMIRe), 2012. - MSD Challenge on Kaggle
B. McFee, L. Barrington & G. Lanckriet - Learning content similarity for music recommendation. IEEE Transactions on Audio, Speech and Language Processing, 2012.

2011
S. Dubnov - Changes in Musical Culture and Practices as a result of new Multimedia Technologies. Keynote challenge talk at IEEE ISM 2011
K. Ellis, E. Coviello, & G. Lanckriet - Semantic Annotation and Retrieval of Music Using a Bag of Systems Representation. ISMIR 2011
E. Coviello, R. Miotto, & G. Lanckriet - Combining Content-Based Auto-Taggers with Decision-Fusion. ISMIR 2011
B. McFee & G. Lanckriet - The natural language of playlists. ISMIR 2011 (data)
B. McFee & G. Lanckriet - Large-scale music similarity search with spatial trees. ISMIR 2011 (code)
Y. Vaizman, R.Y. Granot, J. Israel & G. Lanckriet - Modeling Dynamic Patterns for Emotional Content in Music. ISMIR 2011
S. Dubnov, G. Assayag and A. Cont - Audio Oracle analysis of Musical Information Rate. Proceedings of the IEEE Semantic Computing Conference (ICSC), September 2011
S. Dubnov, G. Assayag and A. Cont - On the Information Geometry of Audio Streams with Applications to Similarity Computing. IEEE Transactions on Audio, Speech and Language Processing, 19(4), pp. 837-846, 2011.
J. Keshet, C-C Cheng, M. Stoehr, C. McAllester & L. K. Saul - Direct Error Rate Minimization of Hidden Markov Models. INTERSPEECH, August 2011
E. Coviello, A.B. Chan & G. Lanckriet - Time Series Models for Semantic Music Annotation. IEEE Transactions on Audio, Speech, and Language Processing, July 2011
C-C Cheng and B. Kingsbury - Arccosine Kernels: Acoustic Modeling with Infinite Neural Networks. ICASSP, May 2011
B. McFee, L. Barrington & G. Lanckriet - Learning content similarity for music recommendation. Submitted to IEEE Transactions on Audio, Speech and Language Processing, 2011.
B. McFee & G. Lanckriet - Learning multi-modal similarity. Journal of Machine Learning Research (JMLR), February, 2011.

2010
B. McFee, L. Barrington & G. Lanckriet - Learning Similarity from Collaborative Filters. ISMIR 2010
R. Miotto, L. Barrington & G. Lanckriet - Improving Auto-tagging by Modeling Semantic Co-occurrences. ISMIR 2010
E. Coviello, L. Barrington, A.B. Chan & G. Lanckriet - Automatic Music Tagging With Time Series Models. ISMIR 2010
N. Koenigstein, G. Lanckriet, B. McFee and Y. Shavitt - Collaborative Filtering Based on P2P Networks. ISMIR 2010
B. McFee & G. Lanckriet - Metric learning to rank. ICML 2010
S. Dubnov - Musical Information Dynamics as Models of Auditory Anticipation. Machine Audition: Principles, Algorithms and Systems, ed. W. Wang, IGI Global publication, 2010.
L. Barrington, A.B. Chan, G. Lanckriet - Modeling Music as a Dynamic Texture. IEEE Transactions on Audio, Speech and Language Processing, 18(3), pp. 602-612. (project page)
C.-C. Cheng, F. Sha & L. K. Saul - Online learning and acoustic feature adaptation in large margin hidden Markov models. IEEE Journal of Selected Topics in Signal Processing 4(6): 926-942, 2010.

2009
C.-C. Cheng, F. Sha, and L. K. Saul - Large margin feature adaptation for automatic speech recognition. Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU-09). Merano, Italy.
B. McFee and G. Lanckriet - Heterogeneous embedding for subjective artist similarity. Tenth International Symposium for Music Information Retrieval (ISMIR). Kobe, Japan.
L. Barrington, R. Oda, G. Lanckriet - Smarter Than Genius? Human Evaluation of Music Recommender Systems. Tenth International Symposium for Music Information Retrieval (ISMIR). Kobe, Japan.
D. J. Hu and L. K. Saul - A probabilistic model of unsupervised learning for musical-key profiles. Tenth International Society for Music Information Retrieval Conference (ISMIR-09). Kobe, Japan.
S. Dubnov, Y. Kiyoki - Opera of Meaning: film and music performance with semantic associative search. Frontiers in Artificial Intelligence and Applications, Information Modelling and Knowledge Bases XX, Volume 190, pp. 384-391, 2009.
C.-C. Cheng, F. Sha, and L. K. Saul - A fast online algorithm for large margin training of continuous-density hidden Markov models. In Proceedings of the Tenth Annual Conference of the International Speech Communication Association (Interspeech-09). Brighton, UK.
L. Barrington, D. Turnbull, M. Yazdani, G. Lanckriet - Combining Audio Content and Social Context for Semantic Music Discovery. SIGIR, 2009.
B. McFee, G. Lanckriet - Partial order embedding with multiple kernels. Twenty-sixth International Conference on Machine Learning (ICML), 2009.
C.-C. Cheng, F. Sha, and L. K. Saul - Matrix updates for perceptron training of continuous-density hidden Markov models. In Proceedings of the Twenty Sixth International Conference on Machine Learning (ICML-09), pages 153-160. Montreal, Canada.
Y. Cho and L. K. Saul - Learning dictionaries of stable autoregressive models for audio scene analysis. Twenty Sixth International Conference on Machine Learning (ICML), pages 169-176. Montreal, Canada.
Y. Cho and L. K. Saul - Sparse decomposition of mixed audio signals by basis pursuit with autoregressive models. In Proceedings of the International Conference of Acoustics, Speech, and Signal Processing (ICASSP), pages 1705-1708. Taipei, Taiwan.
L. Barrington, A.B. Chan, G. Lanckriet - Dynamic Texture Models of Music. In Proceedings of the International Conference of Acoustics, Speech, and Signal Processing (ICASSP). Taipei, Taiwan.
S. Dubnov, M.J. Hinich - Analyzing several musical instrument tones using the randomly modulated periodicity model. Signal Processing, Volume 89, Issue 1, pp. 24-30, January 2009

2008
L. Barrington, M. Yazdani, D. Turnbull, G. Lanckriet - Combination of Feature Kernels for Semantic Music Retrieval. ISMIR 2008
D. Turnbull, L. Barrington, G. Lanckriet - Five Approaches to Collecting Tags for Music. ISMIR 2008
C. C. Cheng, D. J. Hu, and L. K. Saul - Nonnegative matrix factorization for real time musical analysis and sight-reading evaluation. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP-08), pages 2017-2020. Las Vegas, NV.
D. Turnbull, L. Barrington, D. Torres, G. Lanckriet - Semantic Annotation and Retrieval of Music and Sound Effects. IEEE Transactions on Audio, Speech, and Language Processing, February 2008
S. Dubnov - Unified View of Prediction and Repetition Structure in Audio Signals. IEEE Transactions on Audio, Speech and Language Processing, February 2008
S. Dubnov and G. Assayag - Memex and Composer Duets: computer aided composition using style modeling and mixing. Open Music Composers book 2, 2008

2007
Turnbull, Liu, Barrington & Lanckriet - A Game-Based Approach for Collecting Semantic Annotations of Music. ISMIR, Vienna, Austria, September 2007.
Torres, Turnbull, Barrington & Lanckriet - Identifying Words that are Musically Meaningful. ISMIR, Vienna, Austria, September 2007.
Turnbull, Lanckriet, Pampalk & Goto - A Supervised Approach for Detecting Boundaries in Music Using Difference Features and Boosting. ISMIR, Vienna, Austria, September 2007.
Cont, Dubnov & Wessel - Realtime Multiple-pitch and Multiple-instrument Recognition For Music Signals using Sparse Non-negative Constraints. DAFx, Bordeaux, France, September 2007.
Cont, Dubnov & Assayag - GUIDAGE: A Fast Audio Query Guided Assemblage. ICMC, Copenhagen, Denmark, August 2007.
Dubnov, Cont & Assayag - Audio Oracle: A New Algorithm for Fast Learning of Audio Structures. ICMC, Copenhagen, Denmark, August 2007.
Turnbull, Barrington, Torres & Lanckriet - Towards Musical Query-by-Semantic Description using the CAL500 Data Set. To appear in SIGIR, Amsterdam, July 2007
Cont, Dubnov & Assayag - Anticipatory Model of Musical Style Imitation using Collaborative and Competitive Reinforcement Learning. In Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior, Butz, M.V.; Sigaud, O.; Pezzulo, G.; Baldassarre, G. (Eds.), pages 285-306, LNCS 4520, Springer Verlag.
Barrington, Chan, Turnbull & Lanckriet - Audio Information Retrieval Using Semantic Similarity. International Conference on Acoustics, Speech and Signal Processing (ICASSP), Hawaii, April 2007
Sriperumbudur, Torres & Lanckriet - Sparse Eigen Methods by D.C. Programming. To appear in International Conference on Machine Learning (ICML), 2007
Turnbull, Barrington, Torres & Lanckriet - Exploring the Semantic Annotation and Retrieval of Sound. CAL Technical Report CAL-2007-01, San Diego, February 2007

2006
Turnbull, Barrington, Torres & Lanckriet - Modeling the Semantics of Sound. NIPS Workshop on Advances in Models for Acoustic Processing, Vancouver, December 2006
Turnbull, Barrington & Lanckriet - Modeling Music and Words using a Multi-Class naive Bayes Approach. International Symposium on Music Information Retrieval (ISMIR), Victoria, October 2006
Cont - Realtime Multiple Pitch Observation using Sparse Non-negative Constraints. International Symposium on Music Information Retrieval (ISMIR), Victoria, October 2006.
Cont, Dubnov & Assayag - A framework for Anticipatory Machine Improvisation and Style Imitation. Anticipatory Behavior in Adaptive Learning Systems (ABiALS), Rome, September 2006.
Cont - Realtime Audio to Score Alignment for Polyphonic Music Instruments Using Sparse Non-negative constraints and Hierarchical HMMs. ICASSP'06, Toulouse, May 2006.
Barrington, Lyons, Diegmann & Abe - Ambient Display Using Musical Effects. Intelligent User Interfaces (IUI), Sydney, January 2006