ISO/IEC MPEG, Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s - Part 3: Audio, ISO/IEC 11172-3, 1992.

ISO/IEC MPEG, Information technology - Generic coding of moving pictures and associated audio information - Part 3: Audio, ISO/IEC 13818-3, 1998.

ISO/IEC MPEG, Information technology - Coding of audio-visual objects - Part 3: Audio, ISO/IEC 14496-3, 2001.

ISO/IEC MPEG, Report on the MPEG-2 AAC stereo verification tests, 1998.

E. D. Scheirer, Tempo and beat analysis of acoustic musical signals, J. Acoust. Soc. Am., vol.103, issue.1, pp.588-601, 1998.

S. Dixon, Automatic extraction of tempo and beat from expressive performances, J. New Music Research, vol.30, issue.1, pp.39-58, 2001.

A. P. Klapuri, A. J. Eronen, and J. T. Astola, Analysis of the meter of acoustic musical signals, IEEE Trans. on Audio, Speech and Lang. Proc, vol.14, issue.1, pp.342-355, 2006.

M. E. Davies and M. D. Plumbley, Context-dependent beat tracking of musical audio, IEEE Trans. on Audio, Speech and Lang. Proc, vol.15, issue.3, pp.1009-1020, 2007.

T. Fujishima, Realtime chord recognition of musical sound: A system using Common Lisp Music, Proc. Int. Comput. Music Conf, pp.464-467, 1999.

A. Sheh and D. P. Ellis, Chord segmentation and recognition using EM-trained hidden Markov models, Proc. Int. Conf. Music Inf. Retrieval, pp.185-191, 2003.

J. P. Bello and J. Pickens, A robust mid-level representation for harmonic content in music signals, Proc. Int. Conf. Music Inf. Retrieval, pp.304-311, 2005.

K. Lee and M. Slaney, Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio, IEEE Trans. on Audio, Speech and Lang. Proc, vol.16, issue.2, pp.291-301, 2008.

G. Tzanetakis and P. Cook, Musical genre classification of audio signals, IEEE Trans. on Speech and Audio Proc, vol.10, issue.5, pp.293-302, 2002.

J. Bergstra, N. Casagrande, D. Erhan, D. Eck, and B. Kégl, Aggregate features and AdaBoost for music classification, Machine Learning, vol.65, pp.473-484, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00176062

A. Holzapfel and Y. Stylianou, Musical genre classification using nonnegative matrix factorization-based features, IEEE Trans. on Audio, Speech and Lang. Proc, vol.16, issue.2, pp.424-434, 2008.

N. Patel and I. Sethi, Audio characterization for video indexing, Proc. SPIE, pp.373-384, 1996.

L. Yapp and G. Zick, Speech recognition on MPEG/Audio encoded files, Proc. IEEE Int. Conf. on Multimedia Computing and Systems, pp.624-625, 1997.

Y. Nakajima, Y. Lu, M. Sugano, A. Yoneyama, H. Yanagihara et al., A fast audio classification from MPEG coded data, Proc. IEEE Int. Conf. Acoustics, Speech and Sig. Proc, vol.6, pp.3005-3008, 1999.

D. Pye, Content-based methods for the management of digital music, Proc. IEEE Int. Conf. Acoustics, Speech and Sig. Proc, vol.6, pp.2437-2440, 2000.

G. Tzanetakis and P. Cook, Sound analysis using MPEG compressed audio, Proc. IEEE Int. Conf. Acoustics, Speech and Sig. Proc, vol.2, pp.761-764, 2000.

Y. Wang and M. Vilermo, A compressed domain beat detector using MP3 audio bitstreams, ACM Multimedia, pp.194-202, 2001.

X. Shao, C. Xu, Y. Wang, and M. Kankanhalli, Automatic music summarization in compressed domain, Proc. IEEE Int. Conf. Acoustics, Speech and Sig. Proc, vol.4, pp.261-264, 2004.

S. Kiranyaz, A. F. Qureshi, and M. Gabbouj, A generic audio classification and segmentation approach for multimedia indexing and retrieval, IEEE Trans. on Audio, Speech and Lang. Proc, vol.14, issue.3, pp.1062-1081, 2006.

J. Zhu and Y. Wang, Complexity-scalable beat detection with MP3 audio bitstreams, Computer Music Journal, vol.32, issue.1, pp.71-87, 2008.

S. Pfeiffer and T. Vincent, Formalisation of MPEG-1 compressed domain audio features, CSIRO Mathematical and Information Sciences, 2001.

E. Ravelli, G. Richard, and L. Daudet, Union of MDCT bases for audio coding, IEEE Trans. on Audio, Speech and Lang. Proc, vol.16, issue.8, pp.1361-1372, 2008.
URL : https://hal.archives-ouvertes.fr/hal-02652697

LAME MP3 encoder webpage, 2008.

Nero AAC codec webpage, 2008.

Apple iTunes 7 webpage, 2008.

United States Advanced Television Systems Committee (ATSC), 1995.

J. Bello, C. Duxbury, M. Davies, and M. Sandler, On the use of phase and energy for musical onset detection in the complex domain, IEEE Sig. Proc. Letters, vol.11, issue.6, pp.553-556, 2004.

S. Pauws, Musical key extraction from audio, Proc. of the 5th ISMIR, pp.96-99, 2004.

E. Gomez and P. Herrera, Estimating the tonality of polyphonic audio files: Cognitive versus machine learning modelling strategies, Proc. of the 5th ISMIR, pp.92-95, 2004.

S. Davis and P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Trans. Acoust., Speech, Sig. Proc, vol.28, issue.4, pp.357-366, 1980.

F. Pachet and J. J. Aucouturier, Improving timbre similarity: How high is the sky?, J. Negative Results Speech Audio Sci, vol.1, issue.1, 2004.

P. Leveau, E. Vincent, G. Richard, and L. Daudet, Instrument-specific harmonic atoms for mid-level music representation, IEEE Trans. on Audio, Speech and Lang. Proc, vol.16, issue.1, pp.116-128, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00544175

libMAD MPEG audio decoder webpage, 2008.

FAAC, 2008.

S. Hainsworth, Techniques for the automated analysis of musical audio, 2004.

C. Harte and M. Sandler, Automatic chord identification using a quantized chromagram, Proc. of the 118th AES Convention, 2005.