|
[1] Y. Ariki, S. Kato and T. Takiguchi, Phoneme recognition based on Fisher weight map to higher-order local auto-correlation, Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP), September 2006, pp. 377-380.
[2] I. M.-Chagnolleau, G. Durou and F. Bimbot, Application of time-frequency principal component analysis to text-independent speaker identification, IEEE Trans. on Speech and Audio Processing 10(6) (2002), 371-378.
[3] M. P. Cooke, P. D. Green, L. B. Josifovski and A. Vizinho, Robust automatic speech recognition with missing and uncertain acoustic data, Speech Commun. 24 (2001), 267-285.
[4] M. Heckmann, X. Domont, F. Joublin and C. Goerick, A closer look on hierarchical spectro-temporal features (HIST), Proceedings of the Interspeech 2008, September 2008, pp. 894-897.
[5] N. Kitaoka, K. Yamamoto, T. Kusamizu, S. Nakagawa, T. Yamada, S. Tsuge, C. Miyajima, T. Nishiura, M. Nakayama, Y. Denda, M. Fujimoto, T. Takiguchi, S. Tamura, S. Kuroiwa, K. Takeda and S. Nakamura, Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance, Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, December 2007, pp. 607-612.
[6] N. Malayath, H. Hermansky, S. Kajarekar and B. Yegnanarayana, Data-driven temporal filters and alternatives to GMM in speaker verification, Digital Signal Process. 10 (2000), 55-74.
[7] B. T. Meyer and B. Kollmeier, Optimization and evaluation of Gabor feature sets for ASR, Proceedings if the Interspeech 2008, September 2008, pp. 906-909.
[8] T. Nitta, Feature extraction for speech recognition based on orthogonal acoustic feature planes and LDA, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, May 1999, pp. 421-424.
[9] K. Schutte and J. Glass, Speech recognition with localized time-frequency pattern detectors, Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, December 2007, pp. 341-344.
[10] Y. Shinohara and N. Otsu, Facial expression recognition using Fisher weight maps, Proceedings of the 6th IEEE International Conference on Automatic Face and Gesture Recognition, May 2004, pp. 499-504.
[11] S. Y. Zhao and N. Morgan, Multi-stream spectro-temporal features for robust speech recognition, Proceedings of the Interspeech 2008, September 2008, pp. 898-901. |