Publication Search Results

TitleAuthorBibliographicsort descendingDateGroupLinks
Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual InformationH.H. Yang, S. van Vuuren, and H. HermanskyProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, ArizonaMarch 1999Speech
Using Boosting to Improve a Hybrid HMM/Neural Network Speech RecognizerH. SchwenkProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1009-1012March 1999Speech[PDF]

Size Matters: An Empirical Study of Neural Network Training for Large Vocabulary Continuous Speech RecognitionD. Ellis and N. MorganProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-1013-1016March 1999Speech[PDF]

Dynamic Classifier Combinations in Hybrid Speech Recognition Systems Using Utterance-Level Confidence ValuesK. Kirchhoff and J. BilmesProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-693-696March 1999Speech[PDF]

Buried Markov Models for Speech RecognitionJ. BilmesProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1999), Phoenix, Arizona, pp. II-713-716March 1999Speech[PDF]

Feature Extraction Using Non-Linear Transformation for Robust Speech Recognition on the Aurora DatabaseS. Sharma, D. Ellis, S. Kajarekar, P. Jain, and H. HermanskyProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. II-1117-1120June 2000Speech[PDF]

Data-driven RASTA Filters in ReverberationM. Shire and B. ChenProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1627-1630June 2000Speech[PDF]

Tandem Connectionist Feature Stream Extraction for Conventional HMM SystemsH. Hermansky, D. Ellis, and S. SharmaProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Istanbul, Turkey, pp. III-1635-1638June 2000Speech[PDF]

Tandem Acoustic Modeling in Large-Vocabulary RecognitionD. Ellis, R. Singh, and S. SivadasProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, UtahMay 2001Speech
A Study of Two Dimensional Linear Descriminants For ASRS. Kajarekar, B. Yegnanarayana, and H. HermanskyProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, UtahMay 2001Speech
Multi-Stream ASR trained with Heterogeneous Reverberant EnvironmentsM.L. ShireProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, UtahMay 2001Speech[PDF]

Global Posterior Probability Estimates as Confidence Measures in an Automatic Speech Recognition SystemW. WarrenProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), Salt Lake City, UtahMay 2001Speech
Hierarchical Tandem Feature ExtractionS. Sivadas and H. HermanskyProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, FloridaMay 2002Speech[PDF]

Using Prosodic and Lexical Information for Speaker IdentificationF. Weber, L. Manganaro, B. Peskin, and E. ShribergProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2002), Orlando, FloridaMay 2002Speech[PDF]

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 Jhu Summer WorkshopK. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. SaenkoProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, HawaiiApril 2007Speech
Word-Conditioned Phone N-Grams for Speaker RecognitionH. Lei and N. MirghaforiProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, pp. 253-256April 2007Speech[PDF]

Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear PredictionP. Motlicek, V. Ullal, and H. HermanskyProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 1, pp. 265-268April 2007Speech
Statistical Sentence Extraction for Information DistillationD. Hakkani-Tur and G. TurProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 1-4April 2007Speech[PDF]

Entropy Based Classifier Combination for Sentence SegmentationM. Magimai Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. MirghaforiProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 189-192April 2007Speech[PDF]

Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech RecognitionJ. Zheng, O. Cetin, M.-Y. Huang, X. Lei, A. Stolcke, and N. MorganProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 633-636April 2007Speech
An Articulatory Feature-Based Tandem Approach and Factored Observation ModelingO. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. LivescuProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 645-648April 2007Speech
Manual Transcription of Conversational Speech at the Articulatory Feature LevelK. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. LavoieProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 953-956April 2007Speech
Comparing Evaluation Metrics for Sentence Boundary DetectionY. Liu and E. ShribergProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Vol. 4, pp. 185-188, Honolulu, HawaiiApril 2007Speech[PDF]

Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video FeaturesG. Friedland, H. Hung, and C. YeoProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4069-4072April 2009Speech[PDF]

Fusing Short Term and Long Term Features for Improved Speaker DiarizationG. Friedland, O. Vinyals, Y. Huang, and C. MüllerProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4077-4080April 2009Speech[PDF]

The SRI NIST 2008 Speaker Recognition Evaluation SystemS. S. Kajarekar, N. Scheffer, M. Graciarena, E. Shriberg, A. Stolcke, L. Ferrer, and T. BockletProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4205-4208April 2009Speech[PDF]

Discriminative Pronunciation Learning Using Phonetic Decoder and Minimum-Classification-Error CriterionO. Vinyals, L. Deng, D. Yu, and A. AceroProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4445-4448April 2009Speech[PDF]

Speaker Recognition Using Syllable-Based Constraints for Cepstral Frame SelectionT. Bocklet and E. ShribergProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4525-4528April 2009Speech[PDF]

Syntactically Informed Models for Comma PredictionB. Favre, D. Hakkani-Tür, and E. ShribergProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4697-4700April 2009Speech[PDF]

Genre Effects on Automatic Sentee Segmentation of Speech: A Comparison of Broadcast News and Broadcast ConversationsncJ. Kolar, Y. Liu, and E. ShribergProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4701-4704April 2009Speech[PDF]

Exploiting User Feedback for Language Model Adaptation in Meeting RecognitionD. Vergyri, A. Stolcke, and G. TurProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4737-4740April 2009Speech[PDF]

Associating Children’s Non-Verbal and Verbal Behaviour: Body Movements, Emotions, and Laughter in a Human-Robot InteractionA. Batliner, S. Steidl, and E. NöthProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 22-27May 2011Speech[PDF]

Bird Species Recognition Combining Acoustic and Sequence ModelingM. Graciarena, M. Delplanche, E. Shriberg, and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 341-344May 2011Speech[PDF]

The IBM 2009 GALE Arabic Speech Transcription SystemB. Kingsbury, H. Soltau, G. Saon, S. Chu, H.-K. Kuo, L. Mangu, S. Ravuri, A. Janin, and N. MorganProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4672-4675May 2011Speech[PDF]

Making the Most from Multiple Microphones in Meeting RecognitionA. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 4992-4995May 2011Speech[PDF]

The SRI NIST 2010 Speaker Recognition Evaluation SystemN. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg, and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5292-5295May 2011Speech[PDF]

Language-Independent Constrained Cepstral Features for Speaker RecognitionE. Shriberg and A. StolckeProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2011), Prague, Czech Republic, pp. 5296-5299May 2011Speech[PDF]

Multimodal City-Verification on Flickr Videos Using Acoustic and Textual FeaturesH. Lei, J. Choi, and G. FriedlandProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

Spectro-Temporal Gabor Features for Speaker RecognitionH. Lei, B. T. Meyer, and N. MirghaforiProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

Discriminative Training for Speech Recognition is Compensating for Statistical Dependence on the HMM FrameworkD. Gillick and S. Wegmann, L. GillickProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept DetectionP.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-JohnsProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

Easy Does It: Robust Spectro-Temporal Many-Stream ASR Without Fine Tuning StreamsS. Ravuri and N. MorganProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech
Articulatory Features for Expressive Speech SynthesisA. Black, H. T. Bunnell, Y. Dou, P. Kumar, F. Metze, D. Perry, T. Polzehl, K. Prahallad, S. Steidl, and C. VaugProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, JapanMarch 2012Speech[PDF]

Language Model Combination and Adaptation Using Weighted Finite State TransducersX. Liu, M. J. F. Gales, J. L. Hieronymus, and P. C. WoodlandProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, TexasMarch 2010Speech
Speech Intelligibility in the Presence of Cross-Channel Spectral AsynchronyT. Arai and S. GreenbergProceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-98), Seattle, Washington, pp. 933-936May 1998Speech[PDF]

Multimodal Location Estimation of Consumer Media – Dealing with Sparse Training DataJ. Choi, G. Friedland, V. Ekambaram, and K. RamchandranProceedings of the IEEE International Conference on Multimedia and Expo, Melbourne, Australia, pp. 43-48July 2012Speech[PDF]

Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location EstimationJ. Choi and G. FriedlandProceedings of the IEEE International Conference on Semantic Computing (ICSC 2011), Palo Alto, California, pp. 243-246September 2011Speech[PDF]

On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia RetrievalR. Mertens, P.-S. Huang, L. Gottlieb, G. Friedland, and A. DivakaranProceedings of the IEEE International Symposium on Multimedia, Dana Point, California, pp. 446-451December 2011Speech[PDF]

Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based ApproachN. Mirghafori, N. Morgan, and H. BourlardProceedings of the IEEE Neural Networks for Signal Processing Workshop (NNSP 94), Ermioni, GreeceSeptember 1994Speech[PDF]

Computational Auditory Scene Analysis Exploiting Speech-Recognition KnowledgeD. EllisProceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, p. 4October 1997Speech[PDF]

Pages