Publication Search Results

Title | Author | Bibliographic | Date | Group | Links
The Modulation Spectrogram: In Pursuit of an Invariant Representation of Speech | S. Greenberg and B. Kingsbury | The 22nd International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1997), Munich, Germany, Vol. 3, pp. 1647-1650 | April 1997 | Speech | [PDF]
From Here to Utility - Melding Phonetic Insight with Speech Technology | S. Greenberg | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
Whither Speech Technology? - A Twenty-First Century Perspective | S. Greenberg | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
Recognition in a New Key - Towards a Science of Spoken Language | S. Greenberg | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 1998), Seattle, Washington, pp. 1041-1045 | May 1998 | Speech | [PDF]
Speaking in Shorthand - A Syllable-Centric Perspective for Understanding Pronunciation Variation | S. Greenberg | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, Netherlands, pp. 47-56 | 1998 | Speech | [PDF]
On the Origins of Speech Intelligibility in the Real World | S. Greenberg | Proceedings of the ESCA Workshop of Robust Speech Recognition, Pont-a-Mousson, France, pp. 23-32 | 1997 | Speech | [PDF]
Understanding Speech Understanding | S. Greenberg | Proceedings of the ESCA Workshop on the "Auditory Basis of Speech Perception," Keele University, Staffordshire, UK, pp. 1-8 | 1996 | Speech | [PDF]
Temporal Masking for Bit-Rate Reduction in Audio Codec based on Frequency Domain Linear Prediction | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4781-4784 | April 2008 | Speech | [PDF]
Autoregressive Modeling of Hilbert Envelopes for Wide-Band Audio Coding | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of 124th Convention of Audio Engineering Society (AES), Amsterdam, the Netherlands, paper 7481 | May 2008 | Speech |
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain | S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri | Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008), Brisbane, Australia | September 2008 | Speech |
An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings | S. Cuendet, E. Shriberg, B. Favre, J. Fung, and D. Hakkani-Tür | Proceedings of the SIGIR Workshop on Searching Conversational Spontaneous Speech, Amsterdam, Netherlands, pp. 43-59 | July 2007 | Speech |
Model Adaptation for Sentence Segmentation from Speech | S. Cuendet, D. Hakkani-Tur, and G. Tur | Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 102-105 | December 2006 | Speech | [PDF]
Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech | S. Cuendet, D. Hakkani-Tur, and E. Shriberg | Proceedings of Fourth International Conference on Machine Learning and Multimodal Interaction, Brno, Czech Republic, pp. 144-155 | June 2007 | Speech | [PDF]
Cross-Genre Feature Comparisons for Spoken Sentence Segmentation | S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, and B. Favre | Proceedings of International Conference on Semantic Computing, IEEE Computer Society, pp. 265-274, Irvine, California. Also published in International Journal of Semantic Computing, Volume 1, Issue 3, World Scientific, USA, pp. 335-346 | September 2007 | Speech | [PDF]
An Elitist Approach to Articulatory-Acoustic Feature Classification | S. Chang, S. Greenberg, and M. Wester | Proceedings of the 7th European Conference on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark | September 2001 | Speech | [PDF]
Automatic Phonetic Transcription of Spontaneous Speech (American English) | S. Chang, L. Shastri, and S. Greenberg | Proceedings of the 6th International Conference on Spoken Language Processing (ICSLP 2000), Beijing, China | October 2000 | Speech | [PDF]
A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition | S. Chang | Ph.D. Thesis, University of California at Berkeley. Also ICSI Technical Report TR-02-007 | September 2002 | Speech | [PDF]
System Output Combination for Improved Speaker Diarization | S. Bozonnet, N. Evans, X. Anguera, O. Vinyals, G. Friedland, and C. Fredouille | Proceedings of the 11th International Conference of the International Speech Communication Association (Interspeech 2010), Makuhari, Japan, pp. 2642-2645 | September 2010 | Speech | [PDF]
Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings | S. Bhagat, H. Carvey, and E. Shriberg | Proceedings of the 15th International Congress of Phonetic Sciences (ICPhS 2003), Barcelona, Spain | August 2003 | Speech | [PDF]
Source Separation Based on Binaural Cues and Source Model Constraints | R. Weiss, M. Mandel, and D. Ellis | Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 419-422 | September 2008 | Speech | [PDF]
Temporal Constraints on Speech Intelligibility as Deduced From Exceedingly Sparse Spectral Representations | R. Silipo, S. Greenberg, and T. Arai | Proceedings of the 6th European Conference on Speech Communication and Technology (Eurospeech '99), Budapest, Hungary, pp. VI-2687-2690 | September 1999 | Speech | [PDF]
Prosodic Stress Revisited: Reassessing the Role of Fundamental Frequency | R. Silipo and S. Greenberg | Proceedings of the National Institute of Standards and Technology Speech Transcription Workshop, College Park, Maryland | May 2000 | Speech | [PDF]
Automatic Transcription of Prosodic Stress for Spontaneous English Discourse | R. Silipo and S. Greenberg | Proceedings of the International Congress of Phonetic Sciences, San Francisco, California, Vol. 3, pp. 2351-2354 | August 1999 | Speech | [PDF]
Introduction to the Special Issue on Processing Morphologically Rich Languages | R. Sarikaya, K. Kirchhoff, T. Schultz, and D. Hakkani-Tür | IEEE Transactions on Audio, Speech and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 861-862 | July 2009 | Speech | [PDF]
From AUDREY to Siri: Is Speech Recognition A Solved Problem? | R. Pieraccini | Presented at the Mobile Voice Conference, San Francisco, California | March 2012 | Speech | [PDF]
A Human Benchmark for Language Recognition | R. Orr and D. A. Van Leeuwen | Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2175-2178 | September 2009 | Speech |
On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval | R. Mertens, P.-S. Huang, L. Gottlieb, G. Friedland, and A. Divakaran | Proceedings of the IEEE International Symposium on Multimedia, Dana Point, California, pp. 446-451 | December 2011 | Speech | [PDF]
Acoustic Super Models for Large Scale Video Event Detection | R. Mertens, H. Lei, L. Gottlieb, G. Friedland, and A. Divakaran | Proceedings of the ACM International Workshop on Events in Multimedia (EiMM11), Scottsdale, Arizona | November 2011 | Speech | [PDF]
Features Based on Auditory Physiology and Perception | R. M. Stern and N. Morgan | In Techniques for Noise Robustness in Automatic Speech Recognition, T. Virtanen, B. Raj, and R. Singh, Wiley Publishing | 2012 | Speech |
Hearing is Believing: Biologically-Inspired Feature Extraction for Robust Automatic Speech Recognition | R. M. Stern and N. Morgan | Signal Processing Magazine, Vol. 29, No. 6, pp. 34-43 | November 2012 | Speech | [PDF]
An Improved Approximation Algorithm for Vertex Cover with Hard Capacities | R. Gandhi, E. Halperin, S. Khuller, G. Kortsarz, and A. Srinivasan | Proceedings of the 30th International Colloquium on Automata, Languages and Programming (ICALP 2003), Eindhoven, The Netherlands, pp. 164-175 | June 2003 | Speech | [PDF]
Meeting Recorder Project: Dialog Act Labeling Guide | R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg | ICSI Technical Report TR-04-002 | February 2004 | Speech | [PDF]
Automated Information Extraction in Production | R. Desutter, J.P. Evain, G. Friedland, A. Messina, and M. Sano | Special issue in Multimedia Tools and Applications, Springer | 2011 | Speech |
The Challenge of Spoken Language Systems: Research Directions for the Nineties | R. Cole, L. Hirschman, L. Atlas, M. Beckman, A. Biermann, M. Bush, M. Clements, J. Cohen, O. Garcia, B. Hanson, H. Hermansky, S. Levinson, K. McKeown, N. Morgan, D. Novick, M. Ostendorf, S. Oviatt, P. Price, H. Silverman, J. Spitz, A. Waibel, C. Weinstein, S. Zahorian, and V. Zue | IEEE Transactions on Speech and Audio Processing, Vol. 3, No. 1, pp. 1-21 | January 1995 | Speech |
Meeting Acts: A Labeling System for Group Interaction in Meetings | R. Bates, P. Menning, E. Willingham, and C. Kuyper | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisbon, Portugal | September 2005 | Speech | [PDF]
On Using MLP Features in LVCSR | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004 | October 2004 | Speech | [PDF]
Tandem Connectionist Feature Extraction for Conversational Speech Recognition | Q. Zhu, B. Chen, N. Morgan, and A. Stolcke | Proceedings of the First International Workshop on Machine Learning for Multimodal Interaction (MLMI 2004), Martigny, Switzerland | June 2004 | Speech |
Improved MLP Structures for Data-Driven Feature Extraction for ASR | Q. Zhu, B. Chen, F. Grezl, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132 | September 2005 | Speech | [PDF]
Using MLP Features in SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B.Y. Chen, and N. Morgan | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2141-2144 | September 2005 | Speech | [PDF]
Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System | Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan | Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004 | November 2004 | Speech | [PDF]
How to Put It Into Words - Using Random Forests to Extract Symbol Level Descriptions from Audio Content for Concept Detection | P.-S. Huang, R. Mertens, A. Divakaran, G. Friedland, and M. Hasegawa-Johnson | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2012), Kyoto, Japan | March 2012 | Speech | [PDF]
Feature Transformations and Combinations for Improving ASR Performance | P. Somervuo, B. Chen, and Q. Zhu | Proceedings of EUROSPEECH 2003, Geneva | September 2003 | Speech | [PDF]
Experiments with Linear and Nonlinear Feature Transformations in HMM Based Phone Recognition | P. Somervuo | Proceedings of ICASSP-2003, Hong Kong | April 2003 | Speech | [PDF]
Speech Modeling Using Variational Bayesian Mixture of Gaussians | P. Somervuo | Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), Denver, Colorado | September 2002 | Speech | [PDF]
Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear Prediction | P. Motlicek, V. Ullal, and H. Hermansky | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 1, pp. 265-268 | April 2007 | Speech |
Perceptually Motivated Sub-Band Decomposition for FDLP Audio Coding | P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri, and M. Athineos | Proceedings of 11th International Conference on Text, Speech, and Dialogue (TSD 2008), Brno, Czech Republic, pp. 435-442 | September 2008 | Speech | [PDF]
A Methodology for Comparing Grammar-Based and Robust Approaches to Speech Understanding | P. Bouillon, N. Chatzichrisafis, B.A. Hockey, M. Rayner, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 1877-1880 | September 2005 | Speech |
A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation | P. Bouillon, M. Rayner, N. Chatzichrisafis, B.A. Hockey, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao | Proceedings of the 10th Annual Conference of the European Association of Machine Translation (EAMT 2005), Budapest, Hungary, pp. 5-58 | May 2005 | Speech |
A Multilingual Shared Grammar for Recognition and Generation (in French) | P. Bouillon, M. Rayner, B. Novellas, Y. Nakao, M. Santaholma, M. Starlander, and N. Chatzichrisafis | Proceedings of the 13th Conference on Natural Language Processing (TALN 2006), Leuven, Belgium, pp. 93-102 | April 2006 | Speech |
