About ICSI Groups Projects Publications Events Partnerships Visitors' Program News Search
 
       
 

Publications

Speech

 

 
__/__/2009
Speaker Diarization and Identification

G. Friedland and D. van Leeuwen

In Semantic Computing, P. Sheu et al., eds., IEEE Press/Wiley

__/__/2009
Audio-Based Semantic Concept Classification for Consumer Video

K. Lee and D. Ellis

IEEE Transactions on Audio, Speech, and Language Processing, submitted

__/__/2009
Approaching On-Line Speaker Diarization and Audio-Visual Localization Through Aspects of Human Discourse

H. Hung, C. Yeo, and G. Friedland

Invited to Computer Vision and Image Understanding, Elsevier, to appear

__/__/2009
Prosodic Similarities of Dialog Act Boundaries Across Speaking Styles

E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet

In Linguistic Patterns in Spontaneous Speech, S.-C. Tseng, ed., pp. 213-239, Institute of Linguistics

__/__/2009
Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech

U. Guz, S. Cuendet, G. Tur, and D. Hakkani-Tür

To appear in IEEE Transactions on Audio, Speech and Language Processing

__/__/2009
Cascaded Model Adaptation for Dialog Act Segmentation and Tagging

U. Guz, G. Tur, D. Hakkani-Tür, and S. Cuendet

To appear in the Journal of Computer Speech and Language

__/__/2009
The CALO Meeting Assistant System

G. Tur, A. Stolcke, L. Voss, S. Peters, D. Hakkani-Tür, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, and F. Yang

To appear in IEEE Transactions on Audio, Speech, and Language Processing

__/__/2009
Speaker Adaptation of Language and Prosodic Models for Automatic Dialog Act Segmentation of Speech

J. Kolar, Y. Liu, and E. Shriberg

Speech Communication, in press

12/__/2009
Robust Speaker Diarization for Short Speech Recordings

D. Imseng and G. Friedland

To appear in the proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy

12/__/2009
Using Artistic Markers and Speaker Identification for Narrative-Theme Navigation of Seinfeld Episodes

G. Friedland, L. Gottlieb, and A. Janin

To appear in the proceedings of the 11th IEEE International Symposium on Multimedia (ISM2009), San Diego, California

12/__/2009
Any Questions? Automatic Question Detection in Meetings

K. Boakye, B. Favre, and D. Hakkani-Tür

To appear in the proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy

12/__/2009
Integrating Prosodic Features in Extractive Meeting Summarization

S. Xie, D. Hakkani-Tür, and G. Tur

To appear in the proceedings of the 11th Biannual IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy

10/__/2009
A View of the Parallel Computing Landscape

K. Asanovic, R. Bodik, J. Demmel, T. Keaveny, K. Keutzer, J. D. Kubiatowicz, N. Morgan, D. A. Patterson, K. Sen, J. Wawrzynek, D. Wessel, and K. A. Yelick

Communications of the ACM, Vol. 52, No. 10, pp. 56-67

10/__/2009
IXIR: A Statistical Information Distillation System

M. Levit, D. Hakkani-Tür, G. Tür, and D. Gillick

Journal of Computer Speech and Language, Vol. 23, Issue 4, pp. 527-542

10/__/2009
Visual Speaker Localization Aided by Acoustic Models

G. Friedland, C. Yeo, and H. Hung

Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 195-202

10/__/2009
Joke-o-Mat: Browsing Sitcoms Punchline by Punchline

G. Friedland, L. Gottlieb, and A. Janin

Proceedings of the ACM International Conference on Multimedia (ACM Multimedia 2009), Beijing, China, pp. 1115-1116

10/__/2009
Review of Cattelan, et al, "Watch-and-Comment as a Paradigm Toward Ubiquitous Interactive Video Editing"

G. Friedland

ACM Computer Reviews, CR136487

09/__/2009
Hill-Climbing Feature Selection for Multi-Stream ASR

D. Gelbart, N. Morgan, and A. Tsymbal

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2967-2970

09/__/2009
On the Use of Artificial Conversation Data for Speaker Recognition in Cars

L. Gottlieb and G. Friedland

Proceedings of the Third IEEE International Conference for Semantic Computing (ICSC-2009), Berkeley, California, pp. 124-128

09/__/2009
Combining Semantic and Syntactic Information Sources for 5-W Question Answering

S. Yaman, D. Hakkani-Tür, and G. Tur

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2707-2710

09/__/2009
Classification-Based Strategies for Combining Multiple 5-W Question Answering Systems

S. Yaman, D. Hakkani-Tür, G. Tur, R. Grishman, M. Harper, K. R. McKeown, A. Meyers, and K. Sharma

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2703-2706

09/__/2009
Leveraging Sentence Weights in a Concept-Based Optimization Framework for Extractive Meeting Summarization

S. Xie, B. Favre, D. Hakkani-Tür, and Y. Liu

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1503-1506

09/__/2009
ClusterRank: A Graph Based Method for Meeting Summarization

N. Garg, B. Favre, K. Riedhammer, and D. Hakkani-Tür

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1499-1502

09/__/2009
Phrase and Word Level Strategies for Detecting Appositions in Speech

B. Favre and D. Hakkani-Tür

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2711-2714

09/__/2009
Combined Low Level and High Level Features for Out-of-Vocabulary Word Detection

B. Lecouteux, G. Linarès, and B. Favre

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1187-1190

09/__/2009
Multi-Stream to Many-Stream: Using Spectro-Temporal Features for ASR

S. Y. Zhao, R. Ravuri, and N. Morgan

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2951-2954

09/__/2009
Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction

H. Lei and E. Lopez-Gonzalo

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 888-891

09/__/2009
Mel, Linear, and Antimel Frequency Cepstral Coefficients in Broad Phonetic Regions for Telephone Speaker Recognition

H. Lei and E. Lopez-Gonzalo

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2323-2326

09/__/2009
Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions?

E. Shriberg, S. Kajarekar, and N. Scheffer

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 1551-1554

09/__/2009
Modeling Other Talkers for Improved Dialog Act Recognition in Meetings

K. Laskowski and E. Shriberg

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2783-2786

09/__/2009
Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker Verification

M. Graciarena, T. Bocklet, E. Shriberg, A. Stolcke, and S. Kajarekar

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2015-2018

09/__/2009
A Human Benchmark for Language Recognition

R. Orr and D. A. Van Leeuwen

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2175-2178

09/__/2009
Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR System

F. Valente, M. Magimai-Doss, C. Plahl, and S. Ravuri

Proceedings of the 10th International Conference of the International Speech Communication Association (Interspeech 2009), Brighton, United Kingdom, pp. 2963-2966

08/__/2009
ICSI-CRF: The Generation of References to the Main Subject and Named Entities Using Conditional Random Fields

B. Favre and B. Bohnet

Proceedings of the Language Generation and Summarisation (UCNLG+Sum) Workshop at the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Lanaguage Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore, pp. 99-100

08/__/2009
Who, What, When, Where, Why? Comparing Multiple Approaches to the Cross-Lingual 5W Task

K. Parton, K. R. McKeown, R. Coyne, M. T. Diab, R. Grishman, D. Hakkani-Tür, M. Harper, H. Ji, W. Y. Ma, A. Meyers, S. Stolbach, A. Sun, G. Tur, W. Xu, and S. Yaman

To appear in the proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Lanaguage Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore

08/__/2009
Review of G. Welch, "History: The Use of the Kalman Filter for Human Motion Tracking in Virtual Reality"

G. Friedland

ACM Computing Reviews, CR137162

08/__/2009
Consensus Training for Consensus Decoding in Machine Translation

A. Pauls, J. DeNero, and D. Klein

Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, pp. 1418-1427

08/__/2009
Fast Consensus Decoding over Translation Forests

J. DeNero, D. Chiang, and K. Knight

Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Lanaguage Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore

08/__/2009
Asynchronous Binarization for Synchronous Grammars

J. DeNero, A. Pauls, and D. Klein

Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Lanaguage Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore

08/__/2009
Better Word Alignments with Supervised ITG Models

A. Haghighi, J. Blitzer, J. DeNero, and D. Klein

Proceedings of the Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and the Fourth International Joint Conference on Natural Lanaguage Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009), Singapore

08/__/2009
Review of L. Cairco, et al., "AVARI: Animated Virtual Agent Retrieving Information

G. Friedland

ACM Computing Reviews, CR137225

07/__/2009
Prosodic and Other Long-Term Features for Speaker Diarization

G. Friedland, O. Vinyals, Y. Huang, and C. Müller

IEEE Transactions on Audio, Speech, and Language Processing, Vol. 17, No. 5, pp. 985-993

07/__/2009
Generative and Discriminative Methods Using Morphological Information for Sentence Segmentation of Turkish

U. Guz, B. Favre, D. Hakkani-Tur, and G. Tur

IEEE Transactions on Speech, Audio and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 895-903

07/__/2009
Introduction to the Special Issue on Processing Morphologically Rich Languages

R. Sarikaya, K. Kirchhoff, T. Schultz, and D. Hakkani-Tür

IEEE Transactions on Audio, Speech and Language Processing, Special Issue on Processing Morphologically Rich Languages, Vol. 17, No. 5, pp. 861-862

07/__/2009
Research Developments and Directions in Speech Recognition and Understanding, Part 2

J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shgughnessy

IEEE Signal Processing Magazine, Vol. 26, No. 4, pp. 78-85

06/__/2009
Best Papers from the 10th IEEE International Symposium on Multimedia

G. Friedland and S.-C. Shen, eds.

International Journal on Semantic Computing (IJSC), World Scientific, Vol. 3, Issue 2

06/__/2009
Towards Structured Approaches to Arbitrary Data Selection and Performance Prediction for Speaker Recognition

H. Lei

Proceedings of the Third IAPR/IEEE International Conference on Biometrics (ICB 2009), Alghero, Italy

06/__/2009
Anchored Speech Recognition for Question Answering

S. Yaman, G. Tür, D. Vergyri, D. Hakkani-Tür, M. Harper, and W. Wang

Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009): Short Papers, Boulder, Colorado, pp. 265-268

06/__/2009
A Scalable Global Model for Summarization

D. Gillick and B. Favre

Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing at the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 10-18

06/__/2009
Sentence Boundary Detection and the Problem with the U.S.

D. Gillick

Proceedings of the North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009): Short Papers, Boulder, Colorado, pp. 241-244

06/__/2009
Synchronous Parsing of Syntactic and Semantic Structures

B. Bohnet

Proceedings of Quatrième Conférence Internationale Sur La Théorie Sens-Texte (Fourth International Conference on Meaning-Text Theory, MTT’09), Montreal, Canada

06/__/2009
Efficient Parsing of Syntactic and Semantic Dependency Structures

B. Bohnet

Presented at the 13th Conference on Computational Natural Language Learning (CoNLL-2009), Boulder, Colorado

06/__/2009
Review of P. Dev and W. Heinrichs, "Learning Medicine Through Collaboration and Action: Collaborative, Experimental, Networked Learning Environments"

G. Friedland

ACM Computing Reviews, CR136993

05/__/2009
Research Developments and Directions in Speech Recognition and Understanding, Part 1

J. Baker, L. Deng, J. Glass, S. Khudanpur, C.-H. Lee, N. Morgan, and D. O'Shgughnessy

IEEE Signal Processing Magazine, Vol. 26, No. 3, pp. 75-80

05/__/2009
Efficient Parsing for Transducer Grammars

J. DeNero, M. Bansal, A. Pauls, and D. Klein

Proceedings of North American Chapter of the Association for Computational Linguistics Human Language Technologies Conference (NAACL HLT 2009), Boulder, Colorado, pp. 227-235.

04/__/2009
Fusion of Short-Term and Long-Term Features for Improved Speaker Diarization

G. Friedland, O. Vinyals, Y. Huang, and C. Müller

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4077-4080

04/__/2009
Multi-Modal Speaker Diarization of Real-World Meeting Using Compressed-Domain Video Features

G. Friedland, H. Hung, and C. Yeo

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4069-4072

04/__/2009
A Global Optimization Framework for Meeting Summarization

D. Gillick, K. Riedhammer, B. Favre, and D. Hakkani-Tur

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4769-4772

04/__/2009
Towards Automatic Argument Diagramming of Multiparty Meetings

D. Hakkani-Tür

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4753-4756

04/__/2009
Syntactically Informed Models for Comma Prediction

B. Favre, D. Hakkani-Tür, and E. Shriberg

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4697-4700

04/__/2009
Genre Effects on Automatic Sentence Segmentation of Speech: A Comparison of Broadcast News and Broadcast Conversations

J. Kolar, Y. Liu, and E. Shriberg

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4701-4704

04/__/2009
A Variational EM Algorithm for Learning Eigenvoice Parameters in Mixed Signals

R. Weiss and D. Ellis

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 113-116, Taipei, Taiwan

04/__/2009
The SRI NIST 2008 Speaker Recognition Evaluation System

S. S. Kajarekar, N. Scheffer, M. Graciarena, E. Shriberg, A. Stolcke, L. Ferrer, and T. Bocklet

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4205-4208

04/__/2009
Speaker Recognition Using Syllable-Based Constraints for Cepstral Frame Selection

T. Bocklet and E. Shriberg

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, pp. 4525-4528

03/__/2009
Analytics for Experts

G. Friedland

Featured paper in ACM SIGMM Records, Vol. 1, Issue 1

03/__/2009
Review of E. Villalon, “High-Dimensionality Data Reduction in Java”

G. Friedland

ACM Computing Reviews

02/__/2009
Can We Escape the Trough of Disillusionment?--A Perspective on E-learning Technology Research from the ACM Workshop on Educational Multimedia and Multimedia Education

G. Friedland, L. Knipping, W. Huerst, and M. Muhlhauser

ACM E-Learn Journal

02/__/2009
Multimodal Interfaces for Automotive Applications (MIAA)

C. Müller and G. Friedland

Proceedings of the ACM International Conference on Intelligent User Interfaces (IUI 2009), Sanibel, Florida, pp. 493-494

02/__/2009
Hill-Climbing Ensemble Feature Selection with a Larger Ensemble

D. Gelbart

ICSI Technical Report TR-09-001

__/__/2008
ICSI System Description for SRE2008 Submission

H. Lei and D.V. Leeuwen

Speaker Recognition Evaluation 2008, National Institute of Standards and Technology

12/__/2008
A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy

O. Vinyals and G. Friedland

Proceedings of the 10th IEEE International Symposium on Multimedia, Berkeley, California, pp. 61-65

12/__/2008
A Keyphrase Based Approach to Interactive Meeting Summarization

K. Riedhammer, B. Favre, and D. Hakkani-Tur

Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 153-156

12/__/2008
Efficient Sentence Segmentation Using Syntactic Features

B. Favre, D. Hakkani-Tur, S. Petrov, and D. Klein

Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 77-80

12/__/2008
The CALO Meeting Speech Recognition and Understanding System

G. Tur, A. Stolcke, L. Voss, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Hankkani-Tur, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, S. Peters, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, and F. Yang

Proceedings of IEEE Workshop on Spoken Language Technologies (SLT2008), Goa, India, pp. 69-72

12/__/2008
Ensemble Feature Selection for Multi-stream Automatic Speech Recognition

D. Gelbart

UC Berkeley dissertation

12/__/2008
Audio Segmentation for Meetings Speech Processing

K. A. Boakye

UC Berkeley dissertation

12/__/2008
Efficient Data Selection for Machine Translation

A. Mandal, D. Vergyri, W. Wang, J. Zheng, A. Stolcke, G. Tür, D. Hakkani-Tür, and N. Fazil Ayan

Proceedings of IEEE/ACL Workshop on Spoken Languge Technologies (SLT), Goa, India, pp. 261-264

11/__/2008
The ICSI Summarization System at TAC 2008

D. Gillick, B. Favre, and D. Hakkani-Tur

Proceedings of Text Analysis Conference (TAC), Gaithersburg, Maryland

11/__/2008
Multimedia Information Extraction Roadmap

G. Myers, G. Tür, L. Voss, B. Bolles, S. Kajarekar, E. Shriberg, and D. Hakkani-Tür

Proceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Arlington, Virginia

11/__/2008
A Comparison of Single- and Multi-Objective Programming Approaches to Problems with Multiple Design Objectives

S. Yaman and C.-H. Lee

Journal of Signal Processing Systems, MLSP special issue

10/__/2008
Role Recognition for Meeting Participants: An Approach Based on Lexical Information and Social Network Analysis

N. Garg, S. Favre, H. Salamin, D. Hakkani-Tur, and A. Vinciarelli

Proceedings of 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 693-696.

10/__/2008
Live Speaker Identification in Conversations

G. Friedland and O. Vinyals

Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1017-1018

10/__/2008
Towards Audio-Visual On-Line Diarization of Participants in Group Meetings

H. Hung and G. Friedland

Proceedings of European Conference on Computer Vision (ECCV), Marseille, France

10/__/2008
Automated Lecture Recording

G. Friedland, L. Knipping, and W. Huerst

Encyclopedia of Multimedia, B. Furht, ed., Springer

10/__/2008
Multi-modal Speaker Diarization of Real-world Meetings Using Compressed-domain Video Features

G. Friedland, H. Hung, and C. Yeo

ICSI Technical Report TR-08-007, October 2008

10/__/2008
Sampling Alignment Structure Under a Bayesian Translation Model

J. DeNero, A. Bouchard-Côté, and D. Klein

Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), Waikiki, Honolulu, Hawaii, pp. 314-323

10/__/2008
Multimedia Education—Can We Find Unity in Diversity?

G. Friedland, W. Hürst, and L. Knipping

Proceedings of the 16th ACM International Conference on Multimedia, Vancouver, Canada, pp. 1115-1116

10/__/2008
Personalized, Interactive Tag Recommendation for Flickr

N. Garg and I. Weber

Proceedings of the 2nd ACM International Conference on Recommender Systems, Lausanne, Switzerland, pp. 67-74

09/__/2008
Packing the Meeting Summarization Knapsack

K. Riedhammer, D. Gillick, B. Favre, and D. Hakkani-Tur

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 2434-2437

09/__/2008
Speech-overlapped Acoustic Event Detection for Automotive Applications

C. Müller, J. I. Biel, E. Kim, and D. Rosario

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 2590-2593

09/__/2008
Multi-Stream Spectro-Temporal Features for Robust Speech Recognition

S. Y. Zhao and N. Morgan

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 898-901

09/__/2008
Two's a Crowd: Improving Speaker Diarization by Automatically Identifying and Excluding Overlapped Speech Authors

K. Boakye, O. Vinyals, and G. Friedland

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 32-35

09/__/2008
Getting the Last Laugh: Automatic Laughter Segmentation in Meetings

M. Knox, N. Morgan, and N. Mirghafori

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 797-800

09/__/2008
Development of the SRI/Nightingale Arabic ASR system

D. Vergyri, A. Mandal, W. Wang, A. Stolcke, J. Zheng, M. Graciarena, D. Rybach, C. Gollan, R. Schlater, K. Kirchoff, A. Faria, and N. Morgan

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 1437-1440

09/__/2008
Cross-Lingual Sentence Extraction for Information Distillation

A. Singla and D. Hakkani-Tur

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 2707-2710

09/__/2008
The Value of Auditory Offset Adaptation and Appropriate Acoustic Modeling

H. Wang, D. Gelbart, H.G. Hirsch, and W. Hemmert

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 902-905

09/__/2008
Unsupervised Learning of Edit Parameters for Matching Name Variants

D. Gillick, D. Hakkani-Tur, and M. Levit.

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 467-470

09/__/2008
Best Papers from the Second IEEE International Conference on Semantic Computing (IJSC)

G. Friedland and C. Martell, eds.

International Journal on Semantic Computing (IJSC), Vol. 2, Issue 3

09/__/2008
Perceptually Motivated Sub-Band Decomposition for FDLP Audio Coding

P. Motlicek, S. Ganapathy, H. Hermansky, H. Garudadri, and M. Athineos

Proceedings of 11th International Conference on Text, Speech, and Dialogue (TSD 2008), Brno, Czech Republic, pp. 435-442

09/__/2008
Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification

E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, and F. Goodman

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 609-612

09/__/2008
The Case for Automatic Higher-Level Features in Forensic Speaker Recognition

E. Shriberg and A. Stolcke

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 1509-1512

09/__/2008
Source Separation Based on Binaural Cues and Source Model Constraints

R. Weiss, M. Mandel, and D. Ellis

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 419-422

09/__/2008
Spectral Noise Shaping: Improvements in Speech/Audio Codec Based on Linear Prediction in Spectral Domain

S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 675--678

09/__/2008
Modulation Spectrogram Features for Speaker Diarization

O. Vinyals and G. Friedland

Proceedings of the 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pp. 630-633

08/__/2008
Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings

O. Vinyals and G. Friedland

Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, pp. 426-431

08/__/2008
Appscio: A Software Environment for Semantic Multimedia Analysis

G. Friedland, E. Hensley, J. Schumacher, and R. Jain

Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, California, pp. 456-459

07/__/2008
Educational Multimedia

G. Friedland, L. Knipping, and W. Huerst (guest editors)

Special Section in IEEE Multimedia Magazine, pp. 54-74, July-Sept. 2008

05/__/2008
Automatic Laughter Segmentation

M. T. Knox

Master's report

05/__/2008
Speech Segmentation and Spoken Document Processing

M. Ostendorf, B. Favre, R. Grishman, D. Hakkani-Tur, M. Harper, D. Hillard, J. Hirschberg, J. Heng, J. G. Kahn, Y. Liu, S. Maskey, E. Matusov, H. Ney, A. Rosenberg, E. Shriberg, W. Wang, and C. Wooters

IEEE Signal Processing Magazine, Vol. 25, Issue 3, pp. 59-69

05/__/2008
Autoregressive Modeling of Hilbert Envelopes for Wide-Band Audio Coding

S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri

Proceedings of 124th Convention of Audio Engineering Society (AES), paper 7481, Amsterdam, the Netherlands

04/__/2008
Corrected Tandem Features for Acoustic Model Training

A. Faria and N. Morgan

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4737-4740

04/__/2008
Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies

H. Hung, Y. Huang, G. Friedland, and D. Gatica-Perez

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 2197-2200

04/__/2008
Overlapped Speech Detection for Improved Speaker Diarization in Multiparty Meetings

K.A. Boakye, B. Trueba-Hornero, O. Vinyals, and G. Friedland

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4353-4356

04/__/2008
An Iterative Unsupervised Learning Method for Information Distillation

K. Kamangar, D. Hakkani-Tur, G. Tur, and M. Levit

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4949 - 4952

04/__/2008
Punctuating Speech For Information Extraction

B. Favre, R. Grishman, D. Hillard, H. Ji, D. Hakkani-Tur, and M.Ostendorf

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5013-5016

04/__/2008
Name-Aware Speech Recognition for Interactive Question Answering

S. Stoyanchev, G. Tur, and D. Hakkani-Tür

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 5113-5116

04/__/2008
System Combination Using Auxiliary Information for Speaker Verification

L. Ferrer, M. Graciarena, A. Zymnis, and E. Shriberg

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4853-4856

04/__/2008
Exploiting Dialog Act Tagging and Prosodic Information for Action Item Identification

F. Yang, G. Tur, and E. Shriberg

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, Nevada, pp. 4941-4944

04/__/2008
Nonparametric Feature Normalization for SVM-Based Speaker Verification

A. Stolcke, S. Kajarekar, and L. Ferrer

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 1577-1580

04/__/2008
Multimedia Education in Computer Science -- A Little Bit of Everything Is Not Enough

G. Friedland, L. Knipping, and W. Huerst

IEEE Multimedia Magazine, Vol. 15, Issue 2, pp. 78-82

04/__/2008
Detecting Music in Ambient Audio by Long-Window Autocorrelation

K. Lee and D. Ellis

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 9-12

04/__/2008
Temporal Masking for Bit-Rate Reduction in Audio Codec based on Frequency Domain Linear Prediction

S. Ganapathy, P. Motlicek, H. Hermansky, and H. Garudadri

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4781-4784

03/__/2008
When a Mismatch Can Be Good: Large Vocabulary Speech Recognition Trained with Idealized Tandem Features

A. Faria and N. Morgan

Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, pp. 1574-1577

03/__/2008
Using Corpus and Knowledge-Based Similarity Measure in Maximum Marginal Relevance for Meeting Summarization

S. Xie and Y. Liu

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), Las Vegas, Nevada, pp. 4985-4988

01/__/2008
Anthropocentric Video Segmentation for Lecture Webcasts

G. Friedland and R. Rojas

EURASIP Journal on Image and Video Processing, Vol. 8, Issue 2, Article 9

01/__/2008
Comparisons of Recent Speaker Recognition Approaches based on Word Conditioning

H. Lei and N. Mirghafori

Proceedings of Odyssey 2008, Stellenbosch, South Africa

__/__/2007
Higher Level Features in Speaker Recognition

E. Shriberg

Speaker Classification I (Lecture Notes in Computer Science, Vol. 4343), pp. 241-259, Springer: Heidelberg / Berlin

__/__/2007
Term-Weighting for Summarization of Multi-Party Spoken Dialogues

G. Murray and S. Renals

In Machine Learning for Multimodal Interaction IV (Lecture Notes in Computer Science, Vol. 4892), pp. 155-166, Springer

12/__/2007
A Fast-Match Approach for Robust, Faster than Real-Time Speaker Diarization

Y. Huang, O. Vinyals, G. Friedland, C. Müller, N. Mirghafori, and C. Wooters

Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 693-698

12/__/2007
Speech Encoding in a Model of Peripheral Auditory Processing: Quantitative Assessment by Means of Automatic Speech Recognition

M. Holmberg, D. Gelbart, and W. Hemmert

Speech Communication, Vol. 49, Issue 12, pp. 917-932

12/__/2007
Building a Highly Accurate Mandarin Speech Recognizer

M-Y. Hwang, G. Peng, W. Wang, A. Faria, and A. Heidel

Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, pp. 490-495

12/__/2007
Morph-Based Speech Recognition and Modeling of Out-of-Vocabulary Words Across Languages

M. Creutz, T. Hirsimäki, M. Kurimo, A. Puurula, J. Pylkkönen, V. Siivola, M. Varjokallio, E. Arisoy, M. Saraclar, and A. Stolcke

ACM Transactions on Speech and Language Processing, Vol. 5, Issue 1, pp. 1-29

12/__/2007
Visualizing Large-Screen Electronic Chalkboard Content on Handheld Devices

A. Lüning, G. Friedland, L. Knipping, and R. Rojas

Proceedings of the Second IEEE International Workshop on Multimedia Technologies for E-Learning at 9th IEEE Symposium on Multimedia, Taichung, Taiwan, pp. 369-375

11/__/2007
Multimedia Technologies for E-Learning 2007

G. Friedland, L. Knipping, and N. Ludwig (eds.)

Special Issue of Interactive Technology Smart Education (ITSE), Vol. 4, Issue 4

09/__/2007
Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms

A. Stolcke, S. Kajarekar, L. Ferrer, and E. Shriberg

IEEE Transactions on Audio, Speech, and Language Processing. Special issue on speaker and language recognition, Vol. 15, Issue 7, IEEE Computer Society, pp. 1987-1998, California

09/__/2007
Acoustic Beamforming for Speaker Diarization of Meetings

X. Anguera, C. Wooters, and J. Hernando

IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 7, IEEE Computer Society, pp. 2011-2022, California

09/__/2007
Speaker Diarization For Multiple-distant-microphone Meetings Using Several Sources of Information

J. M. Pardo, X. Anguera, and C. Wooters

IEEE Transactions on Computers, Vol. 56, Issue 9, IEEE Computer Society, pp. 1212-1224, California

09/__/2007
Cross-genre Feature Comparisons for Spoken Sentence Segmentation

S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, and B. Favre

Proceedings of International Conference on Semantic Computing, IEEE Computer Society, pp. 265-274, Irvine, CA. Also published in International Journal of Semantic Computing, Volume 1, Issue 3, World Scientific, 335-346, USA.

09/__/2007
Multimedia Data Formats and Semantic Computing: A Practical Example and its Implications for the Future

G. Friedland

IEEE International Conference on Semantic Computing, Irvine, CA

09/__/2007
Educational Multimedia Systems: The Past, the Present, and a Glimpse into the Future

G. Friedland, W. Huerst, and L. Knipping

Proceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, pp. 1-4, Augsburg, Germany

09/__/2007
A Low-Cost Mobile Pointing and Drawing Device

K. Jantz, G. Friedland, L. Knipping, and R. Rojas

Proceedings of the ACM Workshop on Educational Multimedia and Multimedia Education at ACM Multimedia 2007, pp. 121-122, Augsburg, Germany

09/__/2007
Using Audio and Video Features to Classify the Most Dominant Person in Meetings

H. Hung, D. Jayagopi, C. Yeo, G. Friedland, S. Ba, J-M. Odobez, K. Ramchandran, N. Mirghafori, and D. Gatica-Perez

Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany

08/__/2007
Automatic Laughter Detection Using Neural Networks

M. Knox and N. Mirghafori

Proceedings of Interspeech 2007, ISCA, pp. 2973-2976, Antwerp, Belgium

08/__/2007
Exploiting Information Extraction Annotations for Document Retrieval in Distillation Tasks

D. Hakkani-Tur, G. Tur, and M. Levit

Proceedings of Interspeech 2007, ISCA, pp. 330-333, Antwerp, Belgium

08/__/2007
Co-training Using Prosodic and Lexical Information for Sentence Segmentation

U. Guz, S. Cuendet, D. Hakkani-Tur, and G. Tur

Proceedings of Interspeech 2007, ISCA, pp. 2597-2600, Antwerp, Belgium

08/__/2007
Detecting Deception Using Critical Segments

F. Enos, E. Shriberg, M. Graciarena, J. Hirschberg, and A. Stolcke

Proceedings of Interspeech 2007, ISCA, pp. 2281-2284, Antwerp, Belgium

08/__/2007
fMPE-MAP: Improved Discriminative Adaptation for Modeling New Domains

J. Zheng and A. Stolcke

Proceedings of Interspeech 2007, ISCA, pp. 1573-1576, Antwerp, Belgium

08/__/2007
Word-Conditioned HMM Supervectors for Speaker Recognition

H. Lei and N. Mirghafori

Proceedings of Interspeech 2007, ISCA, pp. 746-749, Antwerp, Belgium

08/__/2007
Prosodic Features and Feature Selection for Multi-lingual Sentence Segmentation

J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, and N. Mirghafori

Proceedings of Interspeech 2007, ISCA, pp. 2585-2588, Antwerp, Belgium

08/__/2007
Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification

G. Tur, E. Shriberg, A. Stolcke, and S. Kajarekar

Proceedings of the 8th International Conference of the ISCA (Interspeech--Eurospeech 2008), Antwerp, Belgium, pp. 2049-2052

08/__/2007
A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification

L. Ferrer, K. Sonmez, and E. Shriberg

Proceedings of Interspeech 2007, ISCA, pp. 738-741, Antwerp, Belgium

08/__/2007
Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings

J. Kolar, Y. Liu, and E. Shriberg

Proceedings of Interspeech 2007, ISCA, pp. 1621-1624, Antwerp, Belgium

08/__/2007
A Text-constrained Prosodic System for Speaker Verification

E. Shriberg and L. Ferrer

Proceedings of Interspeech 2007, ISCA, pp. 1226-1229, Antwerp, Belgium

08/__/2007
Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age

C. Müller and F. Burkhardt

Proceedings of Interspeech 2007, ISCA, pp. 2277-2280, Antwerp, Belgium

08/__/2007
The Blame Game: Performance Analysis of Speaker Diarization System Components

M. Huijbregts and C. Wooters

Proceedings of Interspeech 2007, ISCA, pp. 1857-1860, Antwerp, Belgium

08/__/2007
Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections

M. Huijbregts, C. Wooters, and R. Ordelman

Proceedings of Interspeech 2007, ISCA, pp. 2925-2928, Antwerp, Belgium

08/__/2007
Improving Speech Translation with Automatic Boundary Prediction

E. Matusov, D. Hillard, M. Magimai-Doss, D. Hakkani-Tur, M. Ostendorf, and H. Ney

Proceedings of Interspeech 2007, ISCA, pp. 2449-2452, Antwerp, Belgium

08/__/2007
A New Algorithm for High Speed Speech and Audio Coding

U. Guz, H. Gurkan, and B.S. Yarman

Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, Seville, Spain

08/__/2007
EEG Signal Compression Based on Classified Signature and Envelope Vector Sets

H. Gurkan, U. Guz, and B.S. Yarman

Proceedings of the European Conference on Circuit Theory and Design, IEEE Circuits and Systems Society and the European Circuit Society, pp. 420-423, Seville, Spain

08/__/2007
Selecting On-topic Sentences from Natural Language Corpora

M. Levit, E. Boschee, and M. Freedman

Proceedings of Interspeech 2007, ISCA, pp. 2793-2796, Antwerp, Belgium

07/__/2007
The Digital Hand, Vol 2 - How Computers Changed the Work of the American Financial, Telecommunications, Media, and Entertainment Industries (book review)

G. Friedland

IEEE Annals of the History of Computing, Vol. 29, Issue 3, IEEE Computer Society, pp. 72-75, California

07/__/2007
Object Cut and Paste in Images and Videos

G. Friedland, K. Jantz, T. Lenz, F. Wiesel, and R. Rojas

International Journal of Semantic Computing, World Scientific, Vol. 1, Issue 2, pp. 221-247, USA

07/__/2007
An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings

S. Cuendet, E. Shriberg, B. Favre, J. Fung, and D. Hakkani-Tür

Proceedings of the SIGIR Workshop on Searching Conversational Spontaneous Speech, Amsterdam, Netherlands, pp. 43-59

07/__/2007
Applications of Keyword-Constraining in Speaker Recognition

H. Lei

MS Thesis, University of California-Berkeley

06/__/2007
Computers and Commerce: A Study of Technology and Management at Eckert-Mauchly Computer Company, Engineering Research Associates, and Remington Rand, 1946-1957 (book review)

G. Friedland

EEE Annals of the History of Computing, Vol. 29, no. 2, IEEE Computer Society, pp. 74-77, California

06/__/2007
Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech

S. Cuendet, D. Hakkani-Tur and E. Shriberg

Proceedings of 4th International Conference on Machine Learning and Multimodal Interaction, pp. 144-155, Brno, Czech Republic

06/__/2007
Interpretation of Spatial Language in a Map Navigation Task

M. Levit and D. Roy

In IEEE Transactions on Systems, Man and Cybernetics, Part B, vol. 37, no. 3, IEEE Systems, man, and Cybernetics Society, pp.667-679

06/__/2007
Mutaphrase: Paraphrasing with FrameNet

M. Ellsworth and A. Janin

Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing (TextEntail), Prague, Czech Republic, pp. 143-150

05/__/2007
The ICSI RT07s Speaker Diarization System

C. Wooters and M. Huijbregts

Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 509-519

05/__/2007
The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System

A. Stolcke, X. Anguera, K. Boakye, O. Cetin, A. Janin, M. Magimai-Doss, C. Wooters, and J. Zheng

Proceedings of the Second International Workshop on Classification of Events, Activities, and Relationships (CLEAR 2007) and the Fifth Rich Transcription 2007 Meeting Recognition (RT 2007), Baltimore, Maryland, pp. 450-463

05/__/2007
Speaker Recognition Via Nonlinear Discriminant Features

L. Stoll, J. Frankel, and N. Mirghafori

Proceedings of Non-Linear Speech Processing 2007, ISCA, pp. 27-30, Paris, France

04/__/2007
Comparing Evaluation Metrics for Sentence Boundary Detection

Y. Liu and E. Shriberg

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Vol. 4, pp. 185-188, Honolulu, Hawaii

04/__/2007
Word-Conditioned Phone N-Grams for Speaker Recognition

H. Lei and N. Mirghafori

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, pp. 253-256

04/__/2007
Statistical Sentence Extraction for Information Distillation

D. Hakkani-Tur and G. Tur

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, vol. 4, pp. 1-4

04/__/2007
Entropy Based Classifier Combination for Sentence Segmentation

M. Magimai Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, vol. 4, pp. 189-192

04/__/2007
Manual Transcription of Conversational Speech at the Articulatory Feature Level

K. Livescu, A. Bezman, N. Borges, L. Yung, O. Cetin, J. Frankel, S. King, M. Magimai-Doss, X. Chi, and L. Lavoie

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, vol. 4, pp. 953-956

04/__/2007
Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 Jhu Summer Workshop

K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii

04/__/2007
Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition

J. Zheng, O. Cetin, M.-Y. Huang, X. Lei, A. Stolcke, and N. Morgan

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Honolulu, Hawaii, Vol. 4, pp. 633-636

04/__/2007
An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling

O. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. Livescu

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, Vol. 4, pp. 645-648

04/__/2007
Wide-Band Perceptual Audio Coding Based on Frequency-Domain Linear Prediction

P. Motlicek, V. Ullal, and H. Hermansky

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, Vol. 1, pp. 265-268

04/__/2007
Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings

X. Anguera, C. Wooters, J. Pardo, and J. Hernando

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, Vol. 4, pp. 241-244

04/__/2007
Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization

X. Anguera, T. Shinozaki, C. Wooters, and J. Hernando

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, Vol. 4 pp. 273-276

04/__/2007
A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition

O. Cheng, J. Dines, and M. Magimai Doss

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, Vol. 4, pp. 345-348

04/__/2007
Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition

L. Ferrer, E. Shriberg, S. Kajarekar, and K. Sonmez

Proceedings of International Conference on Acoustics, Speech, and Signal Processing 2007, Honolulu, Hawaii, Vol. 4, pp. 233-236

04/__/2007
Noise Robust Speaker Identification for Spontaneous Arabic Speech

M. Graciarena, S. Kajarekar, A. Stolcke, and E. Shriberg

Proceedings of the 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), Honolulu, Hawaii, Vol. 4, pp. 245-248

03/__/2007
Multimedia Technologies for E-learning

G. Friedland and L. Knipping (editors)

Special issue of International Journal of Interactive Technology Smart Education (ITSE), Vol 4, No 1, Troubador Publishing Ltd., UK

01/__/2007
How to Build a Spoken Dialog System with Limited (or No) Resources

M. Plauché, O. Cetin, and N. Uhdaykumar

Presented at the Workshop on AI in ICT for Development at the 20th International Joint Conference on AI (IJCAI07), Hyderabad, India

__/__/2006
The ICSI-SRI Spring 2006 Meeting Recognition System

A. Janin, A. Stolcke, X. Anguera, K. Boakye, O. Cetin, J. Frankel, and J. Zheng

In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006), Lecture Notes in Computer Science. Springer

__/__/2006
Robust Speaker Diarization for Meetings: ICSI TR06 Meetings Evaluation System

X. Anguera, C. Wooters, and J. Pardo

Lecture Notes in Computer Science, Volume 4299/2006, pp. 346-358, ISSN 0302-9743

12/__/2006
Let's DISCOH: Collecting an Annotated Open Corpus with Dialogue Acts and Reward Signals for Natural Language Helpdesks

G. Andeani, D. Di Fabbrizio, M. Gilbert, D. Gillick, D. Hakkani-Tur, and O. Lemon

Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 218-221

12/__/2006
Model Adaptation for Dialog Act Tagging

G. Tur, U. Guz, and D. Hakkani-Tur

Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 94-97

12/__/2006
Impact of Automatic Comma Prediction on POS/Name Tagging of Speech

D. Hillard, Z. Huang, H. Ji, R. Grishman, D. Hakkani-Tur, M. Harper, M. Ostendorf, and W. Wang

Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 58-61

12/__/2006
Model Adaptation for Sentence Segmentation from Speech

S. Cuendet, D. Hakkani-Tur, and G. Tur

Proceedings of the IEEE 2006 Workshop on Spoken Language Technology (SLT 2006), Palm Beach, Aruba, pp. 102-105

12/__/2006
Phonetic- and Speaker-Discriminant Features for Speaker Recognition

L. Stoll

UC Berkeley Masters Thesis

12/__/2006
Kernel Optimization for Support Vector Machines: Application to Speaker Verification

A. Hatch

UC Berkeley dissertation

11/__/2006
Detecting Categories in News Video Using Acoustic, Speech, and Image Features

S. Petrov, A. Faria, P. Michaillat, A. Berg, A. Stolcke, D. Klein, and J. Malik

Presented at the NIST TREC Video Retrieval Workshop, Gaithersburg, Maryland

10/__/2006
A Study in Machine Learning from Imbalanced Data for Sentence Boundary Detection in Speech

Y. Liu, N.V. Chawla, M.P. Harper, E. Shriberg, and A. Stolcke

Computer Speech and Language, Vol. 20, Issue 4, pp. 468-494

09/__/2006
Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings

J. Kolar, E. Shriberg, and Y. Liu

Proceedings of Ninth International Conference on Text, Speech and Dialogue (TSD 2006), Brno, Czech Republic, pp. 629-636

09/__/2006
On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings

J. Kolar, E. Shriberg, and Y. Liu

Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2014-2017

09/__/2006
Within-Class Covariance Normalization for SVM-Based Speaker Recognition

A.O. Hatch, S. Kajarekar, and A. Stolcke

Proceedings of the 9th International Conference on Spoken Lanugage Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1471-1474

09/__/2006
Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty Meetings

K. Boakye and A. Stolcke

Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1962-1965

09/__/2006
Friends and Enemies: A Novel Initialization for Speaker Diarization

X. Anguera, C. Wooters, and J. Hernando

Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 689-692

09/__/2006
Robust Speaker Diarization for Meetings: ICSI RT06s evaluation system

X. Anguera, C. Wooters, and J. Pardo

Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1674-1677

09/__/2006
Multi-Stream Speaker Diarization Systems for the Meetings Domain

A. Gallardo-Antolin, X. Anguera, and C. Wooters

Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech 2006—ICSLP), Philadelphia, Pennsylvania, pp. 2186-2189

09/__/2006
Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences

J. Pardo, X. Anguera, and C. Wooters

Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 2194-2197

09/__/2006
The ICSI+ Muilti-Lingual Sentence Segmentation System

M. Zimmerman, D. Hakkani-Tur, J. Fung, N. Mirghafori, L. Gottlieb, E. Shriberg, and Y. Liu

Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 117-120

09/__/2006
QASR: Question Answering Using Semantic Roles for Speech Interface

S. Stenchikova, D. Hakkani-Tur, and G. Tur

Proceedings of the 9th International Conference on Spoken Language Processing (ICSLP-Interspeech 2006), Pittsburgh, Pennsylvania, pp. 1185-1188

09/__/2006
Recent Innovations in Speech-to-Text Transcription at SRI-ICSI-UW

A. Stolcke, B. Chen, H. Franco, V.R.R. Gadde, M. Graciarena, M.-Y. Hwang, K. Kirchhoff, N. Morgan, X. Lin, T. Ng, M. Ostendorf, K. Sönmez, A. Venkataraman, D. Vergyri, W. Wang, J. Zheng, and Q. Zhu

IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1729-1744

09/__/2006
Enriching Speech Recognition with Automatic Detection of Sentence Boundaries and Disfluencies

Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper

IEEE Transactions on Audio, Speech and Language Processing, Vol. 14, Issue 5, pp. 1526-1540

06/__/2006
Hybrid Speech/Non-Speech Detector Applied to Speaker Diarization of Meetings

X. Anguera, M. Aguilo, C. Wooters, C. Nadeu, and J. Hernando

Proceedings of IEEE Odyssey: The Speaker and Language Recognition Workshop, San Juan de Puerto Rico, pp. 1-6

05/__/2006
Generalized Linear Kernels for One-Versus-All Classification: Application to Speaker Recognition

A.O. Hatch and A. Stolcke

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 585-588

05/__/2006
Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons

A. Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. Vergyri

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 321-324

05/__/2006
Purity Algorithms for Speaker Diarization of Meetings Data

X. Anguera, C. Wooters and J. Hernando

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France

05/__/2006
Nuts and Flakes: A Study of Data Characteristics in Speaker Diarization

N. Mirghafori and C. Wooters

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 1017-1020

05/__/2006
Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap

O. Cetin and E.E. Shriberg

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, pp. 357-360

05/__/2006
Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings

M. Zimmermann, A. Stolcke, E.E. Shriberg

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 581-584

05/__/2006
Speech Recognition for Illiterate Access to Information and Technology

M. Plauche', N. Udhyakummar, C. Wooters, J. Pal, and D. Ramachadran

Proceedings of the First International Conference on Information and Communication Technologies and Development (ICTD '06), Berkeley, California, pp. 83-92

05/__/2006
Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site

O. Cetin and E. Shriberg

Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 212-224

05/__/2006
Putting Linguistics into Speech Recognition: The Regulus Grammar Compiler

M. Rayner, B.A. Hockey, and P. Bouillon

CSLI Press

05/__/2006
REGULUS: A Generic Multilingual Open Source Platform for Grammar-Based Speech Applications

M. Rayner, P. Bouillon, B.A. Hockey, and N. Chatzichrisafis

Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, pp. 783-788

05/__/2006
Reranking for Sentence Boundary Detection in Conversational Speech

B. Roark, Y. Liu, M. Harper, R. Stewart, M. Lease, M. Snover, Z. Shafran, B. Dorr, J. Hale, A. Krasnyanskaya, and L. Young

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, pp. 545-548

05/__/2006
Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences

J.M. Pardo, X Anguera, and C. Wooters

Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 257-264

05/__/2006
Automatic Cluster Complexity and Quantity Selection: Towards Robust Speaker Diarization

X. Anguera, C. Wooters, and J. Hernando

Proceedings of the Third Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2006), Washington DC, pp. 248-256

04/__/2006
Tamil Market: A spoken dialog system for rural India

M. Plauche' and M. Prabaker

Working Papers in Computer-Human Interfaces

04/__/2006
The challenges of IT research in developing regions

E. Brewer, M. Demmer, M. Ho, R.J. Honicky, J. Pal, M. Plauche' and S. Surana

IEEE Pervasive Computing, Vol. 5, No. 2, pp. 15-23

04/__/2006
A Multilingual Shared Grammar for Recognition and Generation (in French)

P. Bouillon, M. Rayner, B. Novellas, Y. Nakao, M. Santaholma, M. Starlander, and N. Chatzichrisafis

Proceedings of the 13th Conference on Natural Language Processing (TALN 2006), Leuwen, Belgium, pp. 93-102

03/__/2006
Improving the Usability of MedSLT: Back-Translation and the Help System (in Japanese)

Y. Nakao, M. Rayner, N. Chatzichrisafis, K. Kanzaki, P. Bouillon, B.A. Hockey, and H. Isahara

Proceedings of the 12th Annual Meeting of the Japanese Society for Natural Language Processing (NLP2006), Tokyo, Japan

01/__/2006
Automatic Speech Recognition with an Adaptation Model Motivated by Auditory Processing

M. Holmberg, D. Gelbart and W. Hemmert

IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue 1, pp. 44-49

__/__/2005
Improved MLP Structures for Data-Driven Feature Extraction for ASR

Q. Zhu, B. Chen, F. Grezl, and N. Morgan

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132

11/__/2005
ICSI's 2005 Speaker Recognition System

N. Mirghafori, A.O. Hatch, S. Stafford, K. Boakye, D. Gillick, and B. Peskin

Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 23-28

11/__/2005
Combining Feature Sets with Support Vector Machines: Application to Speaker Recognition

A.O. Hatch, A. Stolcke, and B. Peskin

Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 75-79

11/__/2005
Speaker Diarization for Multi-Party Meetings Using Acoustic Fusion

X. Anguera, C. Wooters, and J. Hernando

Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 426-461

11/__/2005
A* Based Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings

M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke

Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2005), San Juan, Puerto Rico, pp. 215-219

10/__/2005
Japanese Speech Understanding Using Grammar Specialization

M. Rayner, N. Chatzichrisafis, P. Bouillon, Y. Nakao, H. Isahara, K. Kanzaki, B. A. Hockey, M. Santaholma, and M. Starlander

Proceedings of the Joint Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT-EMNLP 2005), Vancouver, Canada, pp. 26-27

09/__/2005
Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency Detection

Y. Liu, E. Shriberg, A. Stolcke, and M. Harper

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 3313-3316

09/__/2005
Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?

A. Venkataraman, Y. Liu, E. Shriberg, and A. Stolcke

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2777-2780

09/__/2005
Using MLP Features in SRI's Conversational Speech Recognition System

Q. Zhu, A. Stolcke, B.Y. Chen, and N. Morgan

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2141-2144

09/__/2005
Improved MLP Structures for Data-Driven Feature Extraction for ASR

Q. Zhu, B. Chen, F. Grezl and N. Morgan

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2129-2132

09/__/2005
Automatic Data Selection for MLP-Based Feature Extraction for ASR

C. Pelaez-Moreno, Q. Zhu, B. Chen, and N. Morgan

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 229-232

09/__/2005
Efficient Pitch-Based Estimation of VTLN Warp Factors

A. Faria and D. Gelbart

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 213-216

09/__/2005
A Methodology for Comparing Grammar-Based and Robust Approaches to Speech Understanding

P. Bouillon, N. Chatzichrisafis, B.A. Hockey, M. Rayner, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 1877-1880

09/__/2005
Spontaneous Speech: How People Really Talk, and Why Engineers Should Care

E. E. Shriberg

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 1781-1784

09/__/2005
Pushing the Envelope - Aside

N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard, and M. Athineos

IEEE Signal Processing Magazine, Vol. 22 No. 5, pp. 81-88

09/__/2005
Automatic Speech Recognition with Neural Spike Trains

M. Holmberg, D. Gelbart, U. Ramacher, and W. Hemmert

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal

09/__/2005
MLLR Transforms as Features in Speaker Recognition

A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, and A. Venkataraman

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 2425-2428

09/__/2005
The Effects of Speech Recognition and Punctuation on Information Extraction Performance

J. Makhoul, A. Baron, I. Bulyko, L. Nguyen, L. Ramshaw, D. Stallard, R. Schwartz, and B. Xiang

Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech 2005-Eurospeech 2005), Lisboa, Portugal, pp. 57-60

07/__/2005
Modeling Prosodic Feature Sequences for Speaker Recognition

E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, A. Stolcke

Speech Communication, Vol. 46, Issues 3-4, pp. 455-472

07/__/2005
Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System

A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters and J. Zheng

Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 463-475

07/__/2005
Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System

X. Anguera, C. Wooters, B. Peskin and M. Aguilo

Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 402-414

07/__/2005
Multi-Microphone Signal Processing for Automatic Speech Recognition in Meeting Rooms

M. Ferras Font

M.S. Thesis, Universitat Politecnica de Catalunya, Barcelona, Spain

07/__/2005
Comparison of Grammar Based and Statistical Language Models Trained on the Same Data

B.A. Hockey and M. Rayner

Presented at the Workshop on Spoken Language Understanding at the 20th AIII National Conference on Artificial Intelligence, Pittsburgh, Pennsylvania

07/__/2005
Toward Joint Segmentation and Classification of Dialog Acts in Multi-Party Meetings

M. Zimmermann, Y. Liu, E. Shriberg, and A. Stolcke

Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 187-193

07/__/2005
Accent Classification for Speech Recognition

A. Faria

Proceedings of the Second Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK, pp. 285-293

06/__/2005
Using Conditional Random Fields For Sentence Boundary Detection in Speech

Y. Liu, A. Stolcke, E. Shriberg, and M. Harper

Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, Michigan, pp.451-458

06/__/2005
A Voice-Enabled Procedure Browser for the International Space Station

M. Rayner, B.A. Hockey, N. Chatzichrisafis, K. Farrell and J.M. Renders

Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL 2005), Ann Arbor, Michigan, pp. 29-32 (interactive poster and demo track)

05/__/2005
The Sequential GMM: A Gaussian Mixture Model Based Speaker Verification System that Captures Sequential Information

S. Stafford

M.S. Thesis, University of California at Berkeley

05/__/2005
Speaker Recogntion in the Text-Independent Domain Using Keyword Hidden Markov Models

K. Boakye

M.S. Thesis, University of California at Berkeley

05/__/2005
Learning Discriminant Narrow-Band Temporal Patterns for Automatic Recognition of Conversational Telephone Speech

B.Y. Chen

Ph.D. Thesis, University of California at Berkeley

05/__/2005
A Generic Multi-Lingual Open Source Platform for Limited-Domain Medical Speech Translation

P. Bouillon, M. Rayner, N. Chatzichrisafis, B.A. Hockey, M. Santaholma, M. Starlander, H. Isahara, K. Kanzaki, and Y. Nakao

Proceedings of the 10th Annual Conference of the European Association of Machine Translation (EAMT 2005), Budapest, Hungary, pp. 5-58

03/__/2005
Automatic Dialog Act Segmentation and Classification in Multiparty Meetings

J. Ang, Y. Liu, and E. Shriberg

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 1061-1064

03/__/2005
Tonotopic Multi-Layered Perceptron: A Neural Network for Learning

B. Y. Chen, Q. Zhu, N. Morgan

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 945-948

03/__/2005
Improved Phonetic Speaker Recognition Using Lattice Decoding

A. O. Hatch, B. Peskin, and A. Stolcke

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 169-172

03/__/2005
Multi-Rate and Variable-Rate Modeling of Speech at Phone and Syllable Time Scales

O. Cetin and M. Ostendorf

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 665-668

03/__/2005
Speaker Detection Without Models

D. Gillick, S. Stafford, and B. Peskin

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 757-760

03/__/2005
Structural Metadata Research in the EARS Program

Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendorf, M. Tomalin, P. Woodland, and M. Harper

Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, Pennsylvania, pp. 957-960

__/__/2004
Text-Constrained Speaker Recognition on a Text-Independent Task

K. Boakye and B. Peskin

Odyssey 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain

__/__/2004
Speech Recognition Technology

H. Franco, F. Beaufays, N. Morgan and H. Bourlard

Chapter in Handbook of Brain Theory and Neural Networks, 2nd edition, M. Arbib ed. MIT Press

__/__/2004
Show what you know: musings on the reporting of negative results in speech recognition research

H. Hermansky and N. Morgan

Journal of Negative Results in Speech and Audio Sciences

__/__/2004
Speech Recognition and the Auditory Perspective

N. Morgan, H. Bourlard and H. Hermansky

Chapter in Speech Processing in the Auditory System, S. Greenberg and W. Ainsworth, eds, Springer

__/__/2004
Scaling Up: Learning Large-scale Recognition Methods from Small-scale Recognition Tasks

N. Morgan, B. Chen, Q. Zhu and A. Stolcke

ICSI Technical Report tr-03-02. Also Special Workshop in Maui(SWIM) paper 218.

__/__/2004
Multimodal Model Integration for Sentence Unit Detection

L. Chen, Y. Liu, M. Harper and E. Shriberg

In 6th International Conference on Multimodal Interfaces, October 2004

__/__/2004
Meeting Recorder Project: Dialog Act Labeling Guide

R. Dhillon, S. Bhagat, H. Carvey and E. Shriberg

ICSI Technical Report TR-04-002

__/__/2004
Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency Detection

Y. Liu, E. Shriberg, A. Stolcke and M. Harper

In Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.

__/__/2004
Prosody Modeling for Automatic Speech Recognition and Understanding

E. Shriberg and A. Stolcke

Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag.

__/__/2004
Speech recognition on vector architectures

A. Janin

Ph.D. Thesis, University of California at Berkeley

__/__/2004
Evaluating Factors Impacting the Accuracy of Forced Alignments in a Multimodal Corpus

L. Chen, Y. Liu, M. Harper, E. Maia, and S. McRoy

Proceedings of LREC 2004, Lisbon

12/19/2004
Structural Event Detection for Rich Transcription of Speech

Y. Liu

Ph.D Thesis, Purdue University

11/__/2004
Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System

C. Wooters, J. Fung, B. Peskin and X. Anguera

Proceedings of Fall 2004 Rich Transcription Workshop (RT-04F), Nov. 2004

11/__/2004
Incorporating Tandem/HATs MLP Features into SRI's Conversational Speech Recognition System

Q. Zhu, A. Stolcke, B. Y. Chen, and N. Morgan

Proceedings of the EARS RT-04F Workshop, Palisades, New York, November 2004.

11/__/2004
SmartKom English: From Robust Recognition to Felicitous Interaction

D. Gelbart, J. Bryants, A. Stolcke, R. Porzel, M. Baudis, and N. Morgan

In SmartKom--Foundations of Multimodal Dialogue Systems, W. Wahlster, ed., pp. 453-470, Springer

10/__/2004
From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System

N. Mirghafori, A. Stolcke, C. Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendorf

Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.

10/__/2004
Auditory-based Automatic Speech Recognition

W. Hemmert, M. Holmberg and D. Gelbart

Proceedings of ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004.

10/__/2004
Vocabulary and Language Model Adaptation using Information Retrieval

B. Bigi, Y. Huang and R. De Mori

Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.

10/__/2004
Learning Long-Term Temporal Features in LVCSR Using Neural Networks

B. Chen, Q. Zhu and N. Morgan

Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.

10/__/2004
On using MLP features in LVCSR

Q. Zhu, B. Chen, N. Morgan and A. Stolcke

Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.

07/__/2004
Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies

M. Galley, K. McKeown, J. Hirschberg and E. Shriberg

Proceedings of 42nd Meeting of the ACL, July 21-26, Barcelona

07/__/2004
Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech

Y. Liu, A. Stolcke, E. Shriberg and M. Harper

In Proceedings of Conference on Empirical Methods in Natural Language Processing, Barcelona

07/__/2004
Time delay based failure-robust direction of arrival estimation

T. Pirinen and J. Yli-Hietanen

Proceedings of IEEE SAM 2004, Sitges, Barcelona, Spain.

06/__/2004
The 2004 ICSI-SRI-UW Meeting Recognition System

C. Wooters, N. Mirghafori, A. Stolcke, T. Pirinen, I Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendorf

Proceedings of the Joint AMI/PASCAL/IM2/IM4 Workshop on Multimodal and Related Machine Learning Algorithms (MLMI '04), Martigny, Switzerland, pp. 196-208

06/__/2004
Long-Term Temporal Features for Conversational Speech Recognition

B. Chen, Q. Zhu and N. Morgan

In Proceedings of First International Workship, MLMI 2004. Martingy, Switzerland, June 2004.

06/__/2004
Tandem Connectionist Feature Extraction for Conversational Speech Recognition

Q. Zhu, B. Chen, N. Morgan and A.Stolcke

In Proceedings of First International Workshop, MLMI 2004, Martigny, Switzerland, June 2004.

05/__/2004
Desperately Seeking Impostors: Data-Mining for Competitive Impostor Testing in a Text-Dependent Speaker Verification System

M. Hebert and N. Mirghafori

In Proceedings of IEEE ICASSP, Montreal

05/__/2004
The ICSI Meeting Project: Resources and Research

A. Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters, and B. Wrede

Proceedings of the ICASSP 2004 Meeting Recognition Workshop, Montreal, Canada

05/__/2004
Parameterization of the Score Threshold for a Text-Dependent Adaptive Speaker Verification System

N. Mirghafori and M. Hebert

In Proceedings of IEEE ICASSP, Montreal

05/__/2004
TRAPping Conversational Speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition

N. Morgan, B. Y. Chen, Q. Zhu, and A. Stolcke

In Proceedings of IEEE ICASSP, Montreal

05/__/2004
Detection and compensation of sensor malfunction in time delay based direction of arrival estimation

T. Pirinen, J. Yli-Hietanen, P. Pertilä and A. Visa

In Proceedings of IEEE ISCAS, Vancouver

05/__/2004
Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation System

A. Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendorf

In NIST ICASSP 2004 Meeting Recognition Workshop, Montreal

05/24/2004
The ICSI Meeting Corpus: Close-talking and Far-field, Multi-channel Transcriptions for Speech and Language Researchers

Jane A. Edwards

LREC 2004, Workshop on Compiling and Processing Spoken Language Corpora, Lisbon, Portugal, May 2004.

04/__/2004
Improving Automatic Sentence Boundary Detection with Confusion Networks

D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu and E. Shriberg

In Proceedings of HLT-NAACL Conference, Boston

04/__/2004
The ICSI Meeting Recorder Dialog Act (MRDA) Corpus

E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey

In Proceedings of HLT-NAACL SIGDIAL Workshop, April-May 2004, Boston

03/__/2004
Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing

E. Shriberg and A. Stolcke

Proc. International Conference on Speech Prosody, Nara, Japan, March 2004.

01/__/2004
The ICSI/SRI/UW RT04 Structural Metadata Extraction System

Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, M. Harper.

RT-04 EARS Workshop

__/__/2003
An Improved Approximation Algorithm for Vertex Cover with Hard Capacities

R. Gandhi, E. Halperin, S. Khuller, G. Kortsarz and A. Srinivasan

Proceedings of the International Colloquium on Automata, Languages and Programming (ICALP), 164-175

__/__/2003
Automatic Speech Recognition

H. Hermansky and N. Morgan

In Encyclopedia of Cognitive Science, Nature Publishing Group, London

__/__/2003
Word Fragments Identification Using Acoustic-Prosodic Features in Conversational Speech

Y. Liu

Proceedings of HLT/NAACL, Student Session, Edmonton, Alberta

12/__/2003
A Robust Speaker Clustering Algorithm

J. Ajmera and C. Wooters

Proceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands

12/__/2003
The Relationship Between Dialogue Acts and Hot Spots in Meetings

B. Wrede and E. Shriberg

Proceedings of IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands

09/__/2003
Learning Discriminative Temporal Patterns in Speech: Development of Novel TRAPS-Like Classifiers

B. Chen, S. Chang and S. Sivadas

Proceedings of EUROSPEECH 2003, Geneva

09/__/2003
Far-Field ASR on Inexpensive Microphones

L. Docio, D. Gelbart, and N. Morgan

Proceedings of 8th European Conference on Speech Communication and Technology (EUROSPEECH 2003), Geneva, Switzerland, pp. 2141-2144

09/__/2003
Automatic disfluency identification in conversational speech using multiple knowledge sources

Y. Liu, E. Shriberg and A. Stolcke

Proceedings of EUROSPEECH 2003, Geneva

09/__/2003
Feature Transformations and Combinations for Improving ASR Performance

P. Somervuo, B. Chen and Q. Zhu

Proceedings of EUROSPEECH 2003, Geneva

09/__/2003
Spotting "Hotspots" in Meetings: Human Judgments and Prosodic Cues

B. Wrede and E. Shriberg

Proceedings of EUROSPEECH 2003, Geneva

08/__/2003
Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings

S. Bhagat, H. Carvey and E. Shriberg

Proceedings of ICPhS 2003, Barcelona

05/__/2003
Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data

D. Hillard, M. Ostendorf and E. Shriberg

Proceedings of HLT-NAACL Conference, Edmonton, Canada

04/__/2003
The ICSI Meeting Corpus

A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke and C. Wooters

Proceedings of ICASSP-2003, Hong Kong

04/__/2003
Meetings about meetings: research at ICSI on speech in multiparty conversations

N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke and C. Wooters

Proceedings of ICASSP-2003, Hong Kong

04/__/2003
Using prosodic and conversational features for high-performance speaker recognition: Report from JHU WS'02.

B. Peskin, J. Navratil, J. Abramson, D. Jones, D. Klusacek, D. Reynolds and B. Xiang

Proceedings of ICASSP-2003, Hong Kong

04/__/2003
Audio Information Access from Meeting Rooms

S. Renals and D. Ellis

In Proceedings of ICASSP-2003. Hong Kong

04/__/2003
The SuperSID Project: Exploiting high-level information for high-accuracy speaker recognition

D. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones and B. Xiang

Proceedings of ICASSP-2003, Hong Kong

04/__/2003
Experiments With Linear And Nonlinear Feature Transformations In HMM Based Phone Recognition

P. Somervuo

Proceedings of ICASSP-2003, Hong Kong

03/__/2003
Data-Driven Speaker and Subword Unit Clustering in Speech Processing

M. Hersch

EPFL Diploma Thesis, ICSI

__/__/2002
The relation of stress accent to pronunciation variation in spontaneous American English discourse

S. Greenberg, H.M. Carvey and L. Hitchcock

In Proceedings of International Conference on Speech Prosody 2002, Aix, France

__/__/2002
Robust Speech Recognition Based on Spectro-Temporal Processing

M. Kleinschmidt

PhD Dissertation, University of Oldenberg

12/__/2002
Prosodic Cues For Emotion Recognition In Communicator Dialogs

J.C. Ang

M.S. Thesis, University of California at Berkeley

09/__/2002
Qualcomm-ICSI-OGI Features for ASR

A. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. Kajarekar, N. Morgan and S. Sivadas

ICSLP-2002, Denver, Colorado, USA

09/__/2002
Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog

J. Ang, R. Dhillon, A. Krupski, E. Shriberg and A. Stolcke

ICSLP-2002, Denver, Colorado, USA

09/__/2002
Automatic Punctuation and Disfluency Detection in Multi-Party Meetings Using Prosodic and Lexical Cues

D. Baron, E. Shriberg and A. Stolcke

ICSLP-2002, Denver, Colorado, USA

09/__/2002
A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition

S. Chang

Ph.D. Thesis, University of California at Berkeley

09/__/2002
Double the Trouble: Handling Noise and Reverberation in Far-Field Automatic Speech Recognition

D. Gelbart and N. Morgan

ICSLP-2002, Denver, Colorado, USA

09/__/2002
Spectro-temporal Gabor Features as a Front End for Automatic Speech Recognition

M. Kleinschmidt

Forum Acusticum 2002, Seville, Spain

09/__/2002
Improving Word Accuracy with Gabor Feature Extraction

M. Kleinschmidt and D. Gelbart

ICSLP-2002, Denver, Colorado, USA

09/__/2002
What's new in government-sponsored speech recognition research

N. Morgan

Speech Technology Magazine, vol. 7 no. 5

09/__/2002
Speech Modeling Using Variational Bayesian Mixture of Gaussians

P. Somervuo

ICSLP-2002, Denver, Colorado, USA

05/__/2002
A New Speaker Change Detection Method for Two-Speaker Segmentation

A. Adami, S. Kajarekar and H. Hermansky

ICASSP-2002, Orlando, Florida, USA

05/__/2002
Unknown-Multiple Speaker Clustering using HMM

J. Ajmera, H. Bourlard, I. Lapidot and I. McCowan

In Proceedings of ICSLP-2002, Orlando, Florida

05/__/2002
Prosody-Based Automatic Detection of Punctuation and Interruption Events in the ICSI Meeting Recorder Corpus

D. Baron

M.S. Thesis, University of California at Berkeley

05/__/2002
Reducing the Effect of Room Acoustics on Human-Computer Interaction

D. Gelbart

Avios-2002, San Jose, California, USA

05/__/2002
Hierarchical Tandem Feature Extraction

S. Sivadas and H. Hermansky

ICASSP-2002, Orlando, Florida, USA

05/__/2002
Using Prosodic and Lexical Information for Speaker Identification

F. Weber, L. Manganaro, B. Peskin and E. Shriberg

ICASSP-2002, Orlando, Florida, USA

__/__/2001
Chapter 17: The Transcription of Discourse

J. Edwards

In The Handbook of Discourse Analysis, D. Shriffrin, D. Tannen and H. Hamilton, eds. Oxford: Blackwell, pp. 321-348.

__/__/2001
A study of two dimensional linear descriminants for ASR

S. Kajarekar, B. Yegnanarayana and H. Hermansky

ICASSP 2001, Salt Lake City, Utah, 2001.

__/__/2001
Speech Intelligibility Derived From Asynchrounous Processing of Auditory-Visual Information

K.W. Grant and S. Greenberg

AVSP Workshop, 2001.

12/__/2001
Multispeaker Speech Activity Detection for the ICSI Meeting Recorder

T. Pfau, D. Ellis and A. Stolcke

Proceedings Automatic Speech Recognition and Understanding Workshop (ASRU), Trento, Italy, December 2001.

12/__/2001
Evaluating Long-term Spectral Subtraction for Reverberant ASR

D. Gelbart and N. Morgan

ASRU-2001, Madonna di Campiglio, Italy, December 2001.

10/__/2001
The relation between stress accent and vocalic identity in spontaneous American English discourse

S. Greenberg, S. Chang and L. Hitchcock

In Proceedings of ISCA Workshop on Prosody in Speech Recognition and Understanding, Red Bank, NJ, October 2001.

10/__/2001
Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech

E. Shriberg, A. Stolcke and D. Baron

ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, Red Bank, NJ, October 2001.

09/__/2001
Combining bottom-up and top-down constraints for robust ASR: The multiscore decoder

J. Barker, M. Cooke and D. Ellis

Workshop on Consistent and Reliable Acoustic Cues CRAC-2001. Aalborg, Denmark, September 2001.

09/__/2001
Investigations into Tandem acoustic modeling for the Aurora taks

D.P.W. Ellis and M. Reyes

In Proceedings of Eurospeech-01. Aalborg, Denmark, September 2001.

09/__/2001
Relating Frame Accuracy with Word Error in Hybrid ANN-HMM ASR

M. Shire

Eurospeech-2001, Aalborg, September 2001.

09/__/2001
Robust ASR Front-End Using Spectral-Based and Discriminant Features: Experiments on the Aurora Tasks

C. Benitez, L. Burget, B. Chen, S. Dupont, H. Garudadri, H. Hermansky, P. Jain, S. Kajarekar, and S. Sivadas

In proceedings of 7th European Conference on Speech Communication and Technology (EUROSPEECH 2001), pp. 429-432, Aalborg, Denmark

09/__/2001
Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation

E. Shriberg, A. Stolcke and D. Baron

Eurospeech-2001, Aalborg, September 2001.

09/__/2001
An Elitist Approach to Articulatory-Acoustic Feature Classification

S. Chang, S. Greenberg and M. Wester

Eurospeech-2001, Aalborg, September 2001.

09/__/2001
From Here to Utility -Melding Phonetic Insight with Speech Technology

S. Greenberg

Eurospeech-2001, Aalborg, September 2001.

09/__/2001
Whither Speech Technology? -A Twenty-First Century Perspective

S. Greenberg

Eurospeech-2001, Aalborg, September 2001.

09/__/2001
The Relation Between Speech Intelligibility and the Complex Modulation Spectrum

S. Greenberg and T. Arai

Eurospeech-2001, Aalborg, September 2001.

09/__/2001
Vowel Height is Intimately Associated with Stress Accent in Spontaneous American English Discourse

L. Hitchcock and S. Greenberg

Eurospeech-2001, Aalborg, September 2001.

09/__/2001
A Dutch Treatment of an Elitist Approach to Articulatory-Acoustic Feature Classification

M. Wester, S. Greenberg and S. Chang

Eurospeech-2001, Aalborg, September 2001.

08/__/2001
Word-Level Confidence Estimation for Automatic Speech Recognition

A. Hatch

M.S. Thesis, University of California at Berkeley, August 2001.

06/__/2001
Corpus Variation and Parser Performance

D. Gildea

Empirical Methods in Natural Language Processing, Pittsburgh, June 2001

05/__/2001
Tandem acoustic modeling in large-vocabulary recognition

D. Ellis, R. Singh and S. Sivadas

ICASSP-2001, Salt Lake City, Utah, May 2001.

05/__/2001
Multi-Stream ASR trained with Heterogeneous Reverberant Environments

M.L. Shire

ICASSP-2001, Salt Lake City, May 2001.

05/__/2001
Global Posterior Probability Estimates as Confidence Measures in an Automatic Speech Recognition System

W. Warren

ICASSP-2001, Salt Lake City, May 2001.

04/__/2001
SpeechCorder, The Portable Meeting Recorder

A. Janin and N. Morgan

Workshop on Hands-Free Speech Communication Kyoto, Japan, April 2001

04/__/2001
Meeting Recorder

A. Janin

Avios, San Jose, April 2001.

03/__/2001
The Meeting Project at ICSI

N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg and A. Stolcke

Human Language Technologies Conference, San Diego, March 2001

__/__/2000
Linguistic dissection of switchboard-corpus automatic speech recognition systems

S. Greenberg and S. Chang

ISCA Workshop on Automatic Speech Recognition: Challenges for the New Millennium, Paris, 2000.

__/__/2000
Discriminant Training of Front-End and Acoustic Modeling Stages to Heterogeneous Acoustic Environments for Multi-stream Automatic Speech Recognition

M. Shire

PhD Dissertation, University of California at Berkeley, Fall 2000.

__/__/2000
Search for Information Bearing Components in Speech

H.H. Yang and H. Hermansky

In Advances in Neural Information Processing Systems, Vol. 12, S.A. Solla, T.K. Leen and K.-R. Muller, eds. MIT Press, 2000.

12/__/2000
Global Posterior Probability Estimates as Decision Confidence Measures in an Automatic Speech Recognition System

W. Warren

Ph.D. Dissertation, UC Berkeley, December 2000.

10/__/2000
Using mutual information to design feature combinations

D. Ellis and J. Bilmes

ICSLP-2000, Beijing, October 2000.

10/__/2000
Decoding speech in the presence of other sound sources

J. Barker, M. Cooke and D. Ellis

ICSLP-2000, Beijing, October 2000.

10/__/2000
Using acoustic condition clustering to improve acoustic change detection on Broadcast News

J.F. Lopez and D. Ellis

Proceedings of International Conference on Spoken Language Processing (ICSLP 2000), Vol. 4, pp. 568-571, 16-20 October 2000, Beijing, China

10/__/2000
Consonant discrimination in elicited and spontaneous speech: A case for signal-adaptive front ends in ASR

K. Sönmez, M. Plauché, E. Shriberg and H. Franco

ICSLP-2000, Beijing, October 2000.

10/__/2000
On data-derived temporal processing in speech feature extraction

M. Shire and B. Chen

ICSLP-2000, Beijing, October 2000.

10/__/2000
Automatic Phonetic Transcription of Spontaneous Speech American English

S. Chang, L. Shastri and S. Greenberg

ICSLP-2000, Beijing, October 2000.

10/__/2000
A comparison of data-derived and knowledge-based modeling of pronunciation variation

M. Wester and E.Fosler-Lussier

ICSLP-2000, Beijing, October 2000.

10/__/2000
Automatic Labeling of Semantic Roles

D. Gildea and D. Jurafsky

ACL-2000, Hong Kong, October 2000, pp. 512-520.

09/__/2000
Prosody-Based Automatic Segmentation of Speech into Sentences and Topics

E. Shriberg, A. Stolcke, D. Hakkani-Tür and G. Tür

Speech Communications, T. Robinson and S. Rendals, eds. Vol. 32, 1-2, 127-154, Sep. 2000.

08/__/2000
Relevance of TimeFrequency Features for Phonetic and SpeakerChannel Classification

H.H. Yang. S. Sharma, S. van Vuuren and H. Hermansky

Speech Communication, August 2000.

06/__/2000
Tandem connectionist feature stream extraction for conventional HMM systems

H. Hermansky, D. Ellis and S. Sharma

ICASSP-2000, Istanbul, June 2000, III-1635-1638.

06/__/2000
Feature extraction using non-linear transformation for robust speech recognition on the Aurora database

S. Sharma, D. Ellis, S. Kajarekar, P. Jain and H. Hermansky

ICASSP-2000, Istanbul, June 2000, II-1117-1120.

06/__/2000
Data-driven RASTA filters in reverberation

M. Shire and B. Chen

ICASSP-2000, Istanbul, June 2000, III-1627-1630.

05/__/2000
Improved recognition by combining different features and different systems

D.P.W. Ellis

In Proceedings of AVIOS-2000, San Jose, May 2000.

05/__/2000
An introduction to the diagnostic evaluation of the Switchboard-corpus automatic speech recognition systems

S. Greenberg, S. Chang and J. Hollenback

NIST Speech Transcription Workshop, College Park, MD, May 16-19, 2000.

05/__/2000
Prosodic stress revisited: Reassessing the fole of fundamental frequency

R. Silipo and S. Greenberg

NIST Speech Transcription Workshop, College Park, MD, May 16-19, 2000.

05/__/2000
The uninvited guest: Information's role in guiding the production of spontaneous speech

S. Greenberg and E. Fosler-Lussier

Crest Workshop on Models of Speech Production: Motor Planning and Articulatory Modelling, Kloster Seeon, Germany, May 1-4, 2000.

__/__/1999
Effects of Speaking Rate and Word Frequency on Conversational Pronunciations

E. Fosler-Lussier and N. Morgan

Speech Communication 29 2-4, pp. 37-157.

__/__/1999
Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures

D. Ellis

Speech Communication 27, 3-4, pp. 281-298.

__/__/1999
Multi-Level Decision Trees for Static and Dynamic Pronunciation Models

E. Fosler-Lussier

Eurospeech-99, Budapest, pp. I-463-466.

__/__/1999
Multi-stream speech recognition: Ready for prime time?

A. Janin, D. Ellis and N. Morgan

Eurospeech-99, Budapest, pp. II-591-594.

__/__/1999
Speech/music discrimination based on posterior probability features

G. Williams and D. Ellis

Eurospeech-99, Budapest, pp. II-687-690.

__/__/1999
Temporal constraints on speech intelligibility as deduced from exceedingly sparse spectral representations

R. Silipo, S. Greenberg and T. Arai

Eurospeech-99, Budapest, pp. VI-2687-2690.

__/__/1999
Data-driven modulation filter design under adverse acoustic conditions and using phonetic and syllabic units

M.L. Shire

Eurospeech-99, Budapest, pp. III-1123-1126.

__/__/1999
Topic-based language models using EM

D. Gildea and T. Hofmann

Eurospeech-99, Budapest, pp. V-2167-2170.

__/__/1999
Sooner or Later: Exploring Asynchrony in Multi-Band Speech Recognition

N. Mirghafori and N. Morgan

Eurospeech-99, Budapest, pp. II-595-598.

__/__/1999
# Buried Markov models for speech recognition

J. Bilmes

ICASSP-99, Phoenix, pp. II-713-716.

__/__/1999
Size matters: An empirical study of neural network training for large vocabulary continuous speech recognition

D. Ellis and N. Morgan

ICASSP-99, Phoenix, pp. II-1013-1016.

__/__/1999
Dynamic classifier combinations in hybrid speech recognition systems using utterance-level confidence values

K. Kirchhoff and J. Bilmes

ICASSP-99, Phoenix, pp. II-693-696

__/__/1999
# Using Boosting to Improve a Hybrid HMM/Neural Network Speech Recognizer

H. Schwenk

ICASSP-99, Phoenix, pp. II-1009-1012

__/__/1999
Speech and Audio Signal Processing

B. Gold and N. Morgan

Wiley Press, New York, 1999.

__/__/1999
Temporal Signal Processing for ASR

N. Morgan

IEEE Workshop on Automatic Speech Recognition and Understanding, pp 9-16, 1999.

12/__/1999
Contextual word and syllable pronunciation models

E. Fosler-Lussier

ASRU-99, Keystone CO, December 1999.

12/__/1999
Combined speech and speaker recognition with speaker-adapted connectionist models

D. Genoud, D. Ellis and N. Morgan

ASRU-99, Keystone CO, December 1999.

08/__/1999
Dynamic Pronunciation Models for Automatic Speech Recognition

E. Fosler-Lussier

PhD Dissertation, University of California at Berkeley, August 1999.

08/__/1999
Forms of English function words - Effects of disfluencies, turn position, age and sex, and predictability

A. Bell, D. Jurafsky, E. Fosler-Lussier, C. Girand and D. Gildea

International Congress of Phonetic Sciences, San Francisco, August 1999, pp. 1:395-398.

08/__/1999
Incorporating contextual phonetics into automatic speech recognition

E. Fosler-Lussier, S. Greenberg, and N. Morgan

International Congress of Phonetic Sciences, San Francisco, August 1999, pp. 1:611-614.

08/__/1999
Statistical Acoustic Indications of Coarticulation

K. Kirchoff and J. Bilmes

International Congress of Phonetic Sciences, San Francisco, August 1999, pp. 3:1729-1732.

08/__/1999
Syllable Detection and Segmentation Using Temporal Flow Neural Networks

L. Shastri, S. Chang and S. Greenberg

International Congress of Phonetic Sciences, San Francisco, August 1999, pp. 3:1721-1724

08/__/1999
Automatic Transcription of Prosodic Stress for Spontaneous English Discourse

R. Silipo and S. Greenberg

International Congress of Phonetic Sciences, San Francisco, August 1999, pp. 3:2351-2354.

05/__/1999
Natural Statistical Models for Automatic Speech Recognition

J. Bilmes

PhD Dissertation, University of California at Berkeley, May 1999.

03/__/1999
Temporal Patterns (TRAPS) in ASR of Noisy Speech

H. Hermansky and S. Sharma

In Proceedings of ICASSP '99, Phoenix, Arizona, USA, March 1999.

03/__/1999
Relevancy of Time Frequency Features for Phonetic Classification Measured by Mutual Information

H.H. Yang, S. van Vuuren and H. Hermansky

ICASSP'99, Phoenix, Arizona, USA, March 1999.

02/__/1999
Not just what, but also when: Guided automatic pronunciation modeling for Broadcast News

E. Fosler-Lussier and G. Williams

DARPA Broadcast News Transcription and Understanding Workshop, Herndon VA, February 1999.

02/__/1999
Reducing errors by increasing the error rate: MLP Acoustic Modeling for Broadcast News Transcription

N. Morgan, D. Ellis, E. Fosler-Lussier, A. Janin and B. Kingsbury

DARPA Broadcast News Transcription and Understanding Workshop, Herndon VA, February 1999

02/__/1999
An Overview of the SPRACH System for the Transcription of Broadcast News

G. Cook, J. Christie, D. Ellis, E. Fosler-Lussier, Y. Gotoh, B. Kingsbury, N. Morgan, S. Renals, T. Robinson and G. Williams

DARPA Broadcast News Transcription and Understanding Workshop, Herndon VA, February 1999

__/__/1998
Speech Recognition with Dynamic Bayesian Networks

G. Zweig

PhD Dissertation, University of California at Berkeley, Spring 1998.

__/__/1998
Speech intelligibility in the presence of cross-channel spectral asynchrony

T. Arai and S. Greenberg

ICASSP-98, Seattle, pp. 933-936.

__/__/1998
Data-Driven Extensions to HMM Statistical Dependencies

J. Bilmes

ICSLP-98, Sydney, Australia, pp. 69-72.

__/__/1998
Maximum Mutual Information Based Reduction Strategies for Cross-Correlation based Joint Distributional Modeling

J. Bilmes

ICASSP-98, Seattle, pp. 469-472.

__/__/1998
Midlevel representations for computational auditory scene analysis: The weft element

D. Ellis and D. Rosenthal

In Computational Auditory Scene Analysis, D.F. Rosenthal & H.G. Okuno, eds., Lawrence Erlbaum, pp. 257-272.

__/__/1998
Recognition in a new key - Towards a science of spoken language

S. Greenberg

ICASSP-98, Seattle, pp. 1041-1045

__/__/1998
Speaking in shorthand - A syllable-centric perspective for understanding pronunciation variation

S. Greenberg

ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kekrade Netherlands, pp. 47-56.

__/__/1998
Speech intelligibility is highly tolerant of cross-channel spectral asynchrony

S. Greenberg and T. Arai

Joint Meeting of the Acoustical Society of America and the International Congress on Acoustics, Seattle, pp. 2677-2678.

__/__/1998
Speech intelligibility derived from exceedingly sparse spectral information

S. Greenberg, T. Arai and R. Silipo

ICSLP-98, Sydney, Australia, pp. 74-77.

__/__/1998
Robust speech recognition using the modulation spectrogram

B. Kingsbury, N. Morgan and S. Greenberg

Speech Communication, 25, pp. 117-132.

__/__/1998
Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers

N. Mirghafori and N. Morgan

ICSLP-98, Sydney, Australia, pp. 743-746.

__/__/1998
Transmissions and Transitions: A Study of Two Common Assumptions in Multi-Band ASR

N. Mirghafori and N. Morgan

ICASSP-98, Seattle, pp. 713-716

__/__/1998
Combining Multiple Estimators of Speaking Rate

N. Morgan and E. Fosler-Lussier

ICASSP-98, Seattle, pp. 729-732

__/__/1998
Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition

S.L. Wu

Ph.D. Thesis, UC Berkeley, Spring 1998, ICSI Technical Report TR-98-014.

__/__/1998
Incorporating Information from Syllable-length Time Scales into Automatic Speech Recognition

S.L. Wu, B. Kingsbury, N. Morgan and S. Greenberg

ICASSP-98, Seattle, pp. 721-724

__/__/1998
# Performance improvements through combining phone- and syllable-length information in automatic speech recognition

S.L. Wu, B. Kingsbury, N. Morgan and S. Greenberg

ICSLP-98, Sydney, Australia, pp. 854-857.

__/__/1998
Training Neural Networks with SPERT-II

K. Asanovic, J. Beck, D. Johnson, B. Kingsbury, N. Morgan and J. Wawrzynek

Chapter in Parallel Architectures for Artificial Networks - Paradigms and Implementations, eds. N. Sundararajan and P. Saratchandran, IEEE Computer Society Press, Los Alamitos, CA, pp. 345-364, 1998.

__/__/1998
Connectionist Techniques for Speech Recognition

H. Bourlard and N. Morgan

Article in the Survey on the State of the Art in Human Language Technology, ed. R. Cole, Cambridge University Press, pp. 356-361, 1998.

__/__/1998
Hybrid HMM/ANN systems for speech recognition: overview and new research directions

H. Bourlard and N. Morgan

In Adaptive Processing of Sequences and Data Structures, C.L. Giles and M. Gori (Eds.), pp. 389-417, Lecture Notes in Artificial Intelligence (1387), Springer, 1998.

__/__/1998
Modeling Dynamic Prosodic Variation for Speaker Verification

K. Sonmez, E. Shriberg, L. Heck and M. Weintraub

In Proceedings of ICSLP-98, Sydney, 1998.

12/__/1998
Perceptually-inspired signal processing strategies for robust speech recognition in reverberant environments

B. Kingsbury

PhD Dissertation, University of California at Berkeley, December 1998.

12/__/1998
A Multi-Band Approach to Automatic Speech Recognition

N. Mirghafori

PhD Dissertation, University of California at Berkeley, December 1998. Reprinted as ICSI Technical Report, TR-99-004, Berkeley, CA, January 1999.

11/__/1998
Spectral Basis Functions from Discriminant Analysis

H. Hermansky and N. Malayath

In Proceedings of ICSLP'98, Sydney, Australia, November 1998.

05/__/1998
Effects of Speaking Rate and Word Predictability on Conversational Pronunciations

E. Fosler-Lussier and N. Morgan

ESCA Workshop on Modeling Pronunciation for ASR, May 1998

__/__/1997
The temporal properties of spoken Japanese are similar to those of English

T. Arai and S. Greenberg

Eurospeech-97, Rhodes, vol. 2 pp. 1011-1014.

__/__/1997
Speech Recognition using On-line Estimation of Speaking Rate

Nelson Morgan, Eric Fosler and Nikki Mirghafori

Eurospeech-97, Rhodes, vol. 4 pp. 2079-2082.

__/__/1997
On the origins of speech intelligibility in the real world

S. Greenberg

ESCA workshop of Robust Speech Recog., Pont-a-Mousson, pp. 23-32

__/__/1997
Robust features and environmental compensation: A few comments

N. Morgan

ESCA workshop of Robust Speech Recog., Pont-a-Mousson, pp. 43-44

__/__/1997
Improving ASR performance for reverberant speech

B. Kingsbury, N. Morgan and S. Greenberg

ESCA workshop of Robust Speech Recog., Pont-a-Mousson, pp. 87-90.

__/__/1997
A Space-Time theory of Pitch and Timbre based on Cortical Expansion of the Cochlea Traveling Wave Delay

S. Greenberg, D. Poeppel and T.Roberts

XIth Int. Symp. on Hearing, Grantham

__/__/1997
The modulation spectrogram: In pursuit of an invariant representation of speech

S. Greenberg and B. Kingsbury

ICASSP-97, Munich, vol. 3 pp. 1647-1650

__/__/1997
Recognizing reverberant speech with RASTA-PLP

B. Kingsbury and N. Morgan

ICASSP-97, Munich, vol. 2 pp. 1259-1262.

__/__/1997
Integrating syllable boundary information into speech recognition

S.L. Wu, M. Shire, S. Greenberg and N. Morgan

ICASSP-97, Munich, vol. 2 pp. 987-990.

__/__/1997
The Weft: A representation for periodic sounds

D. Ellis

ICASSP-97, Munich, vol. 2 pp. 1307-1310.

__/__/1997
Computational Auditory Scene Analysis exploiting Speech-Recognition knowledge

D. Ellis

IEEE workshop on Apps. of Sig. Proc. to Aud. & Acous., Mohonk.

__/__/1997
Joint Distributional Modeling with Cross-Correlation Based Features

J. Bilmes

ASRU-97, Santa Barbara, pp.148-155

__/__/1997
Should Recognizers Have Ears?

H. Hermansky

In Proceedings of ESCA Tutorial and Research Workshop on Robust Speech Recognition for Unknown Communication Channels, pp.1-10, France, 1997.

__/__/1997
Switchboard-DAMSL Labeling Project Coder's Manual

D. Jurafsky, E. Shriberg and D. Biasca

Technical Report 97-02, University of Colorado, Institute of Cognitive Science, Boulder, Colorado, 1997.

__/__/1997
Data-Driven Design of RASTA-like Filters

S. van Vuuren and H. Hermansky

In Proceedings of EUROSPEECH'97, Rhodes, Greece, 1997.

09/__/1997
Multiresolution channel normalization for ASR in reverberant environments

C. Avendano, S. Tibrewala and H. Hermansky

In Proceedings of Eurospeech-97, Rhodes, Greece, September 1997.

09/__/1997
Estimation of Global Posteriors and Forward-Backward Training of Hybrid HMM/ANN Systems

L. Hennebert, C. Ris, H. Bourlard, S Renals and N. Morgan

In Proceedings of Eurospeech 1997, pp 1951-1954, Greece, 1997.

__/__/1996
Towards Robustness to Fast Speech in ASR

N. Mirghafori, E. Fosler and N. Morgan

ICASSP-96, Atlanta

__/__/1996
REMAP - Experiments with speech recognition

Y. Konig, H. Bourlard and N. Morgan

ICASSP-96, Atlanta

__/__/1996
On Reversing the Generation Process in Optimality Theory

E. Fosler

ACL-96, Santa Cruz

__/__/1996
Automatic Learning of Word Pronunciation from Data

E. Fosler, M. Weintraub, S. Wegmann, Y. H. Kao, S. Khudanpur, C. Galles and M. Saraclar

ICSLP-96, Philadelphia.

__/__/1996
Stochastic perceptual speech models with durational dependence

J. Bilmes, N. Morgan, S.L. Wu and H. Bourlard

ICSLP-96, Philadelphia

__/__/1996
Insights into spoken language gleaned from phonetic transcriptions of the Switchboard corpus

S. Greenberg, J. Hollenback and D. Ellis

ICSLP-96, Philadelphia

__/__/1996
Prediction-driven computational auditory scene analysis for dense sound mixtures

D. Ellis

ESCA workshop on Aud. Basis of Speech Percept., Keele '96

__/__/1996
Understanding Speech Understanding

S. Greenberg

ESCA workshop on Aud. Basis of Speech Percept., Keele '96

__/__/1996
Towards Subband-Based Speech Recognition

H. Bourlard, S. Dupont, H. Hermansky and N. Morgan

In Proceedings VIII European Signal Processing Conference (EUSIPCO'96) (Trieste, Italy), pp. 1579-1582, 1996.

__/__/1996
Hybrid Connnectionist Models for Continuous Speech Recognition

H. Bourlard and N. Morgan

Chapter in Automatic Speech and Speaker Recognition - Advanced Topics, Lee, Paliwal and Soong, eds., pp.259-283, Kluwer Academic Press, 1996.

__/__/1996
REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech Recognition

Y. Konig, H. Bourlard and N. Morgan

In Proceedings of NIPS 8, pp.388-394, 1996.

__/__/1996
Speech Data Modeling at WS96: The Questionable Parameter Group

N. Morgan

Intl. Conference on Spoken Language Processing, Addendum, pp 30-31, Philadelphia, 1996

__/__/1996
SPERT-II: A Vector Microprocessor System and its Application to Large Problems in Backpropagation Training

J. Wawrzynek, K. Asanovic, B. Kingsbury, J. Beck, D. Johnson and N. Morgan

Proceedings of NIPS 8, pp. 619-625, 1996. Also in IEEE Computer, vol. 29, no. 3, pp 79-86, March 1996.

07/__/1996
A Training Algorithm for Statistical Sequence Recognition with Applications to Transition-Based Speech Recognition

H. Bourlard, Y. Konig and N. Morgan

IEEE Signal Processing Letters, pp.203-205, July, 1996.

05/__/1996
Towards Increasing Speech Recognition Error Rates

H. Bourlard, H., Hermansky and N. Morgan

Speech Communication, May 1996, pp. 205-231.

__/__/1995
Digit Recognition with Stochastic Perceptual Models

N. Morgan, S.L. Wu, and H. Bourlard

Eurospeech-95, Madrid

__/__/1995
Building Multiple Pronunication Models for Novel Words using Exploratory Computational Phonology

G. Tajchman, E. Fosler and D. Jurafsky

Eurospeech-95, Madrid

__/__/1995
REMAP: Recursive Estimation and Maximization of A Posteriori probabilities in connectionist speech recognition

H. Bourlard, Y. Konig and N. Morgan

Eurospeech-95, Madrid

__/__/1995
Fast Speakers in Large Vocabulary Continuous Speech Recognition: Analysis & Antidotes

N. Mirghafori, E. Fosler and N. Morgan

Eurospeech-95, Madrid

__/__/1995
Stochastic Perceptual Models of Speech

N. Morgan, H. Bourlard, S. Greenberg, H. Hermansky and S.L. Wu.

ICASSP-95, Detroit, 1995.

__/__/1995
Using A Stochastic Context-Free Grammar as a Language Model for Speech Recognition

D. Jurafsky, C. Wooters, J. Segal, A. Stolcke, E. Fosler, G. Tajchman and N. Morgan

ICASSP-95, Detroit.

__/__/1995
SPAM: Experiments with Digit Recognition

N. Morgan, S.L. Wu, H. Bourlard

Speech Research Symposium '95

__/__/1995
Remap modeling for connectionist speech recognition

Y. Konig, H. Bourlard and N. Morgan

Speech Research Symposium '95

__/__/1995
Learning Phonological Rule Probabilities from Speech Corpora with Exploratory Computational Phonology

G. Tajchman, D. Jurafsky and E. Fosler

ACL-95, Boston

__/__/1995
REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech Recognition

Y. Konig, H. Bourlard and N. Morgan

NIPS-95

__/__/1995
Why Is ASR Harder For Fast Speech And What Can We Do About It?

N. Mirghafori, E. Fosler, and N. Morgan

IEEE Snowbird workshop '95

__/__/1995
Transition-based statistical training for ASR

N. Morgan, Y. Konig, S.L. Wu and H. Bourlard

IEEE Snowbird workshop '95

05/__/1995
An Introduction to Hybrid HMM/Connectionist Continuous Speech Recognition

N. Morgan and H. Bourlard

IEEE Signal Processing Magazine, pp. 25-42, May 1995.

05/__/1995
Neural Networks for Statistical Recognition of Continuous Speech

N. Morgan and H. Bourlard

Proceedings of IEEE, pp. 742-770

01/__/1995
The Challenge of Spoken Language Systems: Research Directions for the Nineties

R. Cole, L. Hirschman, L. Atlas, M. Beckman, A. Biermann, M. Bush, M. Clements, J. Cohen, O. Garcia, B. Hanson, H. Hermansky, S. Levinson, K. McKeown, N. Morgan, D. Novick, M. Ostendorf, S. Oviatt, P. Price, H. Silverman, J. Spitz, A. Waibel, C. Weinstein, S. Zahorian and V. Zue

IEEE Transactions on Speech and Audio Processing, vol. 3, no. 1, pp. 1-21

__/__/1994
The Berkeley Restaurant Project

D. Jurafsky, C. Wooters, G. Tajchman, J. Segal, A. Stolcke, E. Fosler and N. Morgan

ICSLP-94.

__/__/1994
Multiple-Pronunciation Lexical Modeling in a Speaker Independent Speech Understanding System

C. Wooters and A. Stolcke

ICSLP-94

__/__/1994
# Stochastic Perceptual Auditory-Event-Based Models for Speech Recognition

N. Morgan, H. Bourlard, S. Greenberg and H. Hermansky

ICSLP-94.

__/__/1994
Modeling Dynamics in Connectionist Speech Recognition - the Time Index Model

Y. Konig and N. Morgan

ICSLP-94

__/__/1994
# Integrating Experimental Models of Syntax, Phonology, and Accent/Dialect in a Speech Recognizer

D. Jurafsky, C.Wooters, G. Tajchman, J. Segal, A. Stolcke and N. Morgan

AAAI-94.

__/__/1994
Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based Approach

N. Mirghafori, N. Morgan and H. Bourlard

NNSP-94, Greece.

__/__/1994
Integrating RASTA-PLP into Speech Recognition

J. Koehler, N. Morgan, H. Hermansky, H.G. Hirsch and G. Tong

Proceedings of IEEE Int. Conf. Acoustics, Speech & Signal Processing, I-421-424

__/__/1994
Big Dumb Neural Nets: A Working Brute Force Approach to Speech Recognition

N. Morgan

Proceedings of ICNN, Vol. VII, pp. 4462-4465

__/__/1994
Parallel Training of MLP Probability Estimators for Speech Recognition: A Gender-Based Approach

N. Mirghafori, N. Morgan and H. Bourlard

IEEE Workshop on Neural Networks for Signal Processing, Greece, pp. 289-298

__/__/1994
Scaling a Hybrid HMM/MLP System for Large Vocabulary CSR

N. Morgan, G. Tajchman, N. Mirghafori, Y. Konig and C. Wooters

ARPA Spoken Language Technology Workshop, Morgan Kaufmann, pp. 123-124

10/__/1994
RASTA Processing of Speech

H. Hermansky and N. Morgan

IEEE Transactions on Speech and Audio Processing, special issue on Robust Speech Recognition, vol. 2, no. 4, pp. 578-589

10/__/1994
Using A Million Connections for Continuous Speech Recognition

N. Morgan

Invited paper for ICONIP' 94, Seoul, pp. 1439 - 1444

09/__/1994
Current Research in Acoustically Robust Speech Recognition

N. Morgan

Proceedings of American Voice Input/Output Society (AVIOS), pp. 207-214

__/__/1993
A Neural Network Based, Speaker Independent, Large Vocabulary, Continuous Speech Recognition System: the Wernicke Project

A. Robinson, L. Almeida, J. Boite, H. Bourlard, F. Fallside, H. Hochberg, D. Kershaw, P. Kohn, Y. Konig, N. Morgan, J. Neto, S. Renals, M. Saerens and C. Wooters

Proceedings of Eurospeech, pp. 1941-1944, Berlin, Germany

__/__/1993
Context-Dependent Multiple Distribution Phonetic Modeling

M. Cohen, H. Franco, N. Morgan, D. Rumelhart and V. Abrash

Advances in Neural Information Processing Systems V, pp. 649-657

__/__/1993
Modeling Consistency in a Speaker Independent Continuous Speech Recognition System

Y. Konig, N. Morgan, C. Wooters, V. Abrash, M. Cohen and H. Franco

Advances in Neural Information Processing Systems V, pp. 682-687

__/__/1993
Recognition of Speech in Additive and Convolutional Noise Based on RASTA Spectral Processing

H. Hermansky, N. Morgan and H.G. Hirsch

Proceedings of IEEE Conference on Acoustics, Speech & Signal Processing, II-83-86, Minneapolis

__/__/1993
Supervised and Unsupervised Clustering of the Speaker Space for Connectionist Speech Recognition

Y. Konig and N. Morgan

Proceedings of IEEE Int. Conf. Acoustics, Speech & Signal Processing, I-545-548, Minneapolis

__/__/1993
Connectionist Speech Recognition: A Hybrid Approach

H. Bourlard and N. Morgan

Kluwer Press, 1993

__/__/1993
The Berkeley Restaurant Project

C. Wooters, D. Jurafsky, G. Tajchman and N. Morgan

Speech Research Symposium XIII, Johns Hopkins, pp. 119-128

__/__/1993
A Can of RASTA Worms

H. Hermansky and N. Morgan

Speech Research Symposium XIII, Johns Hopkins, pp. 343-350

11/__/1993
Continuous Speech Recognition by Connectionist Statistical Methods

H. Bourlard and N. Morgan

IEEE Trans. on Neural Networks, vol. 4, no. 6, pp. 893-909

01/__/1993
Connectionist Probability Estimators in HMM Speech Recognition

S. Renals, N. Morgan, H. Bourlard, M. Cohen and H. Franco

IEEE Transactions on Speech and Audio Processing, II-161-174,

__/__/1992
Connectionist Probability Estimation in the Decipher Speech Recognition System

S. Renals, N. Morgan, M. Cohen H. Bourlard and H. Franco

Proceedings of IEEE Int. Conf. Acoustics, Speech & Signal Processing, I-601-604

__/__/1992
Factoring Networks by a Statistical Method

N. Morgan and H. Bourlard

Neural Computation, vol. 4 no. 6, pp. 835-838

__/__/1992
Factoring Networks by a Statistical Method

N. Morgan and H. Bourlard

Neural Computation, vol. 4 no. 6, pp. 835-838

__/__/1992
RASTA-PLP Speech Analysis Technique

H. Hermansky, N. Morgan, A. Bayya, and P. Kohn

Proceedings of IEEE Int. Conf. Acoustics, Speech & Signal Processing, San Francisco, I-121-124

__/__/1992
CDNN: A Context Dependent Neural Network for Continuous Speech Recognition

H. Bourlard, N. Morgan, C. Wooters and S. Renals

Proceedings of IEEE Int. Conf. Acoustics, Speech & Signal Processing, San Francisco, 1992, II-349-352

__/__/1992
Neural nets and hidden Markov models: Review and Generalizations

H. Bourlard, N. Morgan and S. Renals

Speech Communication, vol. 11, no.2-3, pp.237-246

__/__/1992
GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition

Y. Konig and N. Morgan

Proceedings of IJCNN '92, II-332-337

__/__/1992
Improving Statistical Speech Recognition

S. Renals, N. Morgan, M. Cohen, H. Franco, H. Bourlard

Proceedings of IJCNN '92, II-302-307

__/__/1992
Context-Dependent Connectionist Probability Estimation in a Hybrid HMM-Neural Net Speech Recognition System

H. Franco, M. Cohen, N. Morgan, D. Rumelhart and V. Abrash

Proceedings of Beijing IJCNN '92

__/__/1992
Connectionist-Based Acoustic Word Models

C. Wooters and N. Morgan

IEEE Workshop on Neural Networks for Signal Processing, pp. 157-163, Copehagen

__/__/1992
Multiple-State Context-Dependent Phonetic Modeling with MLPs

M. Cohen, H. Franco, N. Morgan, D. Rumelhart and V. Abrash

Speech Research Symposium XII, Rutgers

__/__/1992
RelAtive SpecTrAl (RASTA) Processing in Speech Analysis

H. Hermansky and N. Morgan

Speech Research Symposium XII, Rutgers

__/__/1992
Connectionist Gender Adaptation in a Hybrid Neural Network / Hidden Markov Model Speech Recognition System

V. Abrash, M. Cohen, H. Franco, N. Morgan and Y. Konig

International Conference on Spoken Language Processing, pp. 911-914

__/__/1992
Hybrid Neural Network / Hidden Markov Model Continuous Speech Recognition

M. Cohen, H. Franco, N. Morgan, D. Rumelhart and V. Abrash

International Conference on Spoken Language Processing, pp. 915-918

__/__/1992
Towards Handling the Acoustic Environment in Spoken Language Processing

H. Hermansky and N. Morgan

International Conference on Spoken Language Processing, pp. 85-88

__/__/1992
Acoustic Sub-word Models in the Berkeley Restaurant Project

C. Wooters and N. Morgan

International Conference on Spoken Language Processing, pp. 1551-1554, 1992

__/__/1992
RASTA Extensions: Robustness to Additive and Convolutional Noise

N. Morgan and H. Hermansky

Proceedings of Workshop on Speech Processing in Adverse Conditions, pp. 115-118

__/__/1991
Connectionist Approaches to the Use of Markov Models for Continuous Speech Recognition

H. Bourlard and N. Morgan

Advances in Neural Information Processing Systems III, pp. 213-219

__/__/1991
Continuous Speech Recognition Using PLP Analysis with Multilayer Perceptrons

N. Morgan, H. Hermansky, H. Bourlard, P. Kohn and C. Wooters

Proceedings of IEEE Int. Conf. Acoustics, Speech & Signal Processing, pp. 49-52, Toronto, Canada

__/__/1991
Phonetic Context in Hybrid HMM/MLP Continuous Speech Recognition

H. Bourlard, M. Cohen, P. Kohn, N. Morgan and C. Wooters

Proceedings of Eurospeech 1991, pp. 109-112, Genova, Italy

__/__/1991
Connectionist Optimisation of Tied Mixture Hidden Markov Models

S. Renals, H. Bourlard, N. Morgan, H. Franco and M. Cohen

Advances in Neural Information Processing Systems IV, pp. 167-174

__/__/1991
Compensation for the effect of the communication channel in Perceptual Linear Predictive (PLP) analysis of speech

H. Hermansky, A. Bayya, N. Morgan, P. Kohn

Proceedings of Eurospeech, pp. 1367-1370

__/__/1991
Experiments with Temporal Resolution for Continuous Speech Recognition with Multi-Layer Perceptrons

N. Morgan, C. Wooters, H. Hermansky, H. Bourlard

Proceedings of IEEE Workshop on Neural Networks for Signal Processing, pp. 405-410

__/__/1991
Probability Estimation by Feed-forward Networks in Continuous Speech Recognition

S. Renals, N. Morgan and H. Bourlard

ICSI Technical Report TR-91-030, also published in Proceedings of IEEE Workshop on Neural Networks for SIgnal Processing, pp. 309-318

__/__/1991
Neural Networks for Statistical Inference: Generalizations with Applications to Speech Recognition

H. Bourlard and N. Morgan

Proceedings of the IJCNN '91, Singapore

__/__/1991
Neural Networks for Statistical Inference: Generalizations with Applications to Speech Recognition

H. Bourlard and N. Morgan

Proceedings of IJCNN '91 - Singapre

11/__/1991
The Challenge of Inverse-E: The RASTA-PLP Method

H. Hermansky, N. Morgan, A. Bayya and P. Kohn

25th Asilomar Conference on Signals, Systems, & Computers, pp. 800-804, Pacific Grove, CA

__/__/1990
Continuous Speech Recognition Using Multilayer Perceptrons with Hidden Markov Models

H. Bourlard and N. Morgan

In Proceedings of IEEE International Conference of Acoustics, Speech & Signal Processing. Albuquerque, NM.

__/__/1990
Statistical Inference in Multilayer Perceptrons and Hidden Markov Models with Applications in Continuous Speech Recognition

H. Bourlard, N. Morgan and C. Wellekens

Neuro Computing, Algorithms, Architectures and Applications, NATO ASI Series, vol. F68, pp. 217-226

__/__/1990
A Continuous Speech Recognition System Embedding MLP into HMM

H. Bourlard and N. Morgan

Advances in Neural Information Processing Systems II, pp. 186-193

__/__/1990
Merging Multilayer Perceptrons & Hidden Markov Models: Some Experiments in Continuous Speech Recognition

H. Bourlard and N. Morgan

Artificial Neural Networks: Advances and Applications

__/__/1989
Merging Multilayer Perceptrons & Hidden Markov Models: Some Experiments in Continuous Speech Recognition

H. Bourlard and N. Morgan

ICSI Technical Report TR-089-033

__/__/1989
Generalization and Parameter Estimation in Feedforward Nets: Some Experiments

H. Bourlard and N. Morgan

ICSI Technical Report TR-089-017, also published in Advances in Neural Information Processing Systems II, pp. 630-637, 1990.

__/__/1989
A Multi-DSP Ring Array for Connectionist Simulations

J. Beck, N. Morgan, A. Allman and J. Beer

Proceedings of 23rd Asilomar Conference on Signals, Systems & Computers, 1989

 

 

   
Copyright © 2005 International Computer Science Institute. All Rights Reserved.