Technical Program

ICASSP2000 Home
Chair's Message
Committee
General Information

 

Technical Program:

Technical Program Home
Program Schedule
List of Accepted Papers
Sessions by Area:

AE COMM DISPS
IMDSP Industry MMSP
NNSP Other SAM
SP SPED SPTM


Plenary Sessions
Tutorials
Special Sessions

Area 3: Speech Processing (SP)

Session:            SP-L1
Time: 15:30 - 17:30, Tuesday, June 6, 2000
Location: Convention Center Lower Hall (L3)
Title: SPEECH SYNTHESIS
Chair: Michael Macon, Oregon Graduate Institute, USA
1. Paper ID: 1370 CONSTRUCTION OF THE ACOUSTIC INVENTORY FOR A GREEK TEXT-TO-SPEECH CONCATENATIVE SYNTHESIS SYSTEM, C. Christogiannis, T. Varvarigou, A. Zappa, Y. Vamvakoulas, National Technical University of Athens, Greece, C. Shih, Lucent Technologies, USA, A. Arvaniti, University of Cyprus, Cyprus.
2. Paper ID: 2555 CONCATENATING SYLLABLES FOR RESPONSE GENERATION IN SPOKEN LANGUAGE APPLICATIONS, T. Fung, H. Meng, The Chinese University of Hong Kong, Hong Kong, China.
3. Paper ID: 3176 SEGMENT PRE-SELECTION IN DECISION-TREE BASED SPEECH SYNTHESIS SYSTEMS, R. Donovan, IBM, USA.
4. Paper ID: 4007 SPECTRAL MODIFICATION FOR CONCATENATIVE SPEECH SYNTHESIS, J. Wouters, M. Macon, Oregon Graduate Institute of Science and Technology, USA.
5. Paper ID: 1851 TRANSITION-BASED SPEECH SYNTHESIS USING NEURAL NETWORKS, G. Corrigan, N. Massey, O. Schnurr, Motorola, USA.
6. Paper ID: 4105 STOCHASTIC MODELING OF SPECTRAL ADJUSTMENT FOR HIGH QUALITY PITCH MODIFICATION, A. Kain, Oregon Graduate Institute of Science and Technology, USA, Y. Stylianou, AT&T Labs, USA.
7. Paper ID: 939 VOICE QUALITY CONVERSION IN TD-PSOLA SPEECH SYNTHESIS, X. Sun, Northwestern University, USA.
8. Paper ID: 3237 ON THE IMPLEMENTATION OF THE HARMONIC PLUS NOISE MODEL FOR CONCATENATIVE SPEECH SYNTHESIS, Y. Stylianou, AT&T Labs, USA.


Session:            SP-L2
Time: 15:30 - 17:30, Tuesday, June 6, 2000
Location: Sadirvan A - Hilton Hotel
Title: ACOUSTIC MODEL ADAPTATION FOR ASR
Chair: Ananth Sankar, Nuance, USA
1. Paper ID: 838 ON-LINE INCREMENTAL SPEAKER ADAPTATION WITH AUTOMATIC SPEAKER CHANGE DETECTION, Z.-P. Zhang, S. Furui, Tokyo Institute of Technology, Japan, K. Ohtsuki, NTT, Japan.
2. Paper ID: 1277 JOINT MAXIMUM A POSTERIORI ESTIMATION OF TRANSFORMATION AND HIDDEN MARKOV MODEL PARAMETERS, O. Siohan, Lucent Technologies, USA, C. Chesta, Politecnico di Torino, Italy, C.-H. Lee, Lucent Technologies, USA.
3. Paper ID: 2235 FULL COVARIANCE MODELLING AND ADAPTATION IN SUB-BANDS, B. Doherty, S. Vaseghi, P. McCourt, The Queen's University of Belfast, United Kingdom.
4. Paper ID: 2826 HIERARCHICAL BAYES APPROACH TO ADAPTING DELTA- AND DELTA-DELTA CEPSTRA, A. Surendran, Lucent Technologies, USA.
5. Paper ID: 2891 ON-LINE BAYESIAN SPEAKER ADAPTATION USING TREE-STRUCTURED TRANSFORMATION AND ROBUST PRIORS, S. Wang, University of Illinois at Urbana-Champaign, USA, Y. Zhao, University of Missouri-Columbia, USA.
6. Paper ID: 354 SPEAKER ADAPTATION BASED ON COMBINATION OF MAP ESTIMATION AND WEIGHTED NEIGHBOR REGRESSION, L. He, J. Wu, D. Fang, W. Wu, Tsinghua University, People's Republic of China.
7. Paper ID: 1718 ROBUST ESTIMATION FOR RAPID SPEAKER ADAPTATION USING DISCOUNTED LIKELIHOOD TECHNIQUES, A. Gunawardana, W. Byrne, The Johns Hopkins University, USA.
8. Paper ID: 3630 FAST SPEAKER ADAPTATION OF LARGE VOCABULARY CONTINUOUS DENSITY HMM SPEECH RECOGNIZER USING A BASIS TRANSFORM APPROACH, C. Boulis, V. Digalakis, Technical University of Crete, Greece.

Session:            SP-L3
Time: 09:00 - 12:00, Wednesday, June 7, 2000
Location: Convention Center Lower Hall (L3)
Title: ACOUSTIC MODELING I
Chair: Hermann Ney, RWTH, Germany
1. Paper ID: 1846 A GENERALIZATION OF THE MAXIMUM A POSTERIORI TRAINING ALGORITHM FOR MIXTURE PRIORS, E. Buhrke, C. Liu, Motorola, USA.
2. Paper ID: 388 LINEAR REGRESSION UNDER MAXIMUM A POSTERIORI CRITERION WITH MARKOV RANDOM FIELD PRIOR, X. Wu, Oregon Graduate Institute of Science and Technology, USA, Y. Yan, Intel Corporation, USA.
3. Paper ID: 617 EFFICIENT ML TRAINING OF CDHMM PARAMETERS BASED ON PRIOR EVOLUTION, POSTERIOR INTERVENTION AND FEEDBACK, Q. Huo, N. Smith, B. Ma, The University of Hong Kong, Hong Kong, China.
4. Paper ID: 3811 ASYNCHRONOUS-TRANSITION HMM, S. Matsuda, M. Nakai, H. Shimodaira, S. Sagayama, Japan Advanced Institute of Science and Technology, Japan.
5. Paper ID: 3544 FACTORED SPARSE INVERSE COVARIANCE MATRICES, J. Bilmes, University of Washington, USA.
6. Paper ID: 1062 SUB-STATE TYING IN TIED MIXTURE HIDDEN MARKOV MODELS, L. Gu, K. Rose, University of California, Santa Barbara, USA.
7. Paper ID: 1927 UNIFIED FRAME AND SEGMENT BASED MODELS FOR AUTOMATIC SPEECH RECOGNITION, H.-W. Hon, K. Wang, Microsoft Corporation, USA.
8. Paper ID: 3436 USE OF HIGHER LEVEL LINGUISTIC STRUCTURE IN ACOUSTIC MODELING FOR SPEECH RECOGNITION, I. Shafran, M. Ostendorf, University of Washington, USA.
9. Paper ID: 1929 MANDARIN ACCENT ADAPTATION BASED ON CONTEXT-INDEPENDENT/CONTEXT-DEPENDENT PRONUNCIATION MODELING, M. Liu, B. Xu, T. Huang, Y. Deng, C. Li, Chinese Academy of Sciences, People's Republic of China.
10. Paper ID: 1446 TOWARDS LANGUAGE INDEPENDENT ACOUSTIC MODELING, W. Byrne, The Johns Hopkins University, USA, P. Beyerlein, Philips Research Laboratories, The Netherlands, J. Huerta, Carnegie Mellon University, USA, S. Khudanpur, The Johns Hopkins University, USA, B. Marthi, University of Toronto, Canada, J. Morgan, West Point, USA, N. Peterek, Charles University, Czech Republic, J. Picone, Mississippi State University, USA, D. Vergyri, The Johns Hopkins University, USA, W. Wang, Rice University, USA.


Session:            SP-L4
Time: 15:30 - 17:30, Wednesday, June 7, 2000
Location: Sadirvan A - Hilton Hotel
Title: SPEECH ENHANCEMENT I
Chair: Abeer Alwan, University of California at Los Angeles, USA
1. Paper ID: 478 IMPOVING THE PERFORMANCE OF A SMALL MICROPHONE ARRAY AT LOW FREQUENCIES USING CRITICAL BAND AND LPC CODEBOOKS, Y. Cao, S. Sridharan, Queensland University of Technology, Australia.
2. Paper ID: 1692 SPEECH DEREVERBERATION AND NOISE REDUCTION WITH A COMBINED MICROPHONE ARRAY APPROACH, J. Gonzalez-Rodriguez, J. Sanchez-Bote, J. Ortega-Garcia, Universidad Politecnica de Madrid, Spain.
3. Paper ID: 1848 EXPLORING PERMUTATION INCONSISTENCY IN BLIND SEPARATION OF SPEECH SIGNALS IN A REVERBERANT ENVIRONMENT, M. Ikram, Georgia Institute of Technology, USA, D. Morgan, Lucent Technologies, USA.
4. Paper ID: 4021 INTELLIGIBILITY ASSESSMENT OF A MULTI-BAND SPEECH ENHANCEMENT SCHEME, A. Hussain, University of Dundee, United Kingdom.
5. Paper ID: 627 SPEECH ENHANCEMENT USING NONLINEAR MICROPHONE ARRAY WITH NOISE ADAPTIVE COMPLEMENTARY BEAMFORMING, H. Saruwatari, S. Kajita, K. Takeda, F. Itakura, Nagoya University, Japan.
6. Paper ID: 2520 LOCALIZATION OF MULTIPLE SOUND SOURCES BASED ON A CSP ANALYSIS WITH A MICROPHONE ARRAY, T. Nishiura, Nara Institute of Science and Technology, Japan, T. Yamada, University of Tsukuba, Japan, S. Nakamura, K. Shikano, Nara Institute of Science and Technology, Japan.
7. Paper ID: 661 AUTOMATIC ENHANCEMENT OF SPEECH INTELLIGIBILITY, V. Colotte, Y. Laprie, LORIA, France.
8. Paper ID: 3710 COMBINED ACOUSTIC ECHO AND NOISE REDUCTION USING GSVD-BASED OPTIMAL FILTERING, S. Doclo, M. Moonen, Katholieke Universiteit Leuven, Belgium, E. De Clippel, Philips ITCL, Belgium.


Session:            SP-L5
Time: 09:00 - 12:00, Thursday, June 8, 2000
Location: Convention Center Lower Hall (L3)
Title: SPEAKER RECOGNITION I
Chair: Doug Reynolds, MIT Lincoln Laboratory, USA
1. Paper ID: 1258 SPEAKER-CENTRIC SCORE NORMALISATION AND TIME PATTERN ANALYSIS FOR CONTINUOUS SPEAKER VERIFICATION, R. Auckenthaler, University of Swansea, United Kingdom, M. Carey, Ensigma Ltd., United Kingdom, J. Mason, University of Wales Swansea, United Kingdom.
2. Paper ID: 2089 A PROPOSED LIKELIHOOD TRANSFORMATION FOR SPEAKER VERIFICATION, D. Tran, M. Wagner, University of Canberra, Australia.
3. Paper ID: 2381 THE USE OF SUB-BAND CEPSTRUM IN SPEAKER VERIFICATION, P. Sivakumaran, A. Ariyaeeinia, University of Hertfordshire, United Kingdom.
4. Paper ID: 3945 GENERATION OF OPTIMUM SIGNATURE BASE SEQUENCES FOR SPEECH SIGNALS, B. Yarman, ISIK University, Turkey, R. Akdeniz, Trakya University, Turkey.
5. Paper ID: 2151 EFFECTIVE SPEAKER ADAPTATIONS FOR SPEAKER VERIFICATION, S. Ahn, Korea University, South Korea, S. Kang, Seokyeong University, South Korea, H. Ko, Korea University, South Korea.
6. Paper ID: 1236 GSM SPEECH CODING AND SPEAKER RECOGNITION, L. Besacier, CLIPS/IMAG, France, S. Grassi, A. Dufaux, M. Ansorge, F. Pellandini, Institute of Microtechnology, Switzerland.
7. Paper ID: 2854 SPEAKER RECOGNITION USING G.729 SPEECH CODEC PARAMETERS, T. Quatieri, R. Dunn, D. Reynolds, Massachusetts Institute of Technology, USA, J. Campbell, US Department of Defense, USA, E. Singer, Massachusetts Institute of Technology, USA.
8. Paper ID: 2301 USER VALIDATION FOR MOBILE TELEPHONES, M. Carey, Ensigma Ltd., United Kingdom, R. Auckenthaler, University of Swansea, United Kingdom.
9. Paper ID: 3532 AN INSTANTIABLE SPEECH BIOMETRICS MODULE WITH NATURAL LANGUAGE INTERFACE: IMPLEMENTATION IN THE TELEPHONY ENVIRONMENT, J. Navratil, J. Kleindienst, S. Maes, IBM, USA.
10. Paper ID: 1192 SPEAKER VERIFICATION: MINIMIZING THE CHANNEL EFFECTS USING AUTOASSOCIATIVE NEURAL NETWORK MODELS, S. Kishore, B. Yegnanarayana, Indian Institute of Technology, India.


Session:            SP-L6
Time: 15:30 - 17:30, Thursday, June 8, 2000
Location: Convention Center Lower Hall (L3)
Title: ROBUST RECOGNITION I
Chair: Yifan Gong, Texas Instruments, USA
1. Paper ID: 1671 LDA DERIVED CEPSTRAL TRAJECTORY FILTERS IN ADVERSE ENVIRONMENTAL CONDITIONS, M. Lieb, R. Haeb-Umbach, Philips Forschungslaboratorien, Germany.
2. Paper ID: 907 MAXIMUM LIKELIHOOD JOINT ESTIMATION OF CHANNEL AND NOISE FOR ROBUST SPEECH RECOGNITION, Y. Zhao, University of Missouri-Columbia, USA.
3. Paper ID: 3059 PCA-PMC: A NOVEL USE OF A PRIORI KNOWLEDGE FOR FAST PARALLEL MODEL COMBINATION, R. Sarikaya, J. Hansen, University of Colorado at Boulder, USA.
4. Paper ID: 3380 FEATURE EXTRACTION USING NON-LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION ON THE AURORA DATABASE, S. Sharma, Oregon Graduate Institute of Science and Technology, USA, D. Ellis, International Computer Science Institute, USA, S. Kajarekar, P. Jain, H. Hermansky, Oregon Graduate Institute of Science and Technology, USA.
5. Paper ID: 1191 ASYNCHRONY IN MULTI-BAND SPEECH RECOGNITION, C. Cerisara, D. Fohr, J.-P. Haton, LORIA, France.
6. Paper ID: 1944 RESIDUAL NOISE COMPENSATION FOR ROBUST SPEECH RECOGNITION IN NONSTATIONARY NOISE, K. Yao, Tsinghua University, People's Republic of China, B. Shi, Hong Kong University of Science and Technology, Hong Kong, China, P. Fung, Hong Kong University of Science and Technology , Hong Kong, China, Z. Cao, Tsinghua University, People's Republic of China.
7. Paper ID: 1747 MAXIMUM LIKELIHOOD DISCRIMINANT FEATURE SPACES, G. Saon, M. Padmanabhan, R. Gopinath, S. Chen, IBM T.J. Watson Research Center, USA.
8. Paper ID: 2892 BLIND SPEECH SEPARATION OF MOVING SPEAKERS IN REAL REVERBERANT ENVIRONMENTS, A. Koutras, E. Dermatas, G. Kokkinakis, University of Patras, Greece.


Session:            SP-L7
Time: 15:30 - 18:00, Thursday, June 8, 2000
Location: Sadirvan A - Hilton Hotel
Title: WIDEBAND SPEECH CODING
Chair: Peter Kroon, Lucent Technologies, USA
1. Paper ID: 162 MIXED EXCITATION LINEAR PREDICTION CODING OF WIDEBAND SPEECH AT 8KBPS, W. Lin, S.-N. Koh, X. Lin, Nanyang Technological University, Singapore.
2. Paper ID: 3835 AN EMBEDDED SINUSOIDAL TRANSFORM CODEC WITH MEASURED PHASES AND SAMPLING RATE SCALABILITY, G. Aguilar, Lucent Technologies, USA, J.-H. Chen, Lucent InterNetworking Systems, USA, R. Dunn, R. McAulay, Massachusetts Institute of Technology, USA, X. Sun, W. Wang, R. Zopf, Lucent Technologies, USA.
3. Paper ID: 912 HIGH QUALITY EMBEDDED WIDEBAND SPEECH CODING USING AN INHERENTLY LAYERED CODING PARADIGM, S. Ramprashad, Lucent Technologies, USA.
4. Paper ID: 1444 A 16-KBIT/S BANDWIDTH SCALABLE AUDIO CODER BASED ON THE G.729 STANDARD, K. Koishida, V. Cuperman, A. Gersho, University of California, Santa Barbara, USA.
5. Paper ID: 3109 A 14 KB/S WIDEBAND SPEECH CODER WITH A PARAMETRIC HIGHBAND MODEL, A. McCree, Texas Instruments, USA.
6. Paper ID: 665 HI-BIN: AN ALTERNATIVE APPROACH TO WIDEBAND SPEECH CODING, R. Taori, R. Sluijter, A. Gerrits, Philips Research Laboratories, The Netherlands.
7. Paper ID: 4062 A HIGH-FIDELITY SPEECH AND AUDIO CODEC WITH LOW DELAY AND LOW COMPLEXITY, J.-H. Chen, Lucent Technologies, USA.
8. Paper ID: 1946 A MULTI-RATE WIDEBAND SPEECH CODEC ROBUST TO BACKGROUND NOISE, A. Murashima, M. Serizawa, K. Ozawa, NEC Corporation, Japan.
9. Paper ID: 1441 STOCHASTIC-ALGEBRAIC WIDEBAND LSF QUANTIZATION, S. Ragot, Université de Sherbrooke, Canada, R. Lefebvre, R. Salami, J.-P. Adoul, University of Sherbrooke, Canada.
10. Paper ID: 1199 A SILENCE COMPRESSION ALGORITHM FOR MULTI-RATE/DUAL-BANDWIDTH MPEG-4 CELP STANDARD, M. Serizawa, H. Ito, T. Nomura, NEC Corporation, Japan.


Session:            SP-L8
Time: 09:00 - 12:00, Friday, June 9, 2000
Location: Convention Center Lower Hall (L3)
Title: SPEAKER RECOGNITION II
Chair: M. Demirekler, Middle East Technical University, Turkey
1. Paper ID: 1628 A SPEAKER TRACKING SYSTEM BASED ON SPEAKER TURN DETECTION FOR NIST EVALUATION, J.-F. Bonastre, Laboratoire Informatique d'Avignon, France, P. Delacourt, Eurecom, France, C. Fredouille, T. Merlin, Laboratoire Informatique d'Avignon, France, C. Wellekens, Eurecom, France.
2. Paper ID: 1648 SPEAKER IDENTIFICATION IN MISMATCH TRAINING AND TESTING CONDITIONS, C. Alonso-Martinez, M. Faundez-Zanuy, Escola Universitaria Politecnica de Mataro, Spain.
3. Paper ID: 1709 AN ITERATIVE TECHNIQUE FOR TRAINING SPEAKER VERIFICATION SYSTEMS, W. Campbell, Motorola, USA.
4. Paper ID: 1880 SEARCH-SPACE REDUCTION FOR FAST, OPTIMAL HMM DECODING IN SPEAKER VERIFICATION, Q. Li, Lucent Technologies, USA.
5. Paper ID: 2227 A TWO-STAGE SCORING METHOD COMBINING WORLD AND COHORT MODELS FOR SPEAKER VERIFICATION, W. Zhang, M.-W. Mak, The Hong Kong Polytechnic University, Hong Kong, China, M. He, Ocean University of Qingdao, Hong Kong, China.
6. Paper ID: 2320 BEHAVIOR OF A BAYESIAN ADAPTATION METHOD FOR INCREMENTAL ENROLLMENT IN SPEAKER VERIFICATION, C. Fredouille, Laboratoire Informatique d'Avignon, France, J. Mariethoz, Dalle Molle Institute of Perceptual Artificial Intelligence, Switzerland, C. Jaboulet, J. Hennebert, UBS, Switzerland, J.-F. Bonastre, Laboratoire Informatique d'Avignon, France, C. Mokbel, Universite de St. Joseph, Lebanon, F. Bimbot, IRISA, France.
7. Paper ID: 2665 EVOLUTIVE HMM FOR MULTI-SPEAKER TRACKING SYSTEM, S. Meignier, J.-F. Bonastre, C. Fredouille, T. Merlin, Universite d' Avignon, France.
8. Paper ID: 2835 SPEAKER RECOGNITION IN TWO-SPEAKER DATA: RECENT RESULTS FROM DRAGON SYSTEMS, F. Weber, B. Peskin, M. Newman, L. Gillick, Dragon Systems, Inc., USA.
9. Paper ID: 2861 A NOVEL RANK-BASED CLASSIFIER COMBINATION SCHEME FOR SPEAKER IDENTIFICATION, H. Altincay, M. Demirekler, Middle East Technical University, Turkey.
10. Paper ID: 3067 IMPROVED NORMALIZATION WITHOUT RECOURSE TO AN IMPOSTOR DATABASE FOR SPEAKER VERIFICATION, M. Hebert, S. Peters, Nuance Communications, USA.


Session:            SP-L9
Time: 13:30 - 15:00, Friday, June 9, 2000
Location: Convention Center Lower Hall (L2)
Title: SPOKEN LANGUAGE DIALOGUE
Chair: Roberto Pieraccini, SpeechWorks, USA
1. Paper ID: 1049 PROBABILISTIC SIMULATION OF HUMAN-MACHINE DIALOGUES, K. Scheffler, S. Young, University of Cambridge, United Kingdom.
2. Paper ID: 1151 FUNDAMENTAL PERFORMANCE ANALYSIS FOR SPOKEN DIALOGUE SYSTEMS BASED ON A QUANTITATIVE SIMULATION APPROACH, B.-S. Lin, National Taiwan University, Taiwan, L.-S. Lee, Institute of Information Science, Academia Sinica, Taiwan.
3. Paper ID: 1819 PARSER ADAPTATION VIA HOUSEHOLDER TRANSFORM, X. Luo, IBM T.J. Watson Research Center, USA.
4. Paper ID: 2494 CU FOREX: A BILINGUAL SPOKEN DIALOG SYSTEM FOR FOREIGN EXCHANGE ENQUIRIES, H. Meng, The Chinese University of Hong Kong, Hong Kong, China, S. Lee, SpeechWorks International Ltd., USA, C. Wai, The Chinese University of Hong Kong, Hong Kong, China.
5. Paper ID: 2895 FAST REINFORCEMENT LEARNING OF DIALOG STRATEGIES, D. Goddeau, Compaq Computer Corporation, USA, J. Pineau, Carnegie Mellon University, USA.
6. Paper ID: 697 CONFIDENCE MEASURES FOR DIALOGUE MANAGEMENT IN THE CU COMMUNICATOR SYSTEM, R. San-Segundo, Universidad Politecnica de Madrid, Spain, B. Pellom, W. Ward, University of Colorado at Boulder, USA, J. Pardo, Universidad Politecnica de Madrid, Spain.


Session:            SP-P1
Time: 15:15 - 17:00, Tuesday, June 6, 2000
Location: Ballroom - Hilton Hotel (P6)
Title: ACOUSTIC MODELING II
Chair: Ramesh Gopinath, IBM, USA
1. Paper ID: 1033 TIED POSTERIORS: AN APPROACH FOR EFFECTIVE INTRODUCTION OF CONTEXT DEPENDENCY IN HYBRID NN/HMM LVCSR, J. Rottland, Duisburg University, Germany, G. Rigoll, Gerhard-Mercator-University Duisburg, Germany.
2. Paper ID: 1248 DISCRIMINATIVE RESOLUTION ENHANCEMENT IN ACOUSTIC MODELLING, J. Duchateau, K. Demuynck, P. Wambacq, Katholieke Universiteit Leuven, Belgium.
3. Paper ID: 1978 A SEGMENTAL-FEATURE HMM USING PARAMETRIC TRAJECTORY MODEL, Y.-S. Yun, Y.-H. Oh, Korea Advanced Institute of Science and Technology, South Korea.
4. Paper ID: 2353 SOFT GPD FOR MINIMUM CLASSIFICATION ERROR RATE TRAINING, B. Shi, Hong Kong University of Science and Technology, Hong Kong, China, K. Yao, Z. Cao, Tsinghua University, People's Republic of China.
5. Paper ID: 2846 HETEROGENEOUS LEXICAL UNITS FOR AUTOMATIC SPEECH RECOGNITION: PRELIMINARY INVESTIGATIONS, I. Bazzi, J. Glass, Massachusetts Institute of Technology, USA.
6. Paper ID: 967 ACOUSTIC MODELING FOR CHINESE SPEECH RECOGNITION: A COMPARATIVE STUDY OF MANDARIN AND CANTONESE, S. Gao, T. Lee, Y. Wong, The Chinese University of Hong Kong, People's Republic of China, B. Xu, Chinese Academy of Sciences, People's Republic of China, P. Ching, The Chinese University of Hong Kong, People's Republic of China, T. Huang, Chinese Academy of Sciences, People's Republic of China.
7. Paper ID: 845 AN EFFECTIVE ACOUSTIC MODELING OF NAMES BASED ON MODEL INDUCTION, T. Kim, Korea University, South Korea, S. Kang, Seokyeong University, South Korea, H. Ko, Korea University, South Korea.
8. Paper ID: 3770 A NEW PHONETIC TIED-MIXTURE MODEL FOR EFFICIENT DECODING, A. Lee, T. Kawahara, Kyoto University, Japan, K. Takeda, Nagoya University, Japan, K. Shikano, Nara Institute of Science and Technology, Japan.
9. Paper ID: 1647 AGGLOMERATIVE VS. TREE-BASED CLUSTERING FOR THE DEFINITION OF MULTILINGUAL SET OF TRIPHONES, B. Imperl, Z. Kacic, B. Horvat, A. Zgank, University of Maribor, Slovenia.
10. Paper ID: 3030 INTEGRATING DYNAMIC SPEECH MODALITIES INTO CONTEXT DECISION TREES, C. Fugen, I. Rogina, University of Karlsruhe, Germany.

Session:            SP-P2
Time: 09:00 - 12:00, Wednesday, June 7, 2000
Location: Ballroom - Hilton Hotel (P5)
Title: TOPICS IN SPEECH PROCESSING - PART 1: SPEECH SYNTHESIS & ANALYSIS; PART 2: SPEECH ANALYSIS
Chair: Robert Donovan, IBM, USA
1. Paper ID: 509 A NOVEL APPROACH TO THE FULLY AUTOMATIC EXTRACTION OF FUJISAKI MODEL PARAMETERS, H. Mixdorff, Dresden University of Technology, Germany.
2. Paper ID: 1896 ROBUST GENERATION OF SYMBOLIC PROSODY BY A NEURAL CLASSIFIER BASED ON AUTOASSOCIATORS, A. Muller, H. Zimmermann, R. Neuneier, Siemens, Germany.
3. Paper ID: 4052 IMPROVING INTONATIONAL PHRASING WITH SYNTACTIC INFORMATION, P. Koehn, University of Southern California, USA, S. Abney, J. Hirschberg, M. Collins, AT&T Labs, USA.
4. Paper ID: 2361 AUTOMATIC LEARNING OF NUMERAL GRAMMARS FOR MULTI-LINGUAL SPEECH SYNTHESIZERS, G. Flach, Dresden University of Technology, Germany, M. Holzapfel, Siemens, Germany, C. Just, A. Wachtler, M. Wolff, Dresden University of Technology, Germany.
5. Paper ID: 3894 TIME AND FREQUENCY SCALE MODIFICATION OF SPEECH SIGNALS, B. Ninness, S. Henriksen, University of Newcastle, Australia.
6. Paper ID: 716 SPEECH RECONSTRUCTION FROM MEL FREQUENCY CEPSTRAL COEFFICIENTS AND PITCH FREQUENCY, D. Chazan, R. Hoory, G. Cohen, M. Zibulski, IBM Research, Israel.
7. Paper ID: 2526 IMPROVING THE ROBUSTNESS OF WAVELET TRANSFORM FOR EPOCH DETECTION, Y. Lam, R. Luk, F. Chung, The Hong Kong Polytechnic University, Hong Kong, China.
8. Paper ID: 1466 A WEIGHTED AUTOCORRELATION METHOD FOR PITCH EXTRACTION OF NOISY SPEECH, H. Kobayashi, T. Shimamura, Saitama University, Japan.
9. Paper ID: 2735 PERFORMANCE OF THE PITCH-SCALED HARMONIC FILTER AND APPLICATIONS IN SPEECH ANALYSIS, P. Jackson, C. Shadle, University of Southampton, United Kingdom.
10. Paper ID: 3745 SPEECH PARAMETER GENERATION ALGORITHMS FOR HMM-BASED SPEECH SYNTHESIS, K. Tokuda, T. Yoshimura, Nagoya Institute of Technology, Japan, T. Masuko, T. Kobayashi, Tokyo Institute of Technology, Japan, T. Kitamura, Nagoya Institute of Technology, Japan.
11. Paper ID: 2311 UNSUPERVISED ESTIMATION OF THE HUMAN VOCAL TRACT LENGTH OVER SENTENCE LEVEL UTTERANCES, B. Necioglu, The MITRE Corporation, USA, M. Clements, T. Barnwell, III, Georgia Institute of Technology, USA.
12. Paper ID: 3743 MULTIVARIATE-STATE HIDDEN MARKOV MODELS FOR SIMULTANEOUS TRANSCRIPTION OF PHONES AND FORMANTS, M. Hasegawa-Johnson, University of Illinois at Urbana-Champaign, USA.
13. Paper ID: 4085 ON THE MUTUAL INFORMATION BETWEEN FREQUENCY BANDS IN SPEECH, M. Nilsson, S. Vang Andersen, W. Kleijn, Royal Institute of Technology, Sweden.
14. Paper ID: 3273 STUDY OF TALKER INDIVIDUALITY BY USING ARX SPEECH ANALYSIS-SYNTHESIS-EDITING SYSTEM, W. Zhu, K. Matsui, Matsushita Electric Industrial Co., Ltd., Japan, H. Kasuya, Utsunomiya University, Japan.
15. Paper ID: 2863 LINGUISTIC PROPERTIES OF NON-NATIVE SPEECH, L. Mayfield Tomokiyo, Carnegie Mellon University, USA.
16. Paper ID: 328 VISUAL APPROACH FOR AUTOMATIC PITCH PERIOD ESTIMATION, Z. Sen, K. Shirai, Waseda University, Japan.
17. Paper ID: 887 ROBUST PITCH TRACKING FOR PROSODIC MODELING IN TELEPHONE SPEECH, C. Wang, S. Seneff, Massachusetts Institute of Technology, USA.
18. Paper ID: 2670 PERCEPTUAL EFFECTS OF COARTICULATION IN FRICATIVES, S. Fernandez, S. Feijoo, R. Balsa, N. Barros, University of Santiago de Compostela, Spain.
19. Paper ID: 2774 MEL-SCALED DISCRETE WAVELET COEFFICIENTS FOR SPEECH RECOGNITION, J. Gowdy, Z. Tufekci, Clemson University, USA.
20. Paper ID: 1222 ON-LINE SPEAKING RATE ESTIMATION USING GAUSSIAN MIXTURE MODELS, R. Faltlhauser, T. Pfau, G. Ruske, Technische Universität München, Germany.


Session:            SP-P3
Time: 15:15 - 17:00, Wednesday, June 7, 2000
Location: Ballroom - Hilton Hotel (P5)
Title: LOW BIT RATE SPEECH CODING
Chair: Alan McCree, Texas Instruments, USA
1. Paper ID: 397 WAVEFORM EXTRACTION FOR PERFECT RECONSTRUCTION IN WI CODING, V. Ruoppila, University of Sherbrooke, Canada, M. Tammi, J. Saarinen, Tampere University of Technology, Finland.
2. Paper ID: 782 HIGH QUALITY ENHANCED WAVEFORM INTERPOLATIVE CODING AT 2.8 KBPS, O. Gottesman, A. Gersho, University of California, Santa Barbara, USA.
3. Paper ID: 1121 ANALYSIS-BY-SYNTHESIS MULTIMODE HARMONIC SPEECH CODING AT 4 KB/S, C. Li, V. Cuperman, University of California, Santa Barbara, USA.
4. Paper ID: 1864 SPEECH CODING WITH AN ANALYSIS-BY-SYNTHESIS SINUSOIDAL MODEL, C. Etemoglu, V. Cuperman, A. Gersho, University of California, Santa Barbara, USA.
5. Paper ID: 3143 A 1200 BPS SPEECH CODER BASED ON MELP, T. Wang, K. Koishida, V. Cuperman, A. Gersho, SignalCom, Inc., USA, J. Collura, National Security Agency, USA.
6. Paper ID: 3190 A 4 KB/S HYBRID MELP/CELP CODER WITH ALIGNMENT PHASE ENCODING AND ZERO PHASE EQUALIZATION, J. Stachurski, A. McCree, Texas Instruments, USA.
7. Paper ID: 4074 PERCEPTUAL PHASE REDUNDANCY IN SPEECH, D.-S. Kim, Samsung Advanced Institute of Technology, South Korea.
8. Paper ID: 1891 A COMBINED WI AND MELP CODER AT 5.2 KBPS, J. Skoglund, R. Cox, AT&T Labs, USA, J. Collura, National Security Agency, USA.
9. Paper ID: 127 A BACKGROUND NOISE REDUCTION TECHNIQUE BASED ON SINUSOIDAL SPEECH CODING SYSTEMS, S. Yeldener, J. Rieser, COMSAT Laboratories, USA.
10. Paper ID: 3325 VARIABLE RATE MULTI-MODE EXCITATION CODING OF SPEECH AT 2.4KBPS, S. Wang, Atmel Corporation, USA.


Session:            SP-P4
Time: 15:15 - 17:00, Wednesday, June 7, 2000
Location: Ballroom - Hilton Hotel (P6)
Title: TOPICS IN SPEECH RECOGNITION I
Chair: Philip Loizou, University of Texas, Dallas, USA
1. Paper ID: 1998 SPEECH/NON-SPEECH CLASSIFICATION USING MULTIPLE FEATURES FOR ROBUST ENDPOINT DETECTION, W.-H. Shin, B.-S. Lee, Y.-K. Lee, J.-S. Lee, LG Corporate Institute of Technology, South Korea.
2. Paper ID: 1587 SPEECH RECOGNITION FOR A DISTANT MOVING SPEAKER BASED ON HMM COMPOSITION AND SEPARATION, T. Takiguchi, IBM Tokyo Research Laboratory, Japan, S. Nakamura, K. Shikano, Nara Institute of Science and Technology, Japan.
3. Paper ID: 2274 HANDS-FREE SPEECH RECOGNITION USING A FILTERED CLEAN CORPUS AND INCREMENTAL HMM ADAPTATION, M. Matassoni, M. Omologo, D. Giuliani, ITC-irst, Italy.
4. Paper ID: 3320 HMM ADAPTATION AND MICROPHONE ARRAY PROCESSING FOR DISTANT SPEECH RECOGNITION, J. Kleban, Rutgers University, USA, Y. Gong, Texas Instruments, USA.
5. Paper ID: 2356 COMPARING ACOUSTIC FEATURES FOR ROBUST ASR IN FIXED AND CELLULAR NETWORK APPLICATIONS, F. de Wet, B. Cranen, J. de Veth, L. Boves, University of Nijmegen, The Netherlands.
6. Paper ID: 2741 ANCHORING HYPOTHESIS AND ITS APPLICATION TO TONE RECOGNITION OF CHINESE CONTINUOUS SPEECH, J.-S. Zhang, K. Hirose, The University of Tokyo, Japan.
7. Paper ID: 980 STRATEGIES FOR AUTOMATIC SEGMENTATION OF AUDIO DATA, T. Kemp, M. Schmidt, M. Westphal, A. Waibel, University of Karlsruhe, Germany.
8. Paper ID: 1738 A METHOD FOR DIRECT AUDIO SEARCH WITH APPLICATIONS TO INDEXING AND RETRIEVAL, S. Johnson, P. Woodland, University of Cambridge, United Kingdom.
9. Paper ID: 135 THE STUDY ON DISTRIBUTED SPEECH RECOGNITION SYSTEM, W. Zhang, L. He, Intel Corporation, People's Republic of China, Y.-L. Chow, Lernout & Hauspie, Singapore, R. Yang, Y. Su, Intel Corporation, People's Republic of China.
10. Paper ID: 3367 CONVERSATIONAL SPEECH RECOGNITION USING ACOUSTIC AND ARTICULATORY INPUT, K. Kirchhoff, University of Washington, USA, G. Fink, G. Sagerer, University of Bielefeld, Germany.


Session:            SP-P5
Time: 09:00 - 12:00, Thursday, June 8, 2000
Location: Ballroom - Hilton Hotel (P5)
Title: TOPICS IN SPEECH CODING - PART 1; PART 2
Chair: Raymond Chen, Lucent Technologies, USA
1. Paper ID: 96 HARMONIC EXPONENTIAL MODELING OF TRANSITIONAL SPEECH SEGMENTS, J. Jensen, S. Jensen, E. Hansen, Aalborg University, Denmark.
2. Paper ID: 1993 VARIABLE DIMENSIONAL ALGEBRAIC CELP CODING OF PROTOTYPE WAVEFORMS, J. Sohn, W. Sung, Seoul National University, South Korea.
3. Paper ID: 2668 LOW-RATE QUANTIZATION OF SPECTRUM PARAMETERS, T. Eriksson, Chalmers University of Technology, Sweden, H.-G. Kang, AT&T Labs, USA, P. Hedelin, Chalmers University of Technology, Sweden.
4. Paper ID: 3060 RECURSIVE LPC SPECTRUM CODING - A CLASSIFIED VQ APPROACH, F. Norden, J. Samuelsson, P. Hedelin, Chalmers University of Technology, Sweden.
5. Paper ID: 4146 ROBUST APPLICATION OF DISCRETE ALL-POLE MODELING TO SINUSOIDAL TRANSFORM CODING, D. Molyneux, M.-S. Ho, B. Cheetham, University of Manchester, United Kingdom.
6. Paper ID: 272 PREDICTIVE AND MEL-SCALE BINARY VECTOR QUANTIZATION OF VARIABLE DIMENSION SPECTRAL MAGNITUDE, Y. Cho, University of Surrey, United Kingdom, M. Kim, Samsung Advanced Institute of Technology, South Korea, A. Kondoz, University of Surrey, United Kingdom.
7. Paper ID: 374 ENCODING SINUSOIDAL AMPLITUDES WITH A MINIMUM PHASE RATIONAL MODEL, N. Malik, W. Holmes, University of New South Wales, Australia.
8. Paper ID: 944 PHASE AND TRANSIENT MODELING FOR HARMONIC+NOISE SPEECH CODING, E. Yu, C.-F. Chan, City University of Hong Kong, Hong Kong, China.
9. Paper ID: 2101 LINEAR PREDICTION INCORPORATING SIMULTANEOUS MASKING, J. Lukasiak, I. Burnett, J. Chicharo, University of Wollongong, Australia, M. Thomson, Motorola, Australia.
10. Paper ID: 4112 A FRAME INTERPRETATION OF SINUSOIDAL CODING AND WAVEFORM INTERPOLATION, W. Kleijn, Royal Institute of Technology, Sweden.
11. Paper ID: 3875 OPTIMIZED ESTIMATION OF SPECTRAL PARAMETERS FOR THE CODING OF NOISY SPEECH, R. Martin, I. Wittke, P. Jax, Institute of Communication Systems and Data Processing, Germany.
12. Paper ID: 1784 IMPROVED FRAME ERASURE CONCEALMENT FOR CELP-BASED CODERS, J. De Martin, Politecnico di Torino, Italy, T. Unno, V. Viswanathan, Texas Instruments, USA.
13. Paper ID: 709 A CELP-BASED HYBRID DIGITAL-ANALOG (HDA) JOINT SOURCE-CHANNEL SPEECH CODER, N. Phamdo, U. Mittal, State University of New York at Stony Brook, USA.
14. Paper ID: 1077 PITCH-SYNCHRONOUS LINEAR-PREDICTION ANALYSIS BY SYNTHESIS WITH REDUCED PULSE DENSITIES, D. Guerchi, Y. Qian, P. Mermelstein, Universite du Quebec, Canada.
15. Paper ID: 1497 SHAPED FIXED CODEBOOK SEARCH FOR CELP CODING AT LOW BIT RATES, E. Erzin, Lucent Technologies, USA.
16. Paper ID: 2757 DIGITAL WATERMARKING OF SPEECH SIGNALS FOR THE NATIONAL GALLERY OF THE SPOKEN WORD, F. Ruiz, J. Deller, Jr., Michigan State University, USA.
17. Paper ID: 3546 DISPERSED-PULSE CODEBOOK AND ITS APPLICATION TO A 4KB/S SPEECH CODER, K. Yasunaga, Matsushita Research Institute Tokyo, Inc., Japan, H. Ehara, K. Yoshida, Matsushita Communication Industrial Co., Ltd., Japan, T. Morii, Matsushita Research Institute Tokyo, Inc., Japan.
18. Paper ID: 3637 JOINT SOURCE - CHANNEL MMSE-DECODING OF SPEECH PARAMETERS, S. Heinen, P. Vary, Aachen University of Technology, Germany.
19. Paper ID: 3126 SPEECH QUALITY OBJECTIVE ASSESSMENT USING NEURAL NETWORK, Q. Fu, K. Yi, Xidian University, People's Republic of China, M. Sun, University of Pittsburgh, USA.
20. Paper ID: 648 THE PERCEPTUAL ANALYSIS MEASUREMENT SYSTEM FOR ROBUST END-TO-END SPEECH QUALITY ASSESSMENT, A. Rix, M. Hollier, BT, United Kingdom.


Session:            SP-P6
Time: 09:00 - 12:00, Thursday, June 8, 2000
Location: Ballroom - Hilton Hotel (P6)
Title: TOPICS IN LVCSR I - PART 1: FAST DECODING AND ADAPTATION; PART 2
Chair: Michael Picheny, IBM, USA
1. Paper ID: 1453 RAPID LIKELIHOOD CALCULATION OF SUBSPACE CLUSTERED GAUSSIAN COMPONENTS, A. Aiyer, Stanford University, USA, M. Gales, University of Cambridge, United Kingdom, M. Picheny, IBM , USA.
2. Paper ID: 3718 PITCH TRACKING AND TONE FEATURES FOR MANDARIN SPEECH RECOGNITION, H. Huang, F. Seide, Philips Innovation Center, Taiwan.
3. Paper ID: 1411 BOOSTING GAUSSIAN MIXTURES IN AN LVCSR SYSTEM, G. Zweig, IBM, USA, M. Padmanabhan, IBM T.J. Watson Research Center, USA.
4. Paper ID: 3948 USING SIMD INSTRUCTIONS FOR FAST LIKELIHOOD CALCULATION IN LVCSR, S. Kanthak, Aachen University of Technology, Germany, K. Schutz, AXYS Design Automation for Embedded Systems, Germany, H. Ney, Aachen University of Technology, Germany.
5. Paper ID: 873 FAST DECODING IN LARGE VOCABULARY NAME DIALING, J. Suontausta, J. Hakkinen, O. Viikki, Nokia Research Center, Finland.
6. Paper ID: 1741 ON THE INCREMENTAL ADDITION OF REGRESSION CLASSES FOR SPEAKER ADAPTATION, J. McDonough, V. Venkataramani, W. Byrne, The Johns Hopkins University, USA.
7. Paper ID: 1755 INTER-CLASS MLLR FOR SPEAKER ADAPTATION, S.-J. Doh, R. Stern, Carnegie Mellon University, USA.
8. Paper ID: 3313 MODEL ADAPTATION IN LINE SPECTRUM DOMAIN, A.-T. Yu, H.-C. Wang, National Tsing Hua University, Taiwan.
9. Paper ID: 3558 MAP ADAPTATION WITH SUBSPACE REGRESSION CLASSES AND TYING, K.-M. Wong, B. Mak, Hong Kong University of Science and Technology, Hong Kong, China.
10. Paper ID: 1001 DUCODER - THE DUISBURG UNIVERSITY LVCSR STACKDECODER, D. Willett, C. Neukirchen, G. Rigoll, Gerhard-Mercator-University Duisburg, Germany.
11. Paper ID: 1937 PROGRESSIVE 2-PASS DECODER FOR REAL-TIME BROADCAST NEWS CAPTIONING, T. Imai, A. Kobayashi, S. Sato, H. Tanaka, A. Ando, NHK Science and Technical Research Laboratories, Japan.
12. Paper ID: 3688 TURKISH LVCSR: TOWARDS BETTER SPEECH RECOGNITION FOR AGGLUTINATIVE LANGUAGES, K. Carki, P. Geutner, T. Schultz, University of Karlsruhe, Germany.
13. Paper ID: 3829 PERFORMANCE OF LVCSR WITH MORPHEME-BASED AND SYLLABLE-BASED RECOGNITION UNITS, O.-W. Kwon, ETRI, South Korea.
14. Paper ID: 2312 EMPLOYING HETEROGENEOUS INFORMATION IN A MULTI-STREAM FRAMEWORK, H. Christensen, B. Lindberg, O. Andersen, Aalborg University, Denmark.
15. Paper ID: 2908 DICTATION OF MULTIPARTY CONVERSATION USING STATISTICAL TURN TAKING MODEL AND SPEAKER MODEL, N. Murai, T. Kobayashi, Waseda University, Japan.
16. Paper ID: 3733 AUTOMATIC SPEECH SUMMARIZATION BASED ON WORD SIGNIFICANCE AND LINGUISTIC LIKELIHOOD, C. Hori, S. Furui, Tokyo Institute of Technology, Japan.
17. Paper ID: 393 STATISTICAL KNOWLEDGE BASED FRAME SYNCHRONOUS SEARCH STRATEGIES IN CONTINUOUS SPEECH RECOGNITION, Z. Song, F. Zheng, W. Wu, Tsinghua University, People's Republic of China.
18. Paper ID: 563 USING POSTERIOR WORD PROBABILITIES FOR IMPROVED SPEECH RECOGNITION, F. Wessel, R. Schlüter, H. Ney, Aachen University of Technology, Germany.
19. Paper ID: 1048 VARIABLE WORD RATE N-GRAMS, Y. Gotoh, S. Renals, University of Sheffield, United Kingdom.
20. Paper ID: 1131 INTEGRATING DETAILED INFORMATION INTO A LANGUAGE MODEL, R. Zhang, E. Black, A. Finch, Y. Sagisaka, ATR Laboratories, Japan.


Session:            SP-P7
Time: 15:15 - 17:00, Thursday, June 8, 2000
Location: Ballroom - Hilton Hotel (P6)
Title: FEATUE EXTRACTION
Chair: Kuldip Paliwal, Griffith University, Australia
1. Paper ID: 3408 AN OPTIMAL BHATTACHARYYA CENTROID ALGORITHM FOR GAUSSIAN CLUSTERING WITH APPLICATIONS IN AUTOMATIC SPEECH RECOGNITION, L. Rigazio, B. Tsakam, J.-C. Junqua, Panasonic Technologies, Inc., USA.
2. Paper ID: 655 EAR-MODEL DERIVED FEATURES FOR AUTOMATIC SPEECH RECOGNITION, R. de Mori, Universite d' Avignon, France, D. Albesano, R. Gemello, F. Mana, CSELT, Italy.
3. Paper ID: 657 BITSTREAM-BASED FEATURE EXTRACTION FOR WIRELESS SPEECH RECOGNITION, H. Kim, R. Cox, AT&T Labs, USA.
4. Paper ID: 695 A FUZZY APPROACH FOR THE EQUALIZATION OF CEPSTRAL VARIANCES, W.-W. Hung, Ming-Chi Institute of Technology, Taiwan, H.-C. Wang, National Tsing Hua University, Taiwan.
5. Paper ID: 987 A NEW APPROACH TO DISCRIMINATIVE FEATURE EXTRACTION USING MODEL TRANSFORMATION, M. Thomae, DaimlerChrysler AG, Germany, G. Ruske, T. Pfau, Technische Universität München, Germany.
6. Paper ID: 2330 A MARKOV RANDOM FIELD BASED MULTI-BAND MODEL, G. Gravier, M. Sigelle, G. Chollet, CNRS-ENST, France.
7. Paper ID: 3141 AUDITORY-BASED SPEECH PROCESSING BASED ON THE AVERAGE LOCALIZED SYNCHRONY DETECTION, A. Ali, J. Van der Spiegel, University of Pennsylvania, USA, P. Mueller, Corticon, Inc., USA.
8. Paper ID: 3324 DATA-DRIVEN RASTA FILTERS IN REVERBERATION, M. Shire, B. Chen, University of California, Berkeley, USA.
9. Paper ID: 3434 SPEECH FEATURE EXTRACTION USING INDEPENDENT COMPONENT ANALYSIS, J.-H. Lee, Korea Advanced Institute of Science and Technology, South Korea, H.-Y. Jung, Electronics and Telecommunications Research Institute, South Korea, T.-W. Lee, University of California, San Diego, USA, S.-Y. Lee, Korea Advanced Institute of Science and Technology, South Korea.
10. Paper ID: 3476 TANDEM CONNECTIONIST FEATURE EXTRACTION FOR CONVENTIONAL HMM SYSTEMS, H. Hermansky, Oregon Graduate Institute of Science and Technology, USA, D. Ellis, International Computer Science Institute, USA, S. Sharma, Oregon Graduate Institute of Science and Technology, USA.


Session:            SP-P8
Time: 09:00 - 12:00, Friday, June 9, 2000
Location: Ballroom - Hilton Hotel (P5)
Title: TOPICS IN LVCSR II - PART 1: LANGUAGE MODELING & SEARCH TECHNIQUES; PART 2: PRONUNCIATION & LANGUAGE MODELING
Chair: Rafid Sukkar, Lucent Technologies, USA
1. Paper ID: 1922 A UNIFIED CONTEXT-FREE GRAMMAR AND N-GRAM MODEL FOR SPOKEN LANGUAGE PROCESSING, Y.-Y. Wang, M. Mahajan, X. Huang, Microsoft Corporation, USA.
2. Paper ID: 2012 INTEGRATING A CONTEXT-DEPENDENT PHRASE GRAMMAR IN THE VARIABLE N-GRAM FRAMEWORK, M.-H. Siu, Hong Kong University of Science and Technology, Hong Kong, China, M. Ostendorf, University of Washington, USA.
3. Paper ID: 1960 PUTTING IT ALL TOGETHER: LANGUAGE MODEL COMBINATION, J. Goodman, Microsoft Corporation, USA.
4. Paper ID: 880 TOWARDS A LARGE-VOCABULARY FRENCH VOCAL DICTATION BASED ON A SIZE-INDEPENDENT LANGUAGE-MODEL SEARCH USING THE INRS RECOGNIZER, H. Tolba, D. O'Shaughnessy, Universite du Quebec, Canada.
5. Paper ID: 2366 LARGE VOCABULARY DECODING AND CONFIDENCE ESTIMATION USING WORD POSTERIOR PROBABILITIES, G. Evermann, P. Woodland, University of Cambridge, United Kingdom.
6. Paper ID: 2169 EFFICIENT INTEGRATION OF MULTIPLE PRONUNCIATIONS IN A LARGE VOCABULARY DECODER, H. Schramm, X. Aubert, Philips Research Laboratories, Germany.
7. Paper ID: 3268 TRANSCRIPTION AND INDEXATION OF BROADCAST DATA, J.-L. Gauvain, L. Lamel, Y. de Kercadio, G. Adda, CNRS, France.
8. Paper ID: 3918 A BASELINE FOR THE TRANSCRIPTION OF ITALIAN BROADCAST NEWS, F. Brugnara, M. Cettolo, M. Federico, D. Giuliani, ITC-irst, Italy.
9. Paper ID: 3956 RECENT IMPROVEMENTS OF THE RWTH LARGE VOCABULARY SPEECH RECOGNITION SYSTEM ON SPONTANEOUS SPEECH, A. Sixtus, S. Molau, S. Kanthak, R. Schlüter, H. Ney, Aachen University of Technology, Germany.
10. Paper ID: 3766 FRENCH LARGE VOCABULARY RECOGNITION WITH CROSS-WORD PHONOLOGY TRANSDUCERS, G. Boulianne, J. Brousseau, P. Ouellet, P. Dumouchel, Centre de recherche informatique de Montréal, Canada.
11. Paper ID: 587 PRONUNCIATION AMBIGUITY VS PRONUNCIATION VARIABILITY IN SPEECH RECOGNITION, M. Saraclar, S. Khudanpur, The Johns Hopkins University, USA.
12. Paper ID: 1842 LEXICAL MODELING OF NON-NATIVE SPEECH FOR AUTOMATIC SPEECH RECOGNITION, K. Livescu, J. Glass, Massachusetts Institute of Technology, USA.
13. Paper ID: 2669 DATA - DRIVEN GENERATION OF PRONUNCIATION DICTIONARIES IN THE GERMAN VERBMOBIL PROJECT - DISCUSSION OF EXPERIMENTAL RESULTS, M. Eichner, M. Wolff, Dresden University of Technology, Germany.
14. Paper ID: 3644 AUTOMATIC GENERATION OF PHONE SETS AND LEXICAL TRANSCRIPTIONS, R. Singh, B. Raj, R. Stern, Carnegie Mellon University, USA.
15. Paper ID: 973 SELECTING ARTICLES FROM THE LANGUAGE MODEL TRAINING CORPUS, D. Klakow, Philips Research Laboratories, Germany.
16. Paper ID: 1367 SYNTACTIC HEADS IN STATISTICAL LANGUAGE MODELING, J. Wu, S. Khudanpur, The Johns Hopkins University, USA.
17. Paper ID: 1980 A UNIFIED APPROACH TO STATISTICAL LANGUAGE MODELING FOR CHINESE, J. Gao, H.-F. Wang, M. Li, K.-F. Lee, Microsoft Corporation, People's Republic of China.
18. Paper ID: 3730 POLYPHONE DECISION TREE SPECIALIZATION FOR LANGUAGE ADAPTATION, T. Schultz, University of Karlsruhe, Germany, A. Waibel, Carnegie Mellon University, USA.
19. Paper ID: 1326 ENHANCED LANGUAGE MODELLING WITH PHONOLOGICALLY CONSTRAINED MORPHOLOGICAL ANALYSIS, A. Fang, M. Huckvale, University College, London, United Kingdom.
20. Paper ID: 660 LONG RANGE LANGUAGE MODELS FOR FREE SPELLING RECOGNITION, F. Thiele, B. Rueber, D. Klakow, Philips Research Laboratories, Germany.


Session:            SP-P9
Time: 09:00 - 12:00, Friday, June 9, 2000
Location: Ballroom - Hilton Hotel (P6)
Title: TOPICS IN ASR - PART 1: ROBUSTNESS; PART 2: ACOUSTIC MODELING
Chair: Jean-Claude Junqua, Panasonic Laboratory, USA
1. Paper ID: 339 MODEL-BASED FEATURE ENHANCEMENT FOR NOISY SPEECH RECOGNITION, C. Couvreur, H. Van hamme, Lernout & Hauspie, Belgium.
2. Paper ID: 526 ROBUST SPEECH RECOGNITION USING NEAR-FIELD SUPERDIRECTIVE BEAMFORMING WITH POST-FILTERING, I. McCowan, Queensland University of Technology, Australia, C. Marro, L. Mauuary, France Telecom, France.
3. Paper ID: 784 NOISY SPEECH RECOGNITION USING NOISE REDUCTION METHOD BASED ON KALMAN FILTER, M. Fujimoto, Y. Ariki, Ryukoku University, Japan.
4. Paper ID: 819 STATISTICAL ESTIMATION OF UNRELIABLE FEATURES FOR ROBUST SPEECH RECOGNITION, P. Renevey, A. Drygajlo, Swiss Federal Institute of Technology, Switzerland.
5. Paper ID: 346 HANDS-FREE VOICE ACTIVATION OF PERSONAL COMMUNICATION DEVICES, S. Bou-Ghazale, A. Asadi, Conexant Systems Inc., USA.
6. Paper ID: 594 AN ACOUSTIC MEASURE FOR PREDICTING RECOGNITION PERFORMANCE DEGRADATION, K. Takeda, M. Kondo, F. Itakura, Nagoya University, Japan.
7. Paper ID: 839 LOW COMPLEXITY SPEAKER INDEPENDENT COMMAND WORD RECOGNITION IN CAR ENVIRONMENTS, S. Riis, Nokia Mobile Phones, Inc., Denmark, O. Viikki, Nokia Research Center, Finland.
8. Paper ID: 625 SPEECH RECOGNITION BASED ON SPACE DIVERSITY USING DISTRIBUTED MULTI-MICROPHONE, Y. Shimizu, S. Kajita, K. Takeda, F. Itakura, Nagoya University, Japan.
9. Paper ID: 989 A NOVEL APPROACH TO ROBUST SPEECH ENDPOINT DETECTION IN CAR ENVIRONMENTS, L.-S. Huang, C.-H. Yang, Panasonic Taiwan Laboratories Co., Ltd., Taiwan.
10. Paper ID: 1292 DETECTING THE END OF SPELLINGS USING STATISTICS ON RECOGNIZED LETTER SEQUENCES FOR SPELLED NAMES RECOGNITION, S. Hanel, D. Jouvet, France Telecom, France.
11. Paper ID: 1610 DECISION TREE BASED MANDARIN TONE MODEL AND ITS APPLICATION TO SPEECH RECOGNITION, Y. Cao, Y. Deng, H. Zhang, T. Huang, B. Xu, Chinese Academy of Sciences, People's Republic of China.
12. Paper ID: 1557 DETECTION OF PROSODIC WORD BOUNDARIES BY STATISTICAL MODELING OF MORA TRANSITIONS OF FUNDAMENTAL FREQUENCY CONTOURS AND ITS USE FOR CONTINUOUS SPEECH RECOGNITION, K. Hirose, K. Iwano, The University of Tokyo, Japan.
13. Paper ID: 713 PARAMETER OPTIMIZATION FOR VOCAL TRACT LENGTH NORMALIZATION, P. Dognin, A. El-Jaroudi, University of Pittsburgh, USA, J. Billa, GTE Internetworking/BBN Technologies, USA.
14. Paper ID: 2985 RETRIEVAL OF BROADCAST NEWS SPEECH IN MANDARIN CHINESE COLLECTED IN TAIWAN USING SYLLABLE-LEVEL STATISTICAL CHARACTERISTICS, B. Chen, H.-M. Wang, L.-S. Lee, Institute of Information Science, Academia Sinica, Taiwan.
15. Paper ID: 934 WORD-LEVEL RATE OF SPEECH MODELING USING RATE-SPECIFIC PHONES AND PRONUNCIATIONS, J. Zheng, H. Franco, F. Weng, SRI International, USA, A. Sankar, Nuance Communications, USA, H. Bratt, SRI International, USA.
16. Paper ID: 3800 SPECIALIZED ACOUSTIC MODELS FOR HYPERARTICULATED SPEECH, H. Soltau, A. Waibel, University of Karlsruhe, Germany.
17. Paper ID: 3264 ON THE USE OF VARIABLE FRAME RATE ANALYSIS IN SPEECH RECOGNITION, Q. Zhu, A. Alwan, University of California, Los Angeles, USA.
18. Paper ID: 739 A PROBABILISTIC UNION MODEL FOR SUB-BAND BASED ROBUST SPEECH RECOGNITION, J. Ming, F. Smith, The Queen's University of Belfast, United Kingdom.
19. Paper ID: 3726 ROBUST SPEECH RECOGNITION OVER IP NETWORKS, B. Milner, S. Semnani, BT, United Kingdom.
20. Paper ID: 4096 FAST SPEAKER ADAPTATION OF ARTIFICIAL NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION, S. Dupont, L. Cheboub, Polytechnique de Mons, Belgium.


Session:            SP-P10
Time: 13:30 - 15:00, Friday, June 9, 2000
Location: Ballroom - Hilton Hotel (P5)
Title: CONFIDENCE MEASURES/REJECTION
Chair: Mark Clements, Geogia Institute of Technology, USA
1. Paper ID: 1894 WORD AND PHONE LEVEL ACOUSTIC CONFIDENCE SCORING, S. Kamppari, T. Hazen, Massachusetts Institute of Technology, USA.
2. Paper ID: 1651 CONTEXTUAL CONFIDENCE MEASURES FOR CONTINUOUS SPEECH RECOGNITION, G. Hernandez-Abrego, J. Marino, Universidad Politecnica de Cataluña, Spain.
3. Paper ID: 2324 CONFIDENCE MEASURE AND INCREMENTAL ADAPTATION FOR THE REJECTION OF INCORRECT DATA, N. Moreau, D. Charlet, D. Jouvet, France Telecom, France.
4. Paper ID: 1834 ROBUST OUT-OF-VOCABULARY REJECTION FOR LOW-COMPLEXITY SPEAKER INDEPENDENT SPEECH RECOGNITION, C. Broun, W. Campbell, Motorola, USA.
5. Paper ID: 3958 META-MODELS FOR CONFIDENCE ESTIMATION IN SPEECH RECOGNITION, S. Dasmahapatra, S. Cox, University of East Anglia, United Kingdom.
6. Paper ID: 3148 EVALUATION OF VARIOUS CONFIDENCE-BASED STRATEGIES FOR ISOLATED WORD REJECTION, E. Tsiporkova, F. Vanpoucke, H. Van hamme, Lernout & Hauspie, Belgium.
7. Paper ID: 1782 USE OF WORD LEVEL SIDE INFORMATION TO IMPROVE SPEECH RECOGNITION, D. Vergyri, The Johns Hopkins University, USA.
8. Paper ID: 2338 CONFIDENCE MEASURE BASED LANGUAGE IDENTIFICATION, F. Metze, T. Kemp, T. Schaaf, T. Schultz, H. Soltau, University of Karlsruhe, Germany.
9. Paper ID: 1966 A NEW KEYWORD SPOTTING APPROACH BASED ON ITERATIVE DYNAMIC PROGRAMMING, M. Silaghi, Swiss Federal Institute of Technology, Lausanne, Switzerland, H. Bourlard, Dalle Molle Institute of Perceptual Artificial Intelligence, Switzerland.
10. Paper ID: 1006 FRAME DISCRIMINATIVE AND CONFIDENCE-DRIVEN ADAPTATION FOR LVCSR, F. Wallhoff, D. Willett, G. Rigoll, Gerhard-Mercator-University Duisburg, Germany.


Session:            SP-P11
Time: 13:30 - 15:00, Friday, June 9, 2000
Location: Ballroom - Hilton Hotel (P6)
Title: SPEECH ENHANCEMENT II
Chair: Richard Cox, AT&T Laboratories, USA
1. Paper ID: 156 AN ADAPTIVE SUBSPACE APPROACH FOR SPEECH ENHANCEMENT, S. Gazor, A. Rezayee, Queen's University, Canada.
2. Paper ID: 674 NARROWBAND TO WIDEBAND CONVERSION OF SPEECH USING GMM BASED TRANSFORMATION, K.-Y. Park, H. Kim, Pusan National University, South Korea.
3. Paper ID: 705 SIGNAL/NOISE KLT BASED APPROACH FOR ENHANCING SPEECH DEGRADED BY COLORED NOISE, U. Mittal, N. Phamdo, State University of New York at Stony Brook, USA.
4. Paper ID: 1266 LOW-BAND EXTENSION OF TELEPHONE-BAND SPEECH, G. Miet, Philips Consumer Corporation, France, A. Gerrits, Philips Research Laboratories, The Netherlands, J.-C. Valière, CNRS, France.
5. Paper ID: 1833 A HYBRID SPEECH ENHANCEMENT SYSTEM BASED ON HMM AND SPECTRAL SUBTRACTION, M. Ghoreishi, H. Sheikhzadeh, AmirKabir University of Technology, Iran.
6. Paper ID: 2935 ENHANCEMENT OF SPEECH BASED ON NON-PARAMETRIC ESTIMATION OF A TIME VARYING HARMONIC REPRESENTATION, S. Dubost, O. Cappe, Ecole Nationale Superieure des Telecommunications, France.
7. Paper ID: 3026 INTEGRATED NOISE REDUCTION AND ECHO CANCELLATION FOR IS-136 SYSTEMS, F. Basbug, K. Swaminathan, S. Nandkumar, Hughes Network Systems, USA.
8. Paper ID: 268 AN IMPROVED CUMULANT-BASED BLIND SPEECH SEPARATION METHOD, Y. Su, L. He, R. Yang, Intel Corporation, People's Republic of China.
9. Paper ID: 3709 IMPULSIVE NOISE SUPPRESSION USING NEURAL NETWORKS, I. Potamitis, N. Fakotakis, G. Kokkinakis, University of Patras, Greece.
10. Paper ID: 3782 QUANTILE BASED NOISE ESTIMATION FOR SPECTRAL SUBTRACTION AND WIENER FILTERING, V. Stahl, A. Fischer, R. Bippus, Philips Research Laboratories, Germany.

Go to top of the page

Go to Schedule of Sessions

Questions on Technical Program:

Murat Tekalp
Electrical Engineering Department
University of Rochester
Rochester, NY 14627
(716) 275-3774 (Voice)
(716) 473-0486 (Fax)
tekalp@ee.rochester.edu

Bülent Sankur
Department of Electrical and Electronic Engineering
Bogazici University
TR-80815, Bebek
Istanbul, Turkey
+90 (212) 263-1500/1414 (Voice)
+90 (212) 287-246 (Fax)

sankur@boun.edu.tr

General Information About ICASSP2000:

  Conference Management Services
3109 Westchester Avenue
College Station, TX 77845-7919
+1 409-693-6000 (Voice)
+1 409-693-66

00 (Fax)
icassp2000@cmsworldwide.com http://www.cmsworldwide.com/

 


[ Return to Main Page ]

Last Update: Tuesday, April 25, 2000 8:56 PM