SUMMARY OF SPEECH SIGNAL PROCESSING
PROGRAM
ELECTRICAL & COMPUTER ENGINEERING
DEPARTMENT
CLEMSON UNIVERSITY
DR. JOHN N. GOWDY
INDEX
- PUBLICATIONS ON SPEECH PROCESSING
- PUBLICATIONS ON OTHER DIGITAL SIGNAL PROCESSING
TOPICS
- SPEECH PROCESSING LAB
- FORMER GRADUATES AND AREAS OF GRADUATE RESEARCH
- SENIOR HONORS THESIS ADVISEES
PUBLICATIONS ON SPEECH PROCESSING
- Raghunandan Kumaran, Karthik Narayanan, John Gowdy, "Myoelectric Signals for Multimodal Speech
Recognition",
INTERPEECH (Eurospeech) 2005, Lisboa, Portugal, September 2005 .
- Raghunandan Kumaran, Karthik Narayanan, John Gowdy, " Language Modeling Using Independent Component
Analysis For Automatic Speech Recognition", Eurpoean Signal
Processing Conference, EUSIPCO 2005, Antalya, Turkey.
- S. Amarnag, Gowdy J.N., C. Bartels and J. Bilmes, "DBN based Multi-Stream Models for
Audio-Visual Speech Recognition", To Appear in Proceeding of IEEE
International Conference on Acoustics, Speech and Signal Processing,
Montreal, 2004.
- S. Amarnag, R. Kumaran., and Gowdy, J. N., "Real Time Eye Traking for Human Computer
Interfaces", Proceedings of the IEEE International Conferance on
Multimedia and Expo, Baltimore, July 2003.
- Patterson, E. K., Gurbuz, S., Tufekci, Z., and Gowdy, J. N., "Moving-Talker, Speaker-Independent
Feature Study and Baseline Results Using the CUAVE Multimodal
Speech Corpus," accepted for publication by the EURASIP Journal on
Applied Signal Processing. To Appear in 2003.
- E.K. Patterson, S. Gurbuz, Z. Tufekci, and J.N. Gowdy, "CUAVE: A New Audio-Visual Database for
Multimodal Human-Computer Interface Research," Proceedings of the
IEEE International Conference on Acoustics, Speech, and Signal
Processing, Orlando, May 2002.
- S. Gurbuz, E. K. Patterson, Z. Tufekci and Gowdy, J.N., "Multi-Stream Product Modal
Audio-Visual Integration Strategy for Robust Adaptive Speech
Recognition", IEEE International Conference on Acoustics, Speech and
Signal Processing, Orlando, May 2002.
- Gurbuz, S., Tufekci, Z., Patterson, E. K., and Gowdy, J. N.,
"Application of Affine-Invariant Fourier Descriptors to Lipreading for
Audio-Visual Speech Recognition," Proceedings of the International
Conference on Acoustics, Speech, and Signal Processing, Salt Lake City,
May 2001.
- S. Gurbuz, Patterson, E. K., Tufekci, Z., and Gowdy, J. N.,
"Affine-Invariant Visual Features Contain Supplementary Information to
Enhance Speech Recognition," Proceedings of the International
Conference on Audio-and-Video-Based Biometric Person Authentication,
Sweden, 2001.
- Patterson, E. K., Gurbuz, S., Tufekci, Z., and Gowdy, J. N.,
"Noise-Based Audio-Visual Fusion for Robust Speech Recognition,"
Proceedings of the International Conference on Auditory-Visual Speech
Processing, Scheelsminde, Denmark, September 2001.
- Tufekci, Z., Gowdy, J. N., Gurbuz, S., and Patterson, E. K.,
"Applying Parallel Model Combination with Mel-Scaled Discrete Wavelet
Coefficients for Noise-Robust Speech Recognition," Proceedings of
Eurospeech 2001, Aalborg, Denmark, September 2001.
- Gurbuz, S., Patterson, E. K., Tufekci, Z., and Gowdy, J. N.,
"Lip-Reading from Parametric Lip Contours for Audio-Visual Speech
Recognition," Proceedings of Eurospeech 2001, Aalborg, Denmark,
September 2001.
- Gowdy, J. N., and Tufekci, Z., "Mel-Scaled Discrete Wavelet
Coefficients for Speech Recognition," Proceedings of the 2000 IEEE
International Conference on Acoustics, Speech, and Signal Processing,
Istanbul, Turkey, (June 2000).
- Patterson, E. K., Gowdy, J. N., and Wu, D.,"Multi-Platform CBI
Tools Using Linux and Java-Based Solutions for Distance Learning,"
Proceedings of the International Conference on Acoustics, Speech, and
Signal Processing, Seattle, May 1998.
- Scordilis, M.S. and Gowdy, J. N., "A Neural Network-based Control
Strategy or a Speech Formant Synthesizer," Journal of Artificial Neural
Networks, to appear in early 1995.
- Wu, Duanpei, and Gowdy, J. N., "Tunable Time Delay Neural
Networks for Isolated Word Recognition," Proceedings of the 1994
International Symposium on Speech, Image Processing, and Neural
Networks, Hong Kong, April, 1994
- Bryant, Benjamin D. and Gowdy, J. N., "A Comparison of Feature
Representations for Speaker-Independent Voiced-Stop-Consonant
Recognition," Proceedings of the 1993 IEEE Southeastcon, Charlotte, NC,
April 1993.
- Wu, Duanpei and Gowdy, J. N., "Time-Frequency-Energy
Representation Based Real-Time Based Speech Recognition," Proceedings
of the 1993 IEEE Southeastcon, Charlotte, NC, April 1993.
- Berger, G. L. and Gowdy, J. N., "TDNN Based Speaker
Identification," Proceedings of the Twenty-Fifth Southeastern Symposium
on System Theory, Tuscaloosa, Alabama, March 1993.
- Bryant, B. D. and Gowdy, J. N., "Speaker-Independent
Voiced-Stop-Consonent Recognition Using a Block-Windowed Neural Network
Architecture," Proceedings of the Twenty-Fifth Southeastern Symposium
on System Theory, Tuscaloosa, Alabama, March 1993.
- Bryant, Benjamin D. and Gowdy, J. N., "Simulation of Stages I and
II of Seneff's Auditory Model (SAM) using MATLAB," Proceedings of the
National Matlab Conference, Boston, MA, October 1993.
- Easwaran, S. and Gowdy, J. N., " An Improved Algorithm for Use
with the K-Means Algorithm for Codebook Generation," Proceedings of the
1992 IEEE Southeastcon, Birmingham, AL, April 1992.
- Neelakantan, V., and Gowdy, J. N., "A Comparative Study of Using
Different Speech Parameters in the Design of a Discrete Hidden Markov
Model, Proceedings of the 1992 IEEE Southeastcon, Birmingham, AL, April
1992.
- Weber, A. C. and Gowdy, J. N. "Design and Simulation of a Speaker
Recognition System, Proceedings of the 1991 Southeastern Symposium on
System Theory, Columbia, SC, April 1991.
- Scordilis, M. J., and Gowdy, J. N., "Speech Synthesis of Phonemic
Triplets Through a Neural Network-Controlled Formant Synthesizer,"
Proceedings of the International Joint Conference on Neural Networks
1991, Seattle, WA, July 1991.
- Scordilis, M. J., and Gowdy, J. N. "Neural Network Control for a
Cascade/Parallel Formant Synthesizer", Proceedings of the 1990 IEEE
International Conference on Acoustics, Speech, and Signal Processing,
Albuquerque, NM, April 1990.
- Neelakantan, N. and Gowdy, J. N., "A Study of the HMM for
Speaker- Independent Isolated Word Recognition," Proceedings of the
1990 Southeastcon, New Orleans, LA, March 1990.
- Kepuska, V. Z. and Gowdy, J. N., "On the Effect of Topological
Structure of the Kohonen Network on the Performance of a Hierarchical
Two Layered Isolated Word Recognition System," Proceedings of the 1990
Southeastcon, New Orleans, LA, March 1990.
- Scordilis, M. S. and Gowdy, J. N., "Effects of the Vocal Tract
Shape on the Spectral Tilt of the Glottal Pulse Waveform," Proceedings
of the 1990 Southeastcon, New Orleans, LA, March 1990.
- Kepuska, V.Z. and Gowdy, J. N., "Investigation of Phonemic
Context in Speech Using Self-Organizing Maps," Proceedings of the 1989
IEEE International Conference on Acoustics, Speech, and Signal
Processing, Glasgow, Scotland, May 1989.
- Scordilis, M. J., and Gowdy, J. N. "Neural Network Generation of
Fundamental Frequency Contours", Proceedings of the 1989 IEEE
International Conference on Acoustics, Speech, and Signal Processing,
Glasgow, Scotland, May 1989.
- Scordilis, M. S. and Gowdy, J. N., "Text Processing for Speech
Synthesis Using Parallel Distributed Models," Proceedings of the 1989
Southeastcon, Columbia, SC, April 1989.
- Kepuska, V. Z. and Gowdy, J. N., "Phonemic Speech Recognition
System Based on a Neural Network," Proceedings of the 1989
Southeastcon, Columbia, SC, April 1989.
- Ward, R. M. and Gowdy, J. N., "An Investigation of Speaker
Verification Accuracy Using Fundamental Frequency and Duration as
Distinguishing Features," Proceedings of the 21st Southeastern
Symposium on System Theory, Tallahassee, Florida, March 1989.
- Sethuraman, R. and Gowdy, J. N., "A Cepstral Based Speaker
Recognition System," Proceedings of the 21st Southeastern Symposium on
System Theory, Tallahassee, Florida, March 1989.
- Kepuska, V. Z., Easwaran, S., and Gowdy, J. N., "Evaluation of
Digital Signal Processing Chips for Speech Processing Applications,"
Proceedings of the l987 Southeastern Symposium on System Theory,
Clemson, SC, March 1987.
- Gowdy, J. N., "Voice I/O for the Personal Computer," Final
Report, NCR Contract, June l986.
- Scordilis, M. S., and Gowdy, J. N., "Comparison of Computerized
Speech Synthesis Techniques," Proceedings of the 1986 IEEE
Southeastcon, Richmond, VA, March 1986.
- Vogel, K. R. and J. N. Gowdy, "Microprocessor Implementation of a
Near Real-Time Speech Recognition System," Proceedings of the l984 IEEE
Southeastcon, Louisville, KY, April l984.
- Rochester, L. R., Gowdy, J. N., and Bryan, J. K., "Comparison of
Speech Recognition Systems Based on FFT's and Zero-Crossings,"
Proceedings of the l979 Southeastcon, Roanoke, VA, April l979.
- Gupta, V. N., Bryan, J. K., and Gowdy, J. N.,
"Speaker-Independent Vowel Identification in Continuous Speech,"
Proceedings of the l978 IEEE International Conference on Acoustics,
Speech, and Signal Processing, Tulsa, OK, April l978.
- Gupta, V. N., Bryan, J. K., and Gowdy, J. N., "A
Speaker-Independent Speech-Recognition System Based on Linear
Prediction," IEEE Transactions on Acoustics, Speech, and Signal
Processing, vol. ASSP-26, no. 1, February l978.
- Gupta, V. N., Gowdy, J. N., and Bryan, J. K., "Evaluation of Some
Distance Measures for Speaker-Independent Isolated Word Recognition,"
Proceedings of the l977 IEEE International Conference on Acoustics,
Speech, and Signal Processing, Hartford, CT, May l977.
- Gupta, V. N., Bryan, J. K., and Gowdy, J. N., "Application of a
Combined Nearest-Neighbor and K-Nearest-Neighbor Rule in a Speech
Recognition System," Proceedings of the l977 Southeastcon,
Williamsburg, VA, April l977.
- Thaker, G. H., and Gowdy, J. N., "Comparison of Fast Fourier and
Walsh Transform Methods in Speech Recognition Systems," Proceedings of
the l977 Southeastcon, Williamsburg, VA, April l977.
- Gupta, V. N., Gowdy, J. N., and Bryan, J. K., "Evaluation of Some
Distance Measures for Computerized Speech Recognition," Proceedings of
the l977 Southeastcon, Williamsburg, VA, April l977.
- Gowdy, J. N., and Hinson, J. R., "A Computerized Reading
Evaluation System," Proceedings of the l976 Southeastcon, Clemson, SC,
April l976.
PUBLICATIONS ON OTHER DIGITAL SIGNAL PROCESSING TOPICS
- Karthik Narayanan, Raghunandan Kumaran, John Gowdy, " Stereo Based Elliptical Head Tracking ",
Eurpoean Signal Processing Conference, EUSIPCO 2005, Antalya, Turkey.
- Little, S. H., and Gowdy, J. N., "Evaluation of Computerized
Digital Filter Design Techniques," Proceedings of the 1986 IEEE
Southeastcon, Richmond, VA, March 1986.
- Brown, B. D., Gowdy, J. N., and Larson, V. D., "A System for the
Acquisition and Analysis of EEG Signals Evoked by Audio Stimuli,"
Proceedings of the l985 IEEE Southeastcon, Raleigh, NC, April l985.
- McCourry, T. L. and J. N. Gowdy, "Comparison of Several Digital
Filter Design Methods," Proceedings of the l984 IEEE Southeastcon,
Louisville, KY, April l984.
- Queen, G. W., Bryan, J. K., and Gowdy, J. N., "Improved Technique
for Image Data Compression, Proceedings of the l983 Southeastern
Sympos- ium on System Theory, Huntsville, AL, March l983.
- Queen, G. W., Gowdy, J. N., and Bryan, J. K., "Efficient
Implementation of the Karhunen-Loeve Transform for Processing Image
Data, Using the Power Method for the Eigenvalues," Proceedings of the
1982 IEEE Southeastcon, Destin, FL, April l982.
- Queen, G. W., Bryan, J. K., and Gowdy, J. N., "Investigation of
Several Discrete Transform Methods for Image Processing Applications,
Proceedings of the 1981 IEEE Southeastcon, Huntsville, AL, April l981.
- Bell, D. M., and Gowdy, J. N., "Power Spectral Estimation via
Nonlinear Frequency Warping," IEEE Transactions on Acoustics, Speech,
and Signal Processing, vol. ASSP-26, no. 5, October l978.
- Gaffney, B. P., and Gowdy, J. N., "An Algorithm to Evaluate the
L=83 Norm for Some Common Filters," IEEE Transactions on Acoustics,
Speech, and Signal Processing, vol. ASSP-25, no. 2, April l977.
- Gaffney, B. P., and Gowdy, J. N., "Design of Implementation
Configurations for Low Pass Chebyshev and Butterworth Digital Filters,"
Proceedings of the l975 Southeastcon, Charlotte, NC, April 1975.
- Brubaker, T. A., and Gowdy, J. N., "Computation of the Discrete
Auto- covariance, "International Journal of Electronics, vol. 37, no.
4, Oct. l974.
- Gowdy, J. N., and Hadstate, J. E., "Design of Optimum
Configurations of Digital Filters," Proceedings of the l973
Southeastcon, Louisville, KY, May l973.
- Brubaker, T. A., and Gowdy, J. N., "Limit Cycles in Digital
Filters," IEEE Transactions on Automatic Control, vol. AC-17, no. 5,
October l972.
SPEECH PROCESSING LAB IN E&CE DEPARTMENT
- KAY SONAGRAPH SYSTEM.
- LARYNGOGRAPH .
- SUN WORK STATIONS.
- SOUNDPROOF ISOLATION CHAMBER .
- SUN SPARC SYSTEMS
-
FORMER GRADUATES AND AREAS OF GRADUATE RESEARCH
Speech Recognition
Ph.D.
- Kepuska, V. Z.. "Neural Networks for Speech Recognition
Applications," 1990.
- Easwaran, S., "Investigations on Discrete-Symbol HIdden-Markov
Model Based Isolated-Word Speech Recognition," 1992.
- Neelakantan, V., "Hidden Markov Model-Based Speech
Recognition," December 1993
- Tufekci Z., " Mel-Scaled Discrete Wavelet Coefficients for
Noise-Robust Speech Recognition", December 2001
- Patterson Eric, "Audio Visual Speech Recognition", May
2002.
- Gurbuz S., "Robust and Efficient Parameters for Audio Visual
Speech Recognition", August 2002.
M.S.
- Vogel, K. R., "Design and Implementation of a Speech
Recognition System," May l984, MS.
- Christ, J. F., "Speech Recognition Using an Analog
Microprocessor," December 1981, MS.
- Li, D. T.-P., "Speech Pattern Recognition," May 1981, MS.
- Thaker, G. H., "Comparison of Fourier and Walsh Transforms in
Speech Recognition," August l977, MS.
- Hinson, J. R., "Implementation of a Computerized Reading
Evaluation System," August l974, MS.
Speaker Recognition/Verification
M.S.
- Berger, Gary L., "Neural Network-Based Speaker Identification,"
December 1993, MS.
- Weber, Alene, (MS), "A Real-Time, Text-Independent Speaker
Recognition System", 1990.
- Sethuraman, Radhakrishnan, "A Cepstral Based Speaker
Recognition System," May 1989, MS.
- Ward, Robert, "An Investigation of Speaker Verification
Accuracy Using Fundamental Frequency and Duration as Distinguishing
Features," (Engineering Report), December 1988, MS.
Speech Synthesis
Ph.D.
- Scordilis, M., "A Neural Network Controlled Formant Synthesizer
with Phoneme-Dependent Voicing, 1990.
M.S.
- Scordilis, M, "An Investigation into Digital Speech Synthesis,"
May l986, MS.
Other Digital Signal Processing Topics
Ph.D.
- Bell, D. M., "Digital Signal Processing for Textile
Irregularity Analysis", 1978.
- Gaffney, B. P. M., "An Analysis of Roundoff Noise in Fixed
Point Cascade Digital Filters Designed Using the Bilinear
Transformation," 1976.
M.S.
- Hosangadi, Guradutt, "Digital Signal Processing for Textile
Quality Measurement," May 1994, MS.
- Macemon, R. W., "Automatic Testing and Upgrading of Digital to
Analog Converters," (Engineering Report), August l985, MENGR.
- Brown, B. D., "A System for the Acquisition and Analysis of the
Frequency Following Response in Audiology Research," August 1984,
MS.
- McCourry, T. L., "A Comparison of Digital Filter Design
Techniques," December l983, MS.
- Lu, W.-P., "An 8086 Multiprocessor System for Digital Filtering
Application," May l981, MS.
- Lo, R. S. B., "Comparison of Several Computer-Aided Design
Techniques for Low-Pass Digital Filters," August 1977, MS.
- Tsai, S., "Analysis of Several Methods for Evaluating the
Quantization Error Propagation Functions for Digital Filters,"
August l977, MS.
- Wood, R. J., "Textile Irregularity Analysis Using the Variance
Length Curve with Real Time Digital Signal Processing," May l977,
MS.
- Bell, D. M., "The Determination of the Variance-Length Curve
for Textile Yarn Using a Real-Time Computer, May 1974, MS.
SENIOR HONORS THESIS ADVISEES
- Theobald, Brian, "Music and Speech Processing using the
Motorola DSP56000 Digital Signal Processing Chip," May 1993.
- Bowyer, Stephen, "Speech Recognition Using the Siemens Speech
Recognition Board," May 1994.
- Anderson, David L., "Development of a Digital Signal Processing
Laboratory with Emphasis on Speech Signal Processing," May 1994.
John N. Gowdy --john.gowdy@ces.clemson.edu
803 656-5249 telephone -- 803 656-5910 fax