| Patent Number | Title Of Patent | Date Issued |
| 7475011 | Greedy algorithm for identifying values for vocal tract resonance vectors | January 6, 2009 |
| A method and apparatus identify values for components of a vocal tract resonance vector by sequentially determining values for each component of the vocal tract resonance vector. To determine a value for a component, the other components are set to static values. A plurality of values fo |
| 7460992 | Method of pattern recognition using noise reduction uncertainty | December 2, 2008 |
| A method and apparatus are provided for using the uncertainty of a noise-removal process during pattern recognition. In particular, noise is removed from a representation of a portion of a noisy signal to produce a representation of a cleaned signal. In the meantime, an uncertainty a |
| 7454338 | Training wideband acoustic models in the cepstral domain using mixed-bandwidth training data and | November 18, 2008 |
| A method and apparatus are provided that generate values for a first set of dimensions of a feature vector from a speech signal. The values of the first set of dimensions are used to estimate values for a second set of dimensions of the feature vector to form an extended feature vector. |
| 7451083 | Removing noise from feature vectors | November 11, 2008 |
| A method and computer-readable medium are provided for identifying clean signal feature vectors from noisy signal feature vectors. One aspect of the invention includes using an iterative approach to identify the clean signal feature vector. Another aspect of the invention includes using |
| 7447630 | Method and apparatus for multi-sensory speech enhancement | November 4, 2008 |
| A method and system use an alternative sensor signal received from a sensor other than an air conduction microphone to estimate a clean speech value. The estimation uses either the alternative sensor signal alone, or in conjunction with the air conduction microphone signal. The clean |
| 7424423 | Method and apparatus for formant tracking using a residual model | September 9, 2008 |
| A method of tracking formants defines a formant search space comprising sets of formants to be searched. Formants are identified for a first frame in the speech utterance by searching the entirety of the formant search space using the codebook, and for the remaining frames by searchi |
| 7418383 | Noise robust speech recognition with a switching linear dynamic model | August 26, 2008 |
| A unified, nonlinear, non-stationary, stochastic model is disclosed for estimating and removing effects of background noise on speech cepstra. Generally stated, the model is a union of dynamic system equations for speech and noise, and a model describing how speech and noise are mixed. |
| 7409346 | Two-stage implementation for phonetic recognition using a bi-directional target-filtering model | August 5, 2008 |
| A structured generative model of speech coarticulation and reduction is described with a novel two-stage implementation. At the first stage, the dynamics of formants or vocal tract resonance (VTR) are generated using prior information of resonance targets in the phone sequence. Bi- |
| 7406416 | Representation of a deleted interpolation N-gram language model in ARPA standard format | July 29, 2008 |
| A method and apparatus are provided for storing parameters of a deleted interpolation language model as parameters of a backoff language model. In particular, the parameters of the deleted interpolation language model are stored in the standard ARPA format. Under one embodiment, the dele |
| 7403894 | Annotating programs for automatic summary generations | July 22, 2008 |
| Audio/video programming content is made available to a receiver from a content provider, and meta data is made available to the receiver from a meta data provider. The meta data corresponds to the programming content, and identifies, for each of multiple portions of the programming conte |
| 7383181 | Multi-sensory speech detection system | June 3, 2008 |
| The present invention combines a conventional audio microphone with an additional speech sensor that provides a speech sensor signal based on an input. The speech sensor signal is generated based on an action undertaken by a speaker during speech, such as facial movement, bone vibrat |
| 7379867 | Discriminative training of language models for text and speech classification | May 27, 2008 |
| Methods are disclosed for estimating language models such that the conditional likelihood of a class given a word string, which is very well correlated with classification accuracy, is maximized. The methods comprise tuning statistical language model parameters jointly for all classe |
| 7363224 | Method for entering text | April 22, 2008 |
| In a method of entering text into a device, a first character input is provided that is indicative of a first character of a text entry. Next, a vocalization of the text entry is captured. A probable word candidate is then identified for a first word of the vocalization based upon the fir |
| 7363221 | Method of noise reduction using instantaneous signal-to-noise ratio as the principal quantity fo | April 22, 2008 |
| A system and method are provided that accurately estimate noise and that reduce noise in pattern recognition signals. The method and system define a mapping random variable as a function of at least a clean signal random variable and a noise random variable. A model parameter that descri |
| 7346504 | Multi-sensory speech enhancement using a clean speech prior | March 18, 2008 |
| A method and apparatus determine a channel response for an alternative sensor using an alternative sensor signal and an air conduction microphone signal. The channel response and a prior probability distribution for clean speech values are then used to estimate a clean speech value. |
| 7328147 | Automatic resolution of segmentation ambiguities in grammar authoring | February 5, 2008 |
| A rules-based grammar is generated. Segmentation ambiguities are identified in training data. Rewrite rules for the ambiguous segmentations are enumerated and probabilities are generated for each. Ambiguities are resolved based on the probabilities. In one embodiment, this is done by |
| 7310599 | Removing noise from feature vectors | December 18, 2007 |
| A method and computer-readable medium are provided for identifying clean signal feature vectors from noisy signal feature vectors. Aspects of the invention use mixtures of distributions of noise feature vectors and/or channel distortion feature vectors when identifying the clean signal |
| 7289956 | System and method for user modeling to enhance named entity recognition | October 30, 2007 |
| The present invention employs user modeling to model a user's behavior patterns. The user's behavior patterns are then used to influence named entity (NE) recognition. |
| 7289955 | Method of determining uncertainty associated with acoustic distortion-based noise reduction | October 30, 2007 |
| A method and apparatus are provided for determining uncertainty in noise reduction based on a parametric model of speech distortion. The method is first used to reduce noise in a noisy signal. In particular, noise is reduced from a representation of a portion of a noisy signal to produce |
| 7266494 | Method and apparatus for identifying noise environments from noisy signals | September 4, 2007 |
| A method and apparatus are provided for identifying a noise environment for a frame of an input signal based on at least one feature for that frame. To identify the noise environment, a probability for a noise environment is determined by applying the noisy input feature vector to a |
| 7254536 | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic | August 7, 2007 |
| A method and apparatus are provided for reducing noise in a training signal and/or test signal. The noise reduction technique uses a stereo signal formed of two channel signals, each channel containing the same pattern signal. One of the channel signals is "clean" and the other inclu |
| 7206741 | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes | April 17, 2007 |
| A speech signal is decoded by determining a production-related value for a current state based on an optimal production-related value at the end of a preceding state, the optimal production-related value being selected from a set of continuous values. The production-related value is used |
| 7200557 | Method of reducing index sizes used to represent spectral content vectors | April 3, 2007 |
| A method identifies a codeword to represent a vector derived from an audio signal by applying the vector to first and second decision trees. The first decision tree is associated with a first type of audio sound and produces a first codeword. The second decision tree is associated with a |
| 7181390 | Noise reduction using correction vectors based on dynamic aspects of speech and noise normalizat | February 20, 2007 |
| A method and apparatus are provided for reducing noise in a signal. Under one aspect of the invention, a correction vector is selected based on a noisy feature vector that represents a noisy signal. The selected correction vector incorporates dynamic aspects of pattern signals. The s |
| 7174292 | Method of determining uncertainty associated with acoustic distortion-based noise reduction | February 6, 2007 |
| A method and apparatus are provided for determining uncertainty in noise reduction based on a parametric model of speech distortion. The method is first used to reduce noise in a noisy signal. In particular, noise is reduced from a representation of a portion of a noisy signal to produce |
| 7165026 | Method of noise estimation using incremental Bayes learning | January 16, 2007 |
| A method and apparatus estimate additive noise in a noisy signal using incremental Bayes learning, where a time-varying noise prior distribution is assumed and hyperparameters (mean and variance) are updated recursively using an approximation for the posterior computed at the preceding t |
| 7139703 | Method of iterative noise estimation in a recursive framework | November 21, 2006 |
| A method and apparatus estimate additive noise in a noisy signal using an iterative technique within a recursive framework. In particular, the noisy signal is divided into frames and the noise in each frame is determined based on the noise in another frame and the noise determined in |
| 7117153 | Method and apparatus for predicting word error rates from text | October 3, 2006 |
| A method of modeling a speech recognition system includes decoding a speech signal produced from a training text to produce a sequence of predicted speech units. The training text comprises a sequence of actual speech units that is used with the sequence of predicted speech units to |
| 7117148 | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise | October 3, 2006 |
| A method and apparatus are provided for reducing noise in a signal. Under one aspect of the invention, a correction vector is selected based on a noisy feature vector that represents a noisy signal. The selected correction vector incorporates dynamic aspects of pattern signals. The s |
| 7107210 | Method of noise reduction based on dynamic aspects of speech | September 12, 2006 |
| A system and method are provided that reduce noise in pattern recognition signals. To do this, embodiments of the present invention utilize a prior model of dynamic aspects of clean speech together with one or both of a prior model of static aspects of clean speech, and an acoustic model |
| 7103544 | Method and apparatus for predicting word error rates from text | September 5, 2006 |
| A method of modeling a speech recognition system includes decoding a speech signal produced from a training text to produce a sequence of predicted speech units. The training text comprises a sequence of actual speech units that is used with the sequence of predicted speech units to |
| 7103540 | Method of pattern recognition using noise reduction uncertainty | September 5, 2006 |
| A method and apparatus are provided for using the uncertainty of a noise-removal process during pattern recognition. In particular, noise is removed from a representation of a portion of a noisy signal to produce a representation of a cleaned signal. In the meantime, an uncertainty a |
| 7080004 | Grammar authoring system | July 18, 2006 |
| A grammar authoring system uses multiple sources of information to aid grammar authoring. This produces a semantic grammar derived semi-automatically with a relatively small amount of data. |
| 7050975 | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes | May 23, 2006 |
| A method of speech recognition is provided that identifies a production-related dynamics value by performing a linear interpolation between a production-related dynamics value at a previous time and a production-related target using a time-dependent interpolation weight. The hidden p |
| 7047189 | Sound source separation using convolutional mixing and a priori sound source knowledge | May 16, 2006 |
| Sound source separation, without permutation, using convolutional mixing independent component analysis based on a priori knowledge of the target sound source is disclosed. The target sound source can be a human speaker. The reconstruction filters used in the sound source separation |
| 7047047 | Non-linear observation model for removing noise from corrupted signals | May 16, 2006 |
| A new statistical model describes the corruption of spectral features caused by additive noise. In particular, the model explicitly represents the effect of unknown phase together with the unobserved clean signal and noise. Development of the model has realized three techniques for r |
| 7028325 | Annotating programs for automatic summary generation | April 11, 2006 |
| Audio/video programming content is made available to a receiver from a content provider, and meta data is made available to the receiver from a meta data provider. The meta data corresponds to the programming content, and identifies, for each of multiple portions of the programming conte |
| 7003455 | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic | February 21, 2006 |
| A method and apparatus are provided for reducing noise in a training signal and/or test signal. The noise reduction technique uses a stereo signal formed of two channel signals, each channel containing the same pattern signal. One of the channel signals is "clean" and the other inclu |
| 6990447 | Method and apparatus for denoising and deverberation using variational inference and strong spee | January 24, 2006 |
| A probability distribution for speech model parameters, such as auto-regression parameters, is used to identify a distribution of denoised values from a noisy signal. Under one embodiment, the probability distributions of the speech model parameters and the denoised values are adjust |
| 6985858 | Method and apparatus for removing noise from feature vectors | January 10, 2006 |
| A method and computer-readable medium are provided for identifying clean signal feature vectors from noisy signal feature vectors. The method is based on variational inference techniques. One aspect of the invention includes using an iterative approach to identify the clean signal featur |
| 6959276 | Including the category of environmental noise when processing speech signals | October 25, 2005 |
| A method and apparatus are provided for identifying a noise environment for a frame of an input signal based on at least one feature for that frame. Under one embodiment, the noise environment is identified by determining the probability of each of a set of possible noise environments. F |
| 6944590 | Method of iterative noise estimation in a recursive framework | September 13, 2005 |
| A method and apparatus estimate additive noise in a noisy signal using an iterative technique within a recursive framework. In particular, the noisy signal is divided into frames and the noise in each frame is determined based on the noise in another frame and the noise determined in a p |
| 6879952 | Sound source separation using convolutional mixing and a priori sound source knowledge | April 12, 2005 |
| Sound source separation, without permutation, using convolutional mixing independent component analysis based on a priori knowledge of the target sound source is disclosed. The target sound source can be a human speaker. The reconstruction filters used in the sound source separation take |