Far-field Microphone Processing: Voice Signal Solutions
FAR FIELD VOICE PROCESSING
Speech recognition performance degrades drastically under noisy and reverberate environments. As in any home, office, or even outdoor application, sound is all around us. The greater distance a speaker is from a microphone, the greater the level of distortion with the addition of the ambient noise streams. Background noises, such as a running dishwasher, television set, children playing, dogs barking, need to be removed from the sound stream so that the keyword can be distinguished from other speech signals by the application.
HD AEC Clear Speech Solution
Adaptive Digital uses certain algorithms that recognize the dominant voice and suppress background chatter noise.
Far-field Voice Input Processing software first detects far-field speech, then reduces the clutter in the voice application can send a clear voice signal, or distinguish a wake-word from other noise sources.
For certain environments, a microphone array may be employed for voice capture. In a microphone array, a number of microphones can be arranged in either a circular, or linear pattern and used to pick up speech signals via phase steering. Essentially, the microphones, while not physically pointing in any specific direction will point acoustically in one or many directions. When a voice command emanates from a particular direction, the clutter noise on the periphery of that direction is either reduced or not picked up by the microphone array.
The number of microphones and the distance between them in the array will affect the accuracy, frequency and direction of the directional beam.
Adaptive Digtal offers TMS320C5517 and TMS320C6748 Clear Speech Solution with Acoustic Beamforming for high processor performance applications.
The process of cleaning up the sound stream is done through the implementation of noise reduction/suppression of any noise that is not voice.
The difference in location to the microphone will affect the intensity of the voice signal, and as with any human element such as speech, there are many differentiations of intensity, deep or high pitched, soft or loud in volume. A gain level adjusting algorithm is applied to the voice signal to adjust the signal to a consistent level no matter the intensity level of the original voice stream.
The clean and enhanced speech signal can then be recognized by the application, allowing speech detection/recognition to take place.
The future of voice recognition technologies will lie in the detection of inflection and emotion. Adaptive Digital’s clear speech algorithms will aid in the advancement of these technologies.
Other applications for the Adaptive Digital HD AEC clear voice solution include multiple microphone video conferencing systems, soft phones, bluetooth speakers, IP cameras, automobile cabins, USB headsets, voice Command and control applications (voice enabled Smart Speaker, voice enabled Smart Gateway) and security to name a few.