To: GC who wrote (429 ) 3/11/1999 3:32:00 PM From: GC Respond to of 767
yes another article Saturday, October 17, 1998 The message is: Choose the right words More FP Technology stories By GEOF WHEELWRIGHT For The Financial Post When most people think about speech recognition, recognition of continuous dictated speech comes to mind. This is the most complex and demanding of all speech-related computer technologies, and not even the application with the widest potential benefit. Speech-related computer technologies divide up into three kinds: text-to-speech, command-recognition and dictation-style speech-recognition. A look at each of these in detail reveals the full range of uses of this technology. Text-to-speech technology was pioneered as a way of helping visually disabled people hear a spoken version of computer-generated text. Now, however, it is proving to be invaluable when people want to access computer-generated text, but are unable to read it on a screen at that moment -- for example, someone driving a car who wants to be able to check e-mail. Microsoft and Clarion have developed a system based on Microsoft's Windows CE computer operating system that enables e-mail to arrive via a wireless data connection and be read out using text-to-speech technology. Command-recognition is really a subset of speech-recognition technology, but is far more reliable, requires much less computer-processing power and is much faster than dictation-style speech-recognition. This is because command-recognition enables the system to work with a much smaller set of anticipated responses than a dictation system. The most basic form of command-recognition is used in many automatic-call-response (ACR) and voice-mail systems. These rely on anticipating only the tones that can be generated from a touch-tone telephone system, with preprogrammed responses for each unique set of tones. Voice-based command-recognition provides for only a handful of appropriate voice responses and reacts differently to each. By limiting the number of responses, the command-recognition system can allow for a much wider variation in the tone, accent, pitch and intonation of the user. Voice-command systems require little or no training of the system. This contrasts sharply with dictation systems that often require you to spend an hour or more speaking into the computer, so that it becomes trained to the sound of your voice, and, as a result, can recognize words and phrases more accurately. On a computer, command recognition can be used to start applications quickly, open new on-screen windows, even execute "macros" or mini programs that perform a set of predetermined actions on the user's behalf. When used with a Web browser, common commands such as "up," "down," "back" and "forward" can be recognized quickly and executed. Dictation-style speech-recognition. Continuous speech-recognition has long been the holy grail of speech-recognition researchers, yet it faces a number of inherent technological and sociological barriers. The technological barriers include sound dictionaries large enough to encompass the range of words typically used by most people during dictation. The system must also have enough storage and processing power that it can store the voice patterns of users and match them against appropriate words and phrases. It must be able to handle tough linguistic challenges, such as handling homonyms. In a sentence, such as "Mr. Wright makes the right choice and writes it down," the system would have to distinguish between three different uses of the same sound pattern and select the correct word based entirely on context. This is not a trivial task. Plus, the sociological issues are great. In many offices, users would find themselves feeling uncomfortable talking directly to their computers as they write reports or memos. This potentially robs them of the privacy afforded by a screen turned from prying eyes. Lastly, the notion that computers can talk suggests a disquieting autonomy for these ubiquitous and increasingly powerful devices. Who can forget the evil machinations of the HAL9000 computer in 2001: A Space Odyssey? So far, computers can only think to themselves, "Sorry, Dave. I can't do that." CANOE home | We welcome your feedback. Copyright © 1999, Canoe Limited Partnership. All rights reserved. Please click here for full copyright terms and restrictions.