Silicon Investor (SI) -- The First Internet Community

STOCKTALK

We've detected that you're using an ad content blocking browser plug-in or feature. Ads provide a critical source of revenue to the continued operation of Silicon Investor. We ask that you disable ad blocking while on Silicon Investor in the best interests of our community. If you are not using an ad blocker but are still receiving this message, make sure your browser's tracking protection is set to the 'standard' level.

Technology Stocks : LHSP: Lernout En Hauspie -- Ignore unavailable to you. Want to Upgrade?

To: Dave Doriguzzi who wrote (1290)	7/8/1998 7:51:00 AM
From: Dr. Bob	Read Replies (1) \| Respond to of 2467

Dave, To amplify your remarks - Having MSFT license your product is not always the kiss of death - Java is an example of that, where MSFT tried very hard to come up with an alternative, but the market continues to choose Java, not Active-X. IF MSFT can come up with a "generic equivalent" it no doubt will, but as you and I have pointed out, LHSPF's lead, especially in the translation aspect, is a very high hurdle for anyone to overcome. As a part-owner of LHSPF, MSFT has at least some reason to partner with rather than bury LHSPF, and certainly recognizes that the market is well aware of this cannabalistic tendency, choosing ANY one other than MSFT when it can - thus making LHSPF more attractive as an arms-length partner than as a division of MSFT. Bob

To: Dave Doriguzzi who wrote (1290)	7/9/1998 1:47:00 AM
From: Goolie2	Respond to of 2467

Dave: First of all, thanks for the great post. I think it is a good policy not to get complacent on an investment. I too have a large core position in L&H and also will trade additional shares at times. In fact I had just sold my trading shares at $61 a few days ago and was ready to buy below $57 but the money got spent on other positions. I agree that L&H has been positioning itself to dominate certain segments of Speech tech and translation, and it is clearly more than a one trick pony. I also agree that I think that Speech tech will drive the next wave of PC demmand, but I think it is still a year or two away, and I think the mania in the internet stocks and bandwidth constraints have to be dealt with first. When we get to the other side of bandwidth constraints, I think you get massive acceleration in the development and rollout of speech tech worldwide, and then you will see L&H really do some tricks. I'm just trying to stay confident and long until then. Regards, Lou

To: Dave Doriguzzi who wrote (1290)	7/12/1998 7:37:00 AM
From: Dave Doriguzzi	Read Replies (1) \| Respond to of 2467

A few articles of interest: STMicro chips will include biometrics Saint Genis, France - STMicroelectronics is to use biometric verification technology from Keyware Technologies NV (Zaventum, Belgium) in forthcoming video and audio DSP chips intended for use in automotive, computer and telephony applications. Keyware combines several biometric techniques, which use a variety of a person's physical characteristics, in a control system called Layered Biometric Verification (LBV) security server. The audio DSP, a voice signal processing chip, will also include voice-recognition software based on technology developed by Lernout & Hauspie Speech Products NV (Ieper, Belgium), licensed to STM in 1997 and used for in-car applications. STM expects to offer evaluation samples of the first circuits with LBV technology in the fourth quarter of 1998, with volume production due to start in the second quarter of 1999. Copyright r 1998 CMP Media Inc. Business Is Talking To Its PCs Paula Rooney Waltham, Mass.-With the exception of 3-D Computer-Aided Design (CAD) engineering titles, real-time data-gathering applications and high-end games, few software applications harness the white-hot computational power of 400MHz Pentium II PCs and other high-end systems. This is particularly true for business software applications, and even leading software publishers, such as Symantec, concede the point. "There are very few mission-critical apps that require that kind of speed," said Jeff Cable, director of worldwide sales development for the Cupertino, Calif.-based software company. Not that it hurts. Cable said faster speed will mean quicker searches on Symantec's Act! database, for instance, and faster screen redraw for any application, but those performance improvements also depend on other factors, such as video transfer rates and Internet access speeds. There is, however, one exception to the no-need-for-speed rule in the business category: voice-recognition software. Analysts said major improvements over the last two years in voice-recognition titles like IBM's ViaVoice, Lernout & Hauspie's VoiceXPress and Dragon Systems' Naturally Speaking are directly attributable to the explosion in PC megahertz. "Voice-recognition software really uses this, because the hardware requirements are staggering," said Jeff Tarter, editor of Soft-letter, an industry newsletter based in Watertown, Mass. "The more powerful the PC, the better it works. They all ride the power curve." Over the past year, for example, developers of voice-recognition software have made dramatic improvements in accuracy and dictation speeds. Users once balked at "discrete" speech-recognition offerings-which required users to pause unnaturally between words-but most titles now give users reasonably accurate continuous-speech features. With such improvements, the category has become a "hundred-million-dollar business," Tarter said, noting that such titles once had high return rates because they didn't work as well as advertised. "Everyone thought this was 'Star Trek' quality," he said. But that's changing as more powerful PCs hit retail shelves. Microsoft and other software companies are preparing voice interfaces for Windows and all applications, with the goal of eventually replacing the ubiquitous graphical user interface. Symantec's Cable noted that for now, software developers are targeting application releases for the larger installed base of consumers with 486- or Pentium-based machines. "You have to develop applications that take advantage of those speeds, but you can't limit market opportunities," he said. "[The 400MHz PC base] is the bleeding-edge crowd." Copyright r 1998 CMP Media Inc. At Your Command -- Interactive Agents can work animation magic-if you have enough bandwidth. Martin Heller Remember this classic exchange from the science-fiction masterpiece 2,001: A Space Odyssey: "Open the pod bay doors, HAL." "I can't do that, Dave." Okay, let's stop right there; that was a bad example. But I wanted you to understand what it would be like working with a Microsoft interactive software Agent. Perhaps my mind is wandering to a different type of agent. Let's try again. You'll need a decent Web connection for this, plus a Windows 95 or NT system with a sound card, speakers and (optionally) a microphone. Using IE 3.0 or later with security set to medium, browse to my Agent demo at winmag.com and walk through the demo. By the way, if you don't have enough bandwidth to download the megabytes of required Agent software data, you can access Microsoft's Agent demo and Software Development Kit (SDK) from the Site Builder Network Web Snapshot CD-ROM (for more information on obtaining the CD, see microsoft.com. Once you've installed the software and run the Microsoft demo from the CD, my demo will load relatively quickly over the Web. Agents' abilities So what's it all about? Basically, the Agent provides a conversational user interface, which is normally employed to enhance, rather than replace, the usual Windows graphical interface. An Agent animation runs in its own window, typically a borderless one with a transparent background so that the Agent appears to hover over the Desktop. In many ways, the Microsoft Agent is a logical extension of Office 97's animated Assistant. What can an Agent do? Each of the three characters-Genie (pictured in the sidebar "The Magic Agent"), Merlin and Robbie-has a palette of animations, a synthesized voice, the ability to synchronize its movements with recorded voices, and the ability to respond to mouse clicks and voice commands. An Agent's functionality comes partly from the Microsoft Agent service provider, partly from additional services accessed through the provider and partly from the Agent's character definition file(s). The service provider includes an OLE automation server application (AGENTSVR.EXE), as well as a file and docfile provider (AGENTDPV.DLL). It also relies on Speech API-compliant engines for its text-to-speech and voice recognition capabilities. An Agent control (AGENTCTL.DLL) simplifies the programming interface and lets you use Agents from scripting languages. Microsoft has made its Command and Control speech recognition engine, the Agent software and Lernout & Hauspie Speech Products' TruVoice Text-To-Speech engine easy and free to use on a Web page; simply include tags that point to URLs on Microsoft's site. Microsoft has also made the three stock characters available for use over the Web by pointing the Agent control's Load method to the appropriate URL. If you want to redistribute Agents with a non-Web application, however, you'll need a license from Microsoft. Every Agent character has a documented list of stock methods and states, with an animation assigned to each state and accompanying return state. In addition, each has a documented list of animations it can play, some of which are not assigned to states, and all of which can be given custom names. For instance, every character has a state for its Show method, called Showing, but the animation used for this state might be called Show or something else altogether. You can program Agents through either a COM interface or an ActiveX control. I wrote my sample programs as Web pages using VBScript, so the ActiveX control was my only choice. I started with some of Microsoft's samples and altered them to suit my tastes. I found that the actual scripting of Agents is pretty trivial. For instance, in my first sample (http://www.winmag.com/people/mheller/agent1.htm) I decided to use Genie, so I loaded him from the Agent Web site: AgentControl.Characters. Load "Genie", "http://agent.microsoft.com/characters/genie/genie.acf". Then I saved his character object to a variable to simplify the rest of the code: Set Genie=AgentControl.Characters("Genie"). I wanted him to perform a series of actions: Show himself in a puff of smoke (the "showing" state animation), bow to you (the "greet" animation), say "Hello," give you a thumbs up (the "congratulate" animation), tell you everything's okay, wave to you, say goodbye and finally vanish in another puff of smoke. That script is shown in the sidebar "Speak, Genie!" One thing I learned when developing this was to be careful with my Agent tag codebase fields. Several of the more prominent third-party Agent pages on the Web have errors that result in multiple downloads and installations of the Microsoft Agent ActiveX control. My Agent tags were current when I posted my demo pages; I based them on the tags Microsoft uses in its own demos, rather than on the information in its online documentation. Speak for yourself As you've already found out if you've run my three demos, Agents have two ways to speak: with a synthesized voice and with a recorded voice. Either way, they can display what they're saying in a balloon and synchronize their lip movements fairly well with what they're saying. Using the Text-To-Speech engine to make your Agents talk is easy and efficient, but the results tend to be rather mechanical. You can vastly improve the quality of the agent's speech by using recorded voices-at the cost of some extra effort for you and some additional download time for your users. If you want the character's lips to move in synchronization with the recorded voice, you need to add some extra information to the data found in an ordinary WAV file recording. To do this, use the Microsoft Linguistic Information Sound Editing Tool; you can download it, along with all the software you need to run and develop Agents, from Microsoft's Web site (http://www.microsoft.com/workshop/prog/agent). You'll also find tips for using the sound editor at my Agent demo page. Once you've tweaked the recording, save the combined sound and linguistic information in an LWV file and upload the file to your Web site. You can now play the LWV file from the Agent control using the character's Speak method. When I used the Speak method for synthesized speech (as shown in "Speak, Genie!"), the text was the first argument to the method. The second argument to the Speak method is the WAV or LWV file (for example, Genie.Speak_, "http://www.winmag.com/people/mheller/hellowrl.lwv"). You'll notice that I left out the first argument, so the Speak method uses the text in the LWV file for the Genie's balloon. Linguistics lessons Getting Agents to understand spoken commands is a little more complicated than getting them to speak, though it's not as complicated as you might expect. Essentially, you need to add items to the agent's Commands collection, setting the voice recognition string and caption for each command. Then you need to add each command case to the Agent control's Command event handler and implement a script for each command. Voice recognition strings use a variation on Backus-Naur Form (BNF) for their syntax. For instance, "(go away \| [say] goodbye \| scram)" matches "go away" or "say goodbye" or "goodbye" or "scram." The only caveat I found is that the voice recognition engine has trouble making fine distinctions. For instance, it can easily confuse "home" with "hello," if you speak quickly. It had no problem distinguishing "go home" from "say hello," however. To help define unambiguous command sets, keep the vowel patterns of all the recognized phrases unique. I haven't even touched on the process of editing your own Agent characters. Microsoft supplies a tool for assembling the relevant animations for a character frame by frame, with options for action-sequence branching and mouth overlays. But Microsoft expects you to supply your own frames in BMP or GIF format. Given the sheer number of frames required to make a convincing character, I would probably use a modeling tool to generate the Agent action sequences, instead of drawing key and in-between frames. But then again, I haven't tried it yet. Senior contributing editor Martin Heller programs and writes about it from Andover, Mass. Contact Martin at winmag.com or care of the editor at the addresses on page 20. SIDEBAR: Speak, Genie! Here's an example of the VBScript needed for the Genie Agent character. 'Preload the states and animations we need Genie.Get "State", "Showing, Speaking" Genie.Get "Animation", "Greet, GreetReturn" Genie.Get "Animation", "Wave, WaveReturn" Genie.Get "Animation", "Congratulate, CongratulateReturn" Genie.Get "State", "Hiding" 'Do the actual animation Genie.Show Genie.Play "Greet" Genie.Speak "Hello!" Genie.Play "Congratulate" Genie.Speak "Congratulations! If you can hear this you have the Agent control and the text to speech engine installed properly." Genie.Play "Wave" Genie.Speak "Goodbye" Genie.Hide Copyright r 1998 CMP Media Inc.