Dragon NaturallySpeaking

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 89.97.166.182 (talk) at 08:25, 13 October 2008 (→‎References). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Dragon NaturallySpeaking
Developer(s)Nuance Communications
Stable release
10.0 / August 2008
Operating systemMicrosoft Windows
TypeVoice Recognition
LicenseProprietary
Websitehttp://www.nuance.com

Dragon NaturallySpeaking is a speech recognition software package developed by Dragon Systems, and sold by Nuance Communications for Windows personal computers (PCs). It was among the first programs to make speech recognition practical on a PC.[1]

NaturallySpeaking uses a minimal visual interface. Dictated words appear in a floating tooltip as they are spoken, and when the speaker pauses, the program transcribes the words into the active window at the location of the cursor. Like other speech recognition software, NaturallySpeaking has three primary areas of functionality. Dictation, whereby spoken language is transcribed to written text; commands that control, whereby spoken language is recognized as a command to click widgets (controls); and finally text-to-speech whereby written text is converted to synthesized audio stream. Early versions of the software had to be trained for approximately 10 minutes to recognize the user's voice, though version 9 no longer requires the initial training.

Dragon Naturally Speaking allows you to access your voice profiles in a networked environment. Doing this requires that each computer in the network have the same audio hardware which is easily achieved with a number of USB microphones that have built-in sound cards.

Nuance claims that using NaturallySpeaking, writing a 900 word essay would take 6 minutes, while typing 40 words per minute and writing a 900 word essay would take 22 minutes.[citation needed]

History

NaturallySpeaking has passed through four companies and evolved considerably since its first beginnings in the early 1980s as a research prototype called DRAGON. The married couple Dr. James Baker and Dr. Janet Baker founded Dragon Systems in 1982, deciding to commercialize DRAGON when their funding was cut by DARPA. Their first product DragonDictate was sold for a number of years. Dr. James Baker departed from the conventional AI, and was a pioneer in Hidden Markov models, a way of using statistics for recognition of speech. Dr. Janet Baker developed the expert system named Hearsay.

In March of 1990, Dragon Systems began selling DragonDictate (for DOS) at a cost of $9000 for a single-user license. As hardware became less expensive over the next several years the price decreased, and by the time NaturallySpeaking 1.0 was released, the price of DragonDictate for Windows was about $2000. The hardware of the time was not yet powerful enough to address the difficult problem of word segmentation, and was unable to determine the boundaries of words in the continuous signal that constitute human voice. Users had to pronounce one word at a time, each clearly separated by a small pause before the next. DragonDictate is based on a trigram model, and is known as a discrete speech recognition engine.

In 1997 advances in hardware technology allowed NaturallySpeaking version 1.0 to launch as the first available continuous dictation system. During this time the speech recognition industry promoted enthusiastically the notion that speech input was "the" natural modality that would eventually supersede more "primitive" methods such as keyboards. Trying to reach a mass market, vendors dropped prices to levels that were unsustainable.

Lernout & Hauspie bought Dragon Systems in 2000. The dictation system bubble burst in 2001, and Lernout & Hauspie went bankrupt. ScanSoft Inc. bought the rights to Dragon products. In 2005, ScanSoft merged with Nuance Communications, and changed the name of the combined entity to Nuance.

The software today is being advertised as potentially up to 99% accurate.[2]

Editions

There are a range of editions of Dragon NaturallySpeaking [1], each of which comes with a noise-canceling headset microphone:

  • Dragon NaturallySpeaking Standard Edition, which is an entry-level version of the product that enables command and control of the PC, as well as speech-to-text input for email, instant messaging clients and word processing.
  • Dragon NaturallySpeaking Preferred Edition, which adds robust integration with Microsoft Office (Word, Excel, etc.) and Corel WordPerfect, as well as support for digital recorders and Bluetooth headset microphones. This version also allows users to control the formatting of documents by voice (increase font size, set colors, set columns, insert tables, etc.)
  • Dragon NaturallySpeaking Mobile Edition, which is the Preferred edition with a digital voice recorder. Users can dictate on the recorder, upload the recording to their PC, and have Dragon NaturallySpeaking convert the recording into a text or Microsoft Word document.
  • Dragon NaturallySpeaking Professional Edition, which adds robust support for custom voice commands (where the user can associate a word or phrase to pre-defined text or graphics) and scripting (which enables automated operation and integration with more specialized applications such as electronic medical records and case management applications.)
  • Dragon NaturallySpeaking Professional Medical and Professional Legal, which are versions of the Professional edition with extended vocabularies for those domains.
  • Dragon NaturallySpeaking Client and Server Developer Toolkits (SDK's), which are used by commercial and in-house developers to integrate speech with their applications. This can include front-end speech recognition (adding speech input to the application) and back-end speech recognition (batch processing of recorded speech for search, transcription or other application areas).

Issues

  • Several users report difficulties using Dragon in a networked environment with networked profiles that disappear. Dragon has not yet fixed this problem. [citation needed]
  • Dragon Naturally Speaking (DNS) version 10.0 does not work under 64 bit operating systems (Windows Vista 64 bit or Windows XP 64 bit). Nuance does not support 64 bit operating systems, and are not planning to fix this any time soon. [citation needed]
  • Despite Nuance's lack of support, DNS 9.0 up to DNS 9.5 DOES work on Windows XP 64 and Windows Vista 64. There is a fix to force an install on Windows Vista – see this external link


Versions

Version Release date Editions
1.0 June 1997 Personal
2.0 November 1997 Standard, Preferred, Deluxe
3.0 October 1998 Point & Speak, Standard, Preferred, Professional (with optional Legal and Medical add-on products)
3.01 Teens
4.0 August 4, 1999 Essentials,Standard, Preferred, Professional, Legal, Medical, Mobile
5.0 August 2000 Essentials, Standard, Preferred, Professional, Legal, Medical
6.0 November 15, 2001 Essentials, Standard, Preferred, Professional, Legal, Medical
7.0 March 2003 Essentials, Standard, Preferred, Professional, Legal, Medical
8.0 November 2004 Standard, Preferred, Professional, Legal, Medical
9.0 July 2006 Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server
9.1 ?? Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server
9.5 January 2007 Standard, Preferred, Professional, Legal, Medical, SDK client, SDK server
10.0 August 7, 2008 Standard, Preferred, Professional, Legal, Medical

See also

References

  1. ^ Sindya N. Bhanoo. July 16, 2007. Language quest moves to Hopkins: Speech-to-text technology expert joins defense work. Baltimore Sun. Retrieved on July 20, 2007.
  2. ^ "Dragon NaturallySpeaking 9". Nuance Communications. Retrieved 2008-04-25.

NaturallySpeaking Awards NaturallySpeaking Video Demonstrations

External links