Registering and logging in removes this ad.
Registering and logging in removes this ad.
DNS8: Question about correcting phrases
This question is about correcting phrases in DNS8.
Let's say I dictate something like this (just an illustrative example):
"Did you buy lima beans?"
and Dragon gets it this way:
"Did you byline a beans?"
So you say "Correct byline a beans"?
You enter the correction in the correction window, and select "train".
The next box allows you to train "buy lima beans" and then "byline a beans."
Question: if the misrecognition is a nonsensical phrase, is there any value (or detriment) to training it? Or should you just train the phrase that would be correct and skip the nonsense phrase?
(When the error phrase and the desired phrase are both phrases I might use, I will train them both at this point.)
Thanks,
Stan Schwartz


snsok wrote:This question is
This question is about correcting phrases in DNS8.
Let's say I dictate something like this (just an illustrative example):
"Did you buy lima beans?"
and Dragon gets it this way:
"Did you byline a beans?"
So you say "Correct byline a beans"?
You enter the correction in the correction window, and select "train".
The next box allows you to train "buy lima beans" and then "byline a beans."
Question: if the misrecognition is a nonsensical phrase, is there any value (or detriment) to training it? Or should you just train the phrase that would be correct and skip the nonsense phrase?
(When the error phrase and the desired phrase are both phrases I might use, I will train them both at this point.)
Thanks,
Stan Schwartz
Stan,
First, don't be too quick to correct or train. Before you assume that you need to use the correction function, select the entire phrase, preferably the misrecognition and the two words to the right and left of such. Then redictate it. If it gets it correct on the second dictation, then the problem was how you spoke the phrase/sentence, not whether DNS misrecognized it.
Second, if it gets misrecognized again, then use the Correction or Spell dialog to correct it. However, you should review my explanations in the following link posted previously in this forum concerning this topic.
http://www.speechcomputing.com/node/607
Chuck Runquist
Former DNS SDK & Senior Technical Solutions PM for DNS with Lernout & Hauspie (L&H)
Chuck, Thank you so much for
Chuck,
Thank you so much for the help. One other question about correction that I do not understand.
I have my options set to show the correction box when I select a word by saying "select". If I redictate a word when the correction box is openby virtue of saying "select ", and I don't choose one of the numbered spelling options, does Dragon interpret that as a correction?
Conversely, if the correction box opens by virtue of a selection rather than a correction, and I do choose one of the listed options, does Dragon interpreted as a correction?
In other words, does it matter which word results in the opening of the correction box in terms of how the correction is processed?
Maybe it is best not to have the correction option appear with a selection??
Stan
snsok wrote:Chuck, Thank you
Chuck,
Thank you so much for the help. One other question about correction that I do not understand.
I have my options set to show the correction box when I select a word by saying "select". If I redictate a word when the correction box is openby virtue of saying "select ", and I don't choose one of the numbered spelling options, does Dragon interpret that as a correction?
Conversely, if the correction box opens by virtue of a selection rather than a correction, and I do choose one of the listed options, does Dragon interpreted as a correction?
In other words, does it matter which word results in the opening of the correction box in terms of how the correction is processed?
Maybe it is best not to have the correction option appear with a selection??
Stan
Stan,
First, most users tend to think of all of the various utilities in DNS as being unique and distinct from one another. This is not the case. All functions in DNS are related to the basic speech models: The Acoustic Model and the Language Model. The specific speech model to which a particular function, or utility, applies depends upon whether or not that function relates to the spoken form or the written form. For example, all training is linked to the Acoustic Model, and all functions that deal with writing style or context analysis are linked to the Language Model, of which there are 3: Vocabulary, bigram, and trigram models.
Second, the Acoustic Model and the various language models are linked via the "spoken form" and the "written form." The most common observation that you can make in this regard is in terms of how the Vocabulary is configured. The Vocabulary is the single most important Language Model. The reason for this is simply because without the Vocabulary, there is no link between the Acoustic Model and the various Language Models. It is via the Vocabulary that the written and spoken forms are linked. In the Vocabulary Editor, this relationship is most obvious because you have the option to enter a written form and a spoken form. If you leave the spoken form blank, than the Vocabulary Editor automatically assumes that the normal phonetic pronunciation of what ever word or phrase you are adding to the vocabulary is the default form for dictating that word or phrase. If you add a spoken form, then DNS assumes that the pronunciation of that word or phrase is based on the spoken form, and the phonetic equivalents for that spoken form are added along with your vocabulary addition. If you train a word or phrase added to the vocabulary that contains a spoken form, it is the spoken form that you are training and that spoken form is then added to the vocabulary linked to that word or phrase using the phonetic equivalent of your pronunciation. My point here is that this is how the Acoustic Model and the Language Models are related to one another. That is, through the link between the written form and the spoken form. So the basics of speech recognition in DNS, as well as in all the other SR products, is the relationship between the Acoustic Model and the Language Models, which are basically referred to technically as "speech models." This relationship underlies all of the utilities and functions in DNS.
Third, DNS does not learn from your dictation. It only learns via correction. In other words, this is a simple algorithm that says, in essence, “if what I say is not corrected, then it is correct as dictated by default.” If, on the other hand, what is dictated is perceived by the user as a misrecognition, then any time the correction dialog, whether that be the Correction Window or the Spell dialog, is invoked, Dragon NaturallySpeaking® assumes automatically that any changes made to the original dictation in either of these dialogs is, by default, a correction. Juxtaposed to this, if you select a word or phrase and redictate it, Dragon NaturallySpeaking® assumes, by default, that this is a substitution, not a correction, and therefore simply assumes that, for lack of a better analogy, you simply erased what you dictated originally and redictate it, or changed it.
Fourth, corrections are stored in both your user, unless you do not save your user when closing your DNS session, and in the appropriate inclusions relative to the Acoustic Optimizer. When you run the Acoustic Optimizer, the Acoustic Model, along with the appropriate links to the various Language Models, is reanalyzed based on all of your corrections made in between the current running of the Acoustic Optimizer and the last running of the Acoustic Optimizer. In DNS 8 your entire user is a readapted using the same algorithm that displays the “Adapting User” at the end of general training. One of the reasons why, in DNS 8, the Acoustic Optimizer displays the time in terms of several hours is that it is going back through your entire acoustic profile and checking it against all of the corrections that you have made since the last time that you ran the Acoustic Optimizer. It then readapts your Acoustic Model based on these changes. In short, it's not dissimilar to completely re-adapting your user. In fact that is exactly what's happening, even though it only takes between 10 and 30 minutes of actual time to do so.
Lastly, what happens in the correction dialogs is this. If you "choose" an option, the original “spoken form,” what was recorded by DNS when you originally dictated the word or phrase that you are correcting, becomes linked with the written form that you "chose." This is why it's so important to have the automatic playback on correction enabled. This allows you to listen to what you said when you originally dictated what you are correcting so that you can make a decision as to whether to keep the original "spoken form" or retrain it. Since we've already covered training we won't go into that. However, if you choose to train a correction, then the New "spoken form" is then linked to the New “written form.” On the other hand, if there is no playback when you select a word or phrase for correction, then whatever is associated (i.e. spoken form) from any point in the past where the original word or phrase was dictated and recorded in the Acoustic Model is retained as the spoken form associated with your "correction."
The basic answer to your question, after the above diatribe, is simply this. DNS only learns from your corrections, DNS assumes that a correction is being made only if the correction function is used. Any other form of replacing text in a document that does not invoke the correction function, is not a correction, but rather a substitution. DNS treats substitutions as if they were a normal part of your dictation. That is, if a substitution is not corrected, then it is assumed to be correct as dictated in terms of the link between both the spoken form and the written form.
Chuck Runquist
Former DNS SDK & Senior Technical Solutions PM for DNS with Lernout & Hauspie (L&H)
Chuck, Many thanks for
Chuck,
Many thanks for explanation of how Dragon "learns" re corrections; as a novice, I hope I've understood but sometimes just when I think I've grasped the meaning, it slips away from me again (often happened to me in maths classes)
Re corrections: there was something I used to do in the past, which I'm not sure was a good idea or not, and I'd appreciate your comment on it.
I always have playback turned on, and when a phrase was misrecognised, I used to select the phrase and say "Correct That" which brought up the dialog box with the phrase in it, and several alternative versions of what Dragon "thought" I'd said.
Then, but only if the text in dialog box revealed something totally different from what I had said but I could clearly hear the phrase I had intended on playback, I used to correct the phrase by saying the same thing every time, eg "No way do I want this phrase", and clicking the OK button.
Presumably, unless I dictated "No way do I want this phrase", I would never get that version of what Dragon "thought" I said again.
Was this a good or bad idea?
Chuck, Thanks. Your
Chuck,
Thanks. Your explanation is clear.
I am not sure, though, of the answer to my last question.
I have the option set so that if I say "select " or "correct ", I get the correction window in either case. At that point, if I make a correction in the correction window, either by redictating into the window or selecting one of the numbered choices, does DNS process my action the same?
In other words, once the correction window is opened, does how it got opened make a difference in the outcome of the correction?
I believe from what you say, once I am working in the correction window, how I got there is irrelevant.
Stan
Not Chuck but yes is the
Not Chuck but yes is the answer !
Graham
www.itspeaking.co.uk
Good morning Graham, I
Good morning Graham, I presume it was my question you were responding to?
Yes, it was a Good Idea, or yes, it was a Bad Idea
Chris
Good Evening Chris! Sorry
Good Evening Chris! Sorry for the ambiguity.
You said
"I believe from what you say, once I am working in the correction window, how I got there is irrelevant."
Yes - the way you got there is irrelevent.
Graham
Thanks Graham but this was
Thanks Graham but this was not my posting to which you replied. Never mind, all knowledge is grist to the mill.
Chris