Help in understanding a post in Node 607

The following two paragraphs were written by Chuck Runquist (node 607). The subject was: When to correct the misrecognitions. I have quoted the two paragraphs here so that anyone who chooses to respond to my query wouldn't have to look it up.

I'm very new to speech recognition and I'm using Dragon preferred.

Could someone clarify for me the first sentence of the second paragraph. I'm not getting just what the conditions referred to are. Thanks.

Here are the two paragraphs:

Not exactly correct. If the playback is clearly what you said, or intended to say, and the correct word or phrase is in the selections shown, then select the correct word or phrase, but don't train it. Again, the reason is that what you said originally was said in the context of your total dictation. If you train your correction under this condition you run the risk of training the word or phrase out of the total context of you original dication (i.e., we tend to speak differently during normal speaking than when we pronounce a word or phrase during correction). This changes the Acoustic Model representation for the way you normally speak. If the difference is significant, then you alter the spoken form such that it may result in increasing the propensity for the correction to be misrecognized. It is important to note that a one time correction/training will not generally result in future misrecognitions. However, doing this continually eventually alters the Acoutic Model in a negative way, thus causing accuracy degredation, which requires correction, which further exacerbates the problem. In short, it is the repitition of continually correcting and training the same word or phrase over time that is one of the causes of accuracy degredation.

The other part of this is that if a word or phrase is misrecognized, you should always correct it, but only train it under the above conditions. DNS does not learn from your normal dictation. The speech models and the recognizer work on the principle that if you do not make a correction, what was said and what was displayed are correctly associated. Only corrections train the speech models, and DNS only learns from correcting misrecognitions. Acoustic and Language models are distinctly different and how each is modified is distinctly different. The Language Model is only modified by document analysis. The Acoustic Model is only modified by training. When performing corrections, selecting the correct word or phrase from the selection options, or typing the correction, is based on the document analysis algorithm and changes the Language Model (context). Training changes the Acoustic Model. Performing corrections links the Acoustic Model to the Language Model either in terms of what was said originally or in terms of any training done during correction. It is critically important to remember these distinctions. This is the way SR works. Unfortunately, it is not subject to debate. SR algorithms work the way they do whether we disagree with them or not.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Wumply I'd say Chuck means

Wumply

I'd say Chuck means that any training you do should bear in mind his comments in his first paragraph. I'd further say that the best advice given this is to train only new words and phrases you add to the Vocabulary. (I tend not even to train those unless they aren't, after being added, recognised; and some people would go further and say, don't train them, ever, but add a written and spoken form.)

Judy

Judy: this is a whale of a

Judy: this is a whale of a lot to ask, I know, so if you should choose not to respond that would be understandably, certainly. But these questions are the specific ones I'd ask if I HAD a teacher...but I don't! Internet articles do not readily address such detail either.
When I select a correction or type a correction in, that's modifying the language model--is that correct...it's correcting my language "context'. It's learning/remembering then, yes? (I gather this may require a repetition or two on occasion.)

Example: The other day I dictated the following words as part of a longer sentence: "But re the avalanche...". This resulted in a printed misrepresentation (which sounded very clear to me on playback.) Of the 9 correction choices none had the word "re". 8 had the word "read" and the 9th had the word "really". OK, I typed in "But re the avalanche" which then printed correctly.

My question then is: (1) If today I were to dictate "But re the avalanche..." as part of a longer sentence, would Dragon get it right or at least in a 2nd or 3rd. instance?

AND (2) does Dragon remember each and every such correction?

AND (3) would it get "re" right if I were to use "re" in a different context...in many different contexts? If "yes" then Dragon continually learns? Well, would Dragon need no or minimal correction after dictating (and correcting as needed) a 1/2 page letter daily for a year(using everyday English) to a friend or two? By that time I figure I'd have dictated most of phrases I commonly use in context.

Finally (4) I get the feeling you think Dragon has little real need for an acoustic model? Would that be correct?

Wumply (note Matt's post)

Wumply (note Matt's post) I'm simply working from what Chuck says and has said previously. I may need to re-read some of his posts to answer you properly, still, here goes.

Quote:

When I select a correction or type a correction in, that's modifying the language model--is that correct...it's correcting my language "context'. It's learning/remembering then, yes? (I gather this may require a repetition or two on occasion.)

(According to Chuck, it doesn't always learn -- that depends on the type of change you make -- but I think it's best to assume it does.) I'm not sure -- given what Chuck says -- that it's modifying the language model, it could though be modifying both that and the acoustic model (if the user files are saved, of course!)

Quote:

My question then is: (1) If today I were to dictate "But re the avalanche..." as part of a longer sentence, would Dragon get it right or at least in a 2nd or 3rd. instance?

I believe it should

Quote:

AND (2) does Dragon remember each and every such correction?

again it's best to assume it does

Quote:

AND (3) would it get "re" right if I were to use "re" in a different context...in many different contexts?

My guess is, no. (Not immediately.) It might, it might not. The way to get it to learn "re" in many different contexts (if it doesn't immediately), IMO, is to run through the Accuracy Centre utility Add Words from Documents texts that contain a number of non-identical "re" phrases. Another way of course is to keep on correcting misrecognitions of "re" phrases.

Quote:

Well, would Dragon need no or minimal correction after dictating (and correcting as needed) a 1/2 page letter daily for a year(using everyday English) to a friend or two? By that time I figure I'd have dictated most of phrases I commonly use in context.

It shouldn't take that long!! Also the pain-free way is -- as above -- to run the letter through Add Words; make ten copies of the letter, combine them in a .txt file, run that through.

Quote:

Finally (4) I get the feeling you think Dragon has little real need for an acoustic model? Would that be correct?

No. I'm agreeing with Chuck that modifications to the model that introduce unnatural accentuations/stressing -- as may well happen when words are trained -- may degrade it.

I hope this is some help

Judy

Until Chuck responds, I

Until Chuck responds, I suggest that the sentence:

"DNS does not learn from your normal dictation."

means that DNS does not learn from uncorrected dictation, which to me is an unnessarily circular way of expressing a tautology. So long as DNS is interpreting correctly, it makes no mistake from your perspective. So you don't correct because you have no need to do so.

Its only when DNS begins to make mistakes that it needs/invites corrections. Pah! Backassward ways of not saying what is most important: Learn when and how to correct.

Bruce

Apparently in version 9 this

Apparently in version 9 this is not entirely true: it does learn from your dictation, at least within a single document.

Make a note to reprogram

Make a note to reprogram Chuck's boilerplate Smiling

Bruce

My website:

My website: www.metrocast.net/~wumply/exper-1.html Some moving, sad and joyous poems I'd like to share with others. If you should choose to tell your friends, I'd be appreciative. John
==============
Hey Judy...

Thanks so much for your good reply! It was just satisfying to have specific answers; I appreciate your taking the time and the "paste-and-answer" method you used. I love it when people do that. I am thinking in passing Dragon must be a tremendously-complex program!

John

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.




view recent posts