• Ganglioneuroma: Rarest and most benign
    • It's done
    • Fun with Yelp...
    • That's no moon...
    • Online classes
    • Insert your getting stoned joke here
    • The new Gmail look and feel...
    • Garmin 60Csx vs Oregon 450
    • Our 2011 Apple Harvest
    • Expense report
    • Hard Drive Destruction
    • It's the small things...
    • Random passwords
    • Cherry Dutch Baby
    • The paperless office needs a paperless toilet
    • Cilantro-pistachio pesto pesto, rice and beans
    • My first iPhone hide
    • Yeast Waffles
    • Seiko battery replacement
    • Nikon D40 won't power up
    • Mapnificent
    • Geocache Queries
    • iPhone 4 travel map
    • I'm Here To Put You Back On Schedule
    • Disruptive technologies
    • Fraud alert
    • Cleaning between the door glass of a Frigidaire oven
    • Snap, Crackle and Pop
    • Dolphin Kick
    • Conversation killer
    Schedule chicken Beets, potato salad, and limeade

    Voice recognition software

    By jim On 23 June 2004 · 6 Comments · In science

    I’m playing with
    Dragon NaturallySpeaking
    again because I want a more efficient means of getting ideas into a more permanent form. When I get excited, I can talk about 5x faster than I type, and I can type 10x faster than I can handwrite. Handwriting doesn’t work very well, especially on a blog (though it does offer some near unbreakable encryption, according to my spouse). DNS claims it can do 3x one’s typing speed.



    There are several “levels” of the software. I have the “preferred” edition because when I bought it, I was regularly using Outlook as a mail/calendaring tool. As a special bonus, DNS also works with the “vi” text editor. I don’t have the patience to enter manual editing commands, though.



    I’ve run through several of the software’s training passages. Doing so helps it tailor my inflections and improve its accuracy. This is a very tedious process, however David Pogue at the NYT swears it will pay off. (He’s largely the reason I use DNS over IBM’s Via Voice, although both are sold by ScanSoft.).



    The software relies heavily on contextual analysis. Thus, as you type, it won’t display until it’s chained together a bunch of phonemes and formed a probable sentence. You have to do this in a relatively quiet space as it does pick up background noise fairly easily. (In my old cubicle environment, this was a big problem.) It’s still not perfect, though, but it shows a lot of promise.

    Here are three sample passages and the results of my reading them.




    The first passage is some lyrics taken from the middle of the song “You Can Call Me Al,” from Paul Simon’s excellent Graceland CD. First, the original:


    A man walks down the street,

    he says, why am I short of attention?

    Got a short little span of attention,

    and whoa, my nights are so long!

    Where’s my wife and family?

    What if I die here?

    Who’ll be my role-model?

    Now that my role-model is …. gone …… gone,

    he ducked back down the alley,

    with some roly-poly, little bat-faced girl.

    All along …. along ….

    there were incidents and accidents,

    there were hints and allegations …..

    Here is Dragon Naturally Speaking’s transcription, unmodified except for line breaks for visual comparison:

    If a man walks down the street,
    he says, why am I short of attention?
    Got a short little span of attention,
    and whoa, my nights are so long!
    Where’s my wife and family?
    What if I die here?
    Who’ll be my role model
    of that my role model is … gone… gone,
    he ducked back down the alley
    with some roly-poly, little bat faced girl.
    All along in its use along in its use
    they were incidents and accidents,
    there were hints and allegations…

    This was better than I expected. It got the dashes between roly-poly, but not bat-faced. It also thought I meant “in its use” instead of “…” (pronounced elipses).



    The second passage was taken from yesterday’s blog entry on schedule chicken. I thought this might be a little different because of the long, complex sentences. Here is the original:


    If you’ve watched a lot of old movies, you’ve no doubt seen the one where the two hooligans with something to prove decide to race their cars towards each other at high speeds. The first one to turn away, averting certain collision, is the “chicken” and loses face among his cohorts. The other player, referred to as the “winner,” gets bragging rights and access to the fickle heroine.

    Here is the DNS transcription. When you read, you have to tell the program where punctuation goes. It’s very unnatural at first. I didn’t know how to tell it to put double quotes around the two words, but otherwise, this is unedited:


    And if you’ve watched a lot of old movies, you’ve no doubt seen the one with two hooligans with something to decide race their cars towards each other at high speeds. The first one to turn away, avoiding certain collision, is the chicken and loses face among his cohorts. The other player, referred to as the winner, if bragging rights and access the fickle heroine.

    This is not that bad – an added “and,” missing “prove”, “to” and “if” instead of “gets.” I’m still a little new at this, so when I read into the microphone, I’m making more of a conscious effort to enunciate clearly, but I’m still talking at my rapid rate. The program takes upwards of 30 seconds to make sense of what I’m saying before typing the text in the place.



    The final passage is excerpted from Fran’s June 18th entry. This is interesting test case because it includes a formal product name and a geographical place:


    My proofreading job on Microsoft Encarta ended today. We all went out to lunch and then called it a day. I 

    • Share:
    Share →
    Tweet

    6 Responses to Voice recognition software

    1. Fran says:
      23 June 2004 at 16:17

      Hey, that’s pretty cool! Sure, you can use my bit of blather.

      Reply
    2. Debbie says:
      24 June 2004 at 8:04

      Drool. I’ve been thinking about getting something like that. I have a bit of a southern drawl, but I could make the extra effort to enunciate so that it might better understand my hick talk.

      I speak slowly, but I type even slower. :)

      Reply
    3. Talia says:
      1 July 2004 at 1:10

      Keep us posted on the progress! I bought this same software a few months ago because I have carpal tunnel, and am in school (lots of note-taking/paper writing).

      I found that it was slow-going with all the medical terminology and ultimately I was spending too much time training the program instead of getting my work done. I’m going to play with it more when I get free time. But for now I’ve had to shelve it.

      I hope it works well for you!

      Reply
    4. Jurabek says:
      11 September 2005 at 23:45

      I need some hints of speaking.Because I got little score from this section.SO that I’m lack of hints .I used 101 IELTS book,but it gave not such a good result.please,deliver your advice or the sites for this topic.

      Reply
    5. Ron says:
      8 February 2006 at 10:57

      I have a fairly neutral accent and I want to talk to a family member who is now very deaf indeed. Would a laptop and Dragon 8 be the answer. The most powerful hearing aid is now not keeping up with deteriorating hearing. Do you think all the family could use it? How quiet does the room have to be?
      Thanks in advance for any help and advice.

      Reply
    6. Nan says:
      12 February 2006 at 12:20

      I shall be very interested to read any replies to Ron’s query of Feb 8th about the use of Voice recog for deaf people. Very deaf myself, my hearing aid is not keeping up with my need. I could often communicate with my husband better if I could read what he was saying – there’s also the need to communicate with friends. How quickly can Dragon learn a new person’s speech? Must the microphone be really close to the speaker’s mouth or could it be placed to pick up speech from a small group? The experience of any deaf people would be invaluable and much appreciated.

      Reply

    Leave a Reply Cancel reply

    Your email address will not be published. Required fields are marked *

    *

    *

    You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

    • Recent Posts

      • Ganglioneuroma: Rarest and most benign
      • It’s done
      • Fun with Yelp…
      • That’s no moon…
      • Online classes
      • Insert your getting stoned joke here
      • The new Gmail look and feel…
      • Garmin 60Csx vs Oregon 450
      • Our 2011 Apple Harvest
      • Expense report
      • Hard Drive Destruction
      • It’s the small things…
      • Random passwords
      • Cherry Dutch Baby
      • The paperless office needs a paperless toilet
    • Recent Comments

      • Cleaning between the door glass of a Frigidaire oven
        • Kate: I say that to my 30-year-old fiance on a fairly regular basis as well. ;)
        • Melissa: Thank you! Thank you! Thank you! Been fighting with a coat hanger and rags on...
        • Regina: THank you so much for this info. I have had a line down my stove for almost 2...
        • Yoda: So happy to find this info. Wish I had had it for my old range, but will keep it...
        • Tanya: I actually called them to ask how to clean that part ..so many drips on mine..I...
      • It’s done
        • jim: Thanks, you all. I am feeling much better. @John – When I knew the surgery...
      • Ganglioneuroma: Rarest and most benign
        • jim: Thanks, guys. @Phil – I am looking forward to our next hike! @John –...
        • John: Descriptions of medical procedures are cringe-worthy unless you’re the one...
        • Phil: Fun read on a not so fun experience. As much as I enjoyed our ‘pain scale...
    • Twits

      • No Tweets Available
      @jim_carson
    Unknown Twitter error.  — jim_carson
    loading Cancel
    Post was not sent - check your email addresses!
    Email check failed, please try again
    Sorry, your blog cannot share posts by email.