Intel Teams with Nuance To Bring Voice Search to Ultrabooks

Status
Not open for further replies.

razor512

Distinguished
Jun 16, 2007
2,110
57
19,890
Hopefully it works as well as dragon, (I use it pretty frequently to control my PC while I do other stuff like solder). It is pretty accurate, especially with commands, but transcribing text could use a little work.

I was able to significantly improve me accuracy by upgrading from the crappy headset that to program comes with, to a blue microphone yeti, but even then, it makes more mistakes than I would like.

Since it's errors are with word choice and not spelling, it is difficult to find it's errors when you use it to type up a 20 page paper.
 
[citation][nom]A Bad Day[/nom]How well would it work if there's heavy accent being used?[/citation]

Some voice software like this can adjust to your own personal voice, so although it might take a while, it might be able to adjust to someone's accent.
 

alidan

Splendid
Aug 5, 2009
5,303
0
25,780
[citation][nom]Razor512[/nom]Hopefully it works as well as dragon, (I use it pretty frequently to control my PC while I do other stuff like solder). It is pretty accurate, especially with commands, but transcribing text could use a little work.I was able to significantly improve me accuracy by upgrading from the crappy headset that to program comes with, to a blue microphone yeti, but even then, it makes more mistakes than I would like.Since it's errors are with word choice and not spelling, it is difficult to find it's errors when you use it to type up a 20 page paper.[/citation]

god yes same here, blue yeti makes that program almost perfect... but i still get problems, like the program not being able to correct for some reason and such... its a shame because i would use it more often if it would correct, and stop using the word ship when i say a... similar word.
 

TeraMedia

Distinguished
Jan 26, 2006
904
1
18,990
This technology could work well at home or in a private office or lab environment, but isn't well-suited to cube-world. In light of that, it should probably be enhanced to more appropriately focus on those use-cases.

How well does this technology differentiate between a statement directed to it (e.g. "open 'c:\The Door.docx'") a statement made in the ambient environment (e.g. "Open the door.") I get the "hello dragon" and "go to sleep" part, but is there something like Siri where you address the device by name, or does it simply assume that you're talking to it?

Here's where I think MSFT should take this:
First, use a pair of microphones so that you can determine the direction of the speaker.
Second, when you address the device, either with an individual statement such as, "Siri, find me a Chinese restaurant nearby" or with a batch statement such as, "Siri, record the following dictation," the following should occur:
- the device should identify and authenticate you using voiceprint recognition on its identifier - in other words, how you say "Siri" (or whatever name you choose to give it). If you have setup casual authentication, it automatically changes context to your associated windows login - and incorporates all of your favorites, characteristics, etc. that you have defined in that login. If you have setup strict authentication, the device should require some form of challenge-response authentication before it will accept any command on your behalf.
- the device should give a visual cue that it is in command mode, and is listening to you. Maybe a pair of semi-transparent eyes focused in your direction, positioned on the screen based on config preferences. The visual characteristics of the cue could be used to indicate who you have been authenticated as (to help avoid mistakes in a crowded room).
- audible and visual representations of the understood command should be (configurably) echoed so that you can verify it understood you accurately.
- identifications and commands from other individuals should be ignored for the duration of the command or session (e.g. "thank you Siri" or "good bye, Siri"), at which point the visual cues should clear from the screen.

Anyway, it's nice that they're trying to make it easier to use a computer. I suppose we'll have to see how well they designed the voice interface.
 

mcd023

Distinguished
Nov 9, 2010
370
0
18,780
in all honesty, I find that the speech recognition built into windows (7 and 8) is really accurate. I can't use the area mic on my lappy, I have to use one of my headsets (no prob). It doesn't have the voice controls like Siri does, but you can still use it to control your pc. It was intended for disabled people and probably to be used in conjunction with Narrator, but it works really well. For what I do, typing is still quicker and it's a bit gimmicky since I'm not disabled, but it was fun, especially with the speech recognition macros. I could at least make people think that I was having a conversation with a HAL wannabe. lol. That was fun.
 
Status
Not open for further replies.