Help me about speech to text

Sep 13, 2013 at 7:36pm
how to write speech to text application?

without any plugin all code by myself like Google or Microsoft SAPI or Apple Siri
Sep 13, 2013 at 7:44pm
Just the fact the you are asking this question suggests to me you're not up to doing it...
Sep 13, 2013 at 8:08pm
sorry. my English is poor for now
Sep 13, 2013 at 9:22pm
LOL
Sep 13, 2013 at 9:43pm
Well, if you're serious about doing this, what do you mean when you say 'all code by myself'? I mean, do you want to make this with just your operating system and C++ compiler and nothing else?
Sep 13, 2013 at 9:46pm
I'm pretty sure you need at least an API(?) with the device (unless you make this from scratch, too). Try looking up any documentation for receiving the device's input.
Sep 14, 2013 at 8:04pm
i want write program to act like google voice. i speak and it write
like Apple Siri, apple dictation
Sep 14, 2013 at 8:28pm
You're missing the point. Speech-recognition is a notoriously difficult task. Even superior software, developed by people with millions of dollars to pour into it, typically requires calibration to a particular speaker's voice.

Use a library of some kind. Your application will actually get finished some day if you do.

Good luck!
Sep 14, 2013 at 8:37pm
i want it to be fast like google voice. Microsoft Speech Recognition is not fast and is not good for work.

get me best library for c++ or python
Sep 14, 2013 at 8:48pm
closed account (3qX21hU5)
I assume you are looking to make something like Dragon from Nuance (Of course on a much smaller scale). As others have pointed out if you are asking how to do it you don't really have the programming skill to take on such a task as of right now.

Specially if you want to make something better then those that are out there (Like Microsoft Speech Recognition, Dragon, Siri, ect) which all had teams of hundreds of highly experienced programmers working on them and budgets of millions of dollars.

We aren't trying to deter you from what you want to do, but we are just being realistic. This is not something that a beginner would be able to accomplish and I doubt even a highly experienced programmer could accomplish it without a massive budget and a big team.

Though if you just want to learn about speech regonition in generally this might be a good open source library to check out http://cmusphinx.sourceforge.net/ there is also the Mircosoft SAPI http://msdn.microsoft.com/en-us/library/ms723627(v=vs.85).aspx

Though be warned you have to have a pretty good grounding in whatever language you choose to use. If you do you can come up with a nice little speech recognition program, but just don't expect it to be in the same league as Dragon, Siri and the other.
Last edited on Sep 14, 2013 at 8:49pm
Sep 14, 2013 at 8:54pm
can i add my language Persian to recognition.
Sep 14, 2013 at 9:10pm
If you did want to try to do this without a library, it would be tough, but not impossible. This would be done in several sections:

1. Get a stream from a microphone. You need to interface with a driver, use your OS, or a library like OpenAL to start recording from the microphone. You'll get a stream of data.
2. Recognize when to split up your stream. If you can recognize the space between words, or between sentences then package that data as a single sound clip and pass it onto the next step, then record your next clip.
3. Convert the sample to text. This is the toughest part and you'll really need to think about how to do this. One way could be to pre-record some words, then calculate a similarity ration. Another way could be to pre-record your words and study the waveforms in a graphical format such as what is offered with Audacity. If you can find patterns for specific syllables, letters, or words, and you can associate those patterns with your stream, then you've got it and you've probably just made $1,000,000.

Of course, doing it yourself allows you to choose whatever language you want.

Sep 14, 2013 at 9:18pm
OK. forget add Persian language. i want library work on Linux, mac and windows
i think library better for python
Last edited on Sep 14, 2013 at 9:19pm
Sep 14, 2013 at 10:31pm
What experience in programming do you actually have? @Sadegh2007?
Sep 14, 2013 at 11:00pm
Sorry to barge in on this thread guys, but I want to build a castle. Can anyone tell me what to build it out of? I don't want to use bricks, but if I have to, which bricks are the best? Also, I want the castle to be able to transform like megatron.
Sep 15, 2013 at 2:20am
Interesting you should mention that xismn. I'm currently digging a new river, on my own, with a spade. It's going to be roughly the size of the Yangtze, without use of explosives, machinery or extra manpower. Anyway, it would not be much trouble for me to redirect a bit of that water to give you a moat for your castle. :)
Sep 15, 2013 at 2:47am
Current posters are assuming OP is spamming. He might not be. So let's not be rude, okay?

[edit]
@Sadegh2007
I still don't think you understand the magnitude of what you are trying to do.

Lucas had it backwards. R2-D2 would very easily speak to people. But Luke would have to go over and use a keyboard on R2's head to talk to him. Or get C3P0 to translate into whistles for him.
Last edited on Sep 15, 2013 at 2:49am
Topic archived. No new replies allowed.