
Talking to the computer

Everyone has probably seen film characters talking freely with a computer that gives them intelligent answers. Software progress has now put part of this within reach of any user. With a processor of 1 GHz or faster and a microphone, you can train Windows to carry out various voice commands in any application. For example, you can create commands that simulate keystrokes and then use this feature in games: say, press "1" on the phrase "Choose a knife" and "2" on the phrase "Choose a gun".
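The idea of binding spoken phrases to keys can be sketched as a simple lookup table. This is only an illustration in Python, not how any particular voice control program stores its commands; the table name and the function `key_for_phrase` are hypothetical, and the phrases and keys are the examples from the paragraph above.

```python
from typing import Optional

# Hypothetical phrase-to-key table built from the article's two examples.
PHRASE_TO_KEY = {
    "choose a knife": "1",
    "choose a gun": "2",
}


def key_for_phrase(phrase: str) -> Optional[str]:
    """Return the key to simulate for a recognized phrase, or None if unknown."""
    return PHRASE_TO_KEY.get(phrase.strip().lower())


print(key_for_phrase("Choose a knife"))  # 1
print(key_for_phrase("Choose a gun"))    # 2
```

Normalizing case and whitespace before the lookup keeps the table small; an unknown phrase simply yields `None` and no key is pressed.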

So, let's begin. First, the microphone. For games, a head-mounted microphone (a headset) is more convenient: it picks up speech more clearly and is less affected by background noise. For controlling the computer, a desktop microphone is obviously better, so that you can give commands without walking up to the computer.

A note for owners of the popular Creative SBLive! 24bit: its microphone input is more sophisticated than on other sound cards. If you are using a simple condenser microphone (one with just a pair of wires) and it does not work, connect contact L to contact R in its plug (if it has a mono plug, first resolder it to a stereo one). Otherwise the microphone input works in low-sensitivity mode. A microphone modified this way must not be used with other cards, because some of them supply power for electret microphones on the R contact.

If you have two sound cards, for example a regular PCI card and one integrated on the motherboard, it makes more sense to use the integrated one for the microphone. The mixer on the main card may be reconfigured from time to time for other purposes (for example, to record broadcasts from a tuner), whereas the additional card can be dedicated entirely to the voice control program.

The microphone must first be configured in Windows. Double-click the mixer icon in the tray and the mixer appears. Windows has a pair of mixers for each sound card: one for playback and one for recording. We need to configure the recording one. To do this, go to "Options -> Properties" and select "Record Settings" there.

Enable display of the "Microphone" control. Then, in the options of the main mixer window, tick "Additional parameters".

An additional "Settings" button appears under the microphone control; click it.

There is usually an option for extra amplification of the microphone signal, for example "+20dB Boost" or "Booster" (see note 1). Enable it if you use a desktop microphone rather than one attached to headphones.
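To get a feel for what a "+20dB" boost means, the standard decibel formula for amplitude can be evaluated directly. The helper name `db_to_amplitude_ratio` is made up for this illustration; the formula itself (ratio = 10^(dB/20)) is the usual definition for voltage or amplitude gain.

```python
def db_to_amplitude_ratio(gain_db: float) -> float:
    """Convert a gain in decibels to an amplitude (voltage) ratio."""
    return 10 ** (gain_db / 20)


print(db_to_amplitude_ratio(20))  # 10.0
```

In other words, a 20 dB boost makes the microphone signal ten times larger in amplitude, which is why it helps with a distant desktop microphone and is usually unnecessary for a headset.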

Now for the main program. We will use Game Commander 2. Despite its gaming name, it is a very capable program that is in no way tied to games. Download and install it (see note 2).

Go to the settings (press the button with the "Edit" tooltip).

The settings already contain many presets for various games, but we do not need them, because they rely on text-based recognition of English speech. They can all be deleted. Then open the microphone settings via "Tools -> Microphone level".

Here you need to set the Volume slider so that every phrase you speak produces "Speech detected" at the top. With a desktop microphone you will most likely have to set the slider to "High". However, in actual use an excessively high sensitivity can make Game Commander confuse commands and produce false positives. In that case, reduce the microphone volume. Close the microphone settings.
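The trade-off between sensitivity and false positives can be illustrated with a toy level detector. This is a sketch under stated assumptions, not Game Commander's actual detection method, which is not documented here: the functions `rms` and `speech_detected`, the threshold of 0.1, and the sample values are all hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of a block of samples in the range -1.0..1.0."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def speech_detected(samples, gain, threshold=0.1):
    """True when the amplified block is at least as loud as the threshold."""
    return rms([s * gain for s in samples]) >= threshold


# Assumed sample data: quiet speech from a distant microphone, and fainter room noise.
quiet_speech = [0.01, -0.01] * 100
room_noise = [0.006, -0.006] * 100

print(speech_detected(quiet_speech, gain=1))   # False: level too low
print(speech_detected(quiet_speech, gain=20))  # True: speech is now detected
print(speech_detected(room_noise, gain=20))    # True: noise is detected too
```

With the higher gain the speech passes the threshold, but so does the background noise, which is the kind of false trigger that lowering the microphone volume is meant to avoid.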

Now let's create the phrases the program will respond to.

The program has two types of actions: Global Commands and commands for specific programs. Actions (more precisely, simulated keystrokes) from the Global Commands list are performed in whatever window is active, as if you had pressed the key yourself at that moment. In most cases this is inconvenient. Actions for a specific program are executed only if that program is running and its window is active.

To start with, let's practice on Notepad. First launch Notepad, then in the Game Commander settings go to "Add -> Command File ..." and select notepad.exe from the list of processes.

Then, in the settings tree, open the new Notepad branch, expand its Command Set, and add a Command.

On the General tab that appears on the right, select Voice Training mode under Voice input.

A prompt appears asking you to dictate the key phrase of the command.

Press Start and, watching the countdown, dictate the command three times.

Then, again on the General tab, in the Insert keystroke into action list field, enter the key or keys whose presses should be simulated.

Done. You can now add more commands to the Command Set to react to other spoken words. On the Feedback tab of an individual command you can specify sound files to be played before and after the command is executed. For example, you could record the phrases "command accepted" and "command executed" in them.

On the Feedback tab of the whole settings profile, you can specify sound files to be played when a command is recognized or not recognized.

On the Audio Devices tab you can choose which sound card's microphone to use, so it need not be on the main card but can be, say, on an additional one. The Check audio setup option restores the mixer settings if some other program or game has changed them.

With everything configured, save the changes and close the settings window. Now let's look at the main Game Commander window.

The first button, Listen, is the master switch that lets the program listen and execute commands. The second button, Audible Feedback, lets the application respond to your commands (with the predefined sound files). The third button, Global Commands, enables execution of commands from the global list.

Turn on Listen and start Notepad. Speak the command defined earlier. Note that Game Commander accepts (and executes) the command only if the Notepad window is active. If it reports Not recognized, that does not always mean it failed to recognize the command. It can also mean that it did nothing, for example because the window for which the command was defined was not active. Commands from the Global Commands list, however, are executed in any active window, provided of course that the Global Commands button is pressed.
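The behaviour described above can be modelled as a small dispatch function. This is a hypothetical Python model, not Game Commander's code: the tables, the function `resolve`, the example phrases, and the choice to check program-specific commands before global ones are all assumptions made for illustration.

```python
from typing import Optional

# Hypothetical command tables; phrases and keys are illustrative only.
GLOBAL_COMMANDS = {"switch task": "Alt+Tab"}
PROGRAM_COMMANDS = {"notepad.exe": {"new line": "Enter"}}


def resolve(phrase: str, active_program: str,
            listen: bool = True, global_enabled: bool = True) -> Optional[str]:
    """Return the keystroke to simulate, or None (shown as "Not recognized")."""
    if not listen:
        return None  # Listen is off: nothing is executed
    # Assumption: program-specific commands are checked before global ones.
    program_set = PROGRAM_COMMANDS.get(active_program.lower(), {})
    if phrase in program_set:
        return program_set[phrase]
    if global_enabled and phrase in GLOBAL_COMMANDS:
        return GLOBAL_COMMANDS[phrase]
    return None


print(resolve("new line", "notepad.exe"))      # Enter
print(resolve("new line", "explorer.exe"))     # None
print(resolve("switch task", "explorer.exe"))  # Alt+Tab
```

The second call shows why Not recognized can appear even for a correctly spoken phrase: the phrase exists, but only in the command set of a program whose window is not active.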

Global commands can be used for general computer control, for example to switch tasks, shut down the computer, or launch a web browser. Game Commander cannot launch programs on its own; it can only simulate keystrokes. This problem can be solved with the help of MediaKey. First, in Game Commander, set up simulation of a key combination, for example LShift + LCtrl + A; then register the same combination in MediaKey and assign a file launch to it there.
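The two-stage chain (voice phrase to key combination, key combination to file launch) can be sketched as two lookup tables. Neither table reflects the real configuration format of Game Commander or MediaKey; the phrase "open browser", the path, and the function `file_for_phrase` are hypothetical, and the sketch only returns the path rather than launching anything.

```python
from typing import Optional

# Stage 1 (the voice program's role): spoken phrase -> simulated key combination.
VOICE_TO_HOTKEY = {"open browser": ("LShift", "LCtrl", "A")}

# Stage 2 (the hotkey utility's role): key combination -> file to launch.
# The path is a made-up placeholder.
HOTKEY_TO_FILE = {frozenset(("LShift", "LCtrl", "A")): r"C:\Tools\browser.exe"}


def file_for_phrase(phrase: str) -> Optional[str]:
    """Follow both stages and return the file that would be launched, if any."""
    combo = VOICE_TO_HOTKEY.get(phrase)
    if combo is None:
        return None
    # A frozenset makes the lookup independent of the order the keys are listed in.
    return HOTKEY_TO_FILE.get(frozenset(combo))


print(file_for_phrase("open browser"))
```

The key combination acts only as a private signal between the two programs, so it is best to pick one that no application already uses.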

Program-specific commands, as the creators intended, are most useful in games (see note 3). Many different actions can be assigned to voice commands: selecting a weapon, displaying the map, reloading, repairing, and much more (in modern games, as a rule, every key on the keyboard does something).

Now that you understand the principles of using Game Commander, you can find your own use for it.


1. On the Creative SBLive! 24bit (and apparently some other sound cards), a driver bug means the boost can be switched on only in the microphone settings of the playback mixer, but it also affects the recording mixer.

2. Keep in mind that in free mode the program has significant limitations: it cannot simulate pressing several keys in a row, and it cannot "press" keys globally (only in programs specified in advance).

3. We stress that software and hardware progress long ago made interactive voice control of a computer possible everywhere, but it requires dedicated support from application programs. For example, imagine this dialogue:
- Computer: New mail received.
- Man: read.
- Computer: (reads).

Popular programs such as The Bat!, ICQ, and Miranda have long been able to announce incoming mail by voice in one way or another, and can even read it aloud (with the help of third-party plug-ins). But they cannot act on an additional event (such as the person's command in the dialogue above): if they read the message aloud at all, they do so immediately upon receipt.
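The kind of application support this note asks for amounts to keeping the message pending until a second event arrives. The following Python sketch shows one way that could look; the class `MailAnnouncer`, its methods, and the sample message are hypothetical and do not correspond to any plug-in interface of the programs named above.

```python
from typing import List, Optional, Tuple


class MailAnnouncer:
    """Announces new mail, then waits for a spoken "read" before reading it."""

    def __init__(self) -> None:
        self.pending: List[Tuple[str, str]] = []  # (subject, body) not yet read

    def on_mail(self, subject: str, body: str) -> str:
        """First event: mail arrives. Only announce it, do not read it yet."""
        self.pending.append((subject, body))
        return "New mail received."

    def on_voice_command(self, phrase: str) -> Optional[str]:
        """Second event: the user speaks. Read the oldest pending message on "read"."""
        if phrase.strip().lower() == "read" and self.pending:
            subject, body = self.pending.pop(0)
            return f"{subject}: {body}"
        return None


announcer = MailAnnouncer()
print(announcer.on_mail("Meeting", "Moved to 3 pm."))  # New mail received.
print(announcer.on_voice_command("read"))              # Meeting: Moved to 3 pm.
print(announcer.on_voice_command("read"))              # None
```

The difference from reading immediately on receipt is the `pending` queue: the text to be spoken is held until the user asks for it.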