Студопедия

Главная страница Случайная страница

Разделы сайта

АвтомобилиАстрономияБиологияГеографияДом и садДругие языкиДругоеИнформатикаИсторияКультураЛитератураЛогикаМатематикаМедицинаМеталлургияМеханикаОбразованиеОхрана трудаПедагогикаПолитикаПравоПсихологияРелигияРиторикаСоциологияСпортСтроительствоТехнологияТуризмФизикаФилософияФинансыХимияЧерчениеЭкологияЭкономикаЭлектроника






Part 4 Voice input






Read the text and get ready to answer the questions to it.

Voice input, sometimes referred to as speech or voice recognition, allows you to enter data and issue commands to the computer with spoken words. It is an alternative to typing on a keyboard. The software has been developed to provide a fast method of writing onto a computer and can help people with a variety of disabilities. It is useful for people with physical disabilities who often find typing difficult, painful or impossible. Voice recognition software can also help those with spelling difficulties, including dyslexic users, because recognized words are always correctly spelled. Put simply, you talk to the computer and your words appear on the screen.

Voice recognition promises to be the easiest method for data entry, word processing and conversational computing, since speech is the easiest, most natural means of human communication. Voice input has now become technologically and economically feasible for a variety of applications. Early voice recognition products used discrete speech recognition, where you had to pause between each spoken word. New continuous speech recognition software recognizes continuous, conversationally paced speech.

Voice recognition systems analyze and classify speech or vocal tract patterns and convert them into digital codes for entry into a computer system. For a computer to decipher the signal, it must have a digital database, or vocabulary, of words or syllables, and a speedy means of comparing this datawith signals. The speech patterns are stored on the hard drive and loaded into memory when the program is run. A comparatorchecks these stored patterns against the output of the A/D converter.

In practice, the size of a voice-recognition program’s effective vocabulary is directly related to the random access memory (RAM) capacity of the computer in which it is installed. A voice-recognition program runs many times faster if the entire vocabulary can be loaded into RAM, as compared with searching the hard drive for some of the matches. Processing speed is criticalas well, because it affects how fast the computer can search the RAM for matches.

Allvoice-recognition systems or programs make errors. Screaming children, barking dogs, and loud external conversations can produce false input. Much of this can be avoided only by using the system in a quiet 100m. There is also a problem with words that sound alike but are spelled differently and have different meanings — for example, «hear» and «here.» This problem might someday be largely overcome using stored contextual information. However, this will require more RAM and faster processors than are currently available in personal computers.

Several programs are available that provide voice recognition. These systems work best on Windows 2000 and Windows XP. A number of voice recognition programs can be used with Windows, including a basic one l hat is supplied with Microsoft Office XP and 2003. Most special voice applications include a software CD, a microphone headset, a manual and a quick reference card. You connect the microphone to the computer, either into the soundcard (sockets on the back of a computer) or via a USB connection. Then you can begin talking using the following steps.

Enrolment

Everybody sounds slightly different, so the first step in using a voice recognition system involves reading an article displayed on the screen. This process, called enrolment, takes less than 10 minutes and results in a set of files being created which tell the software how you speak. The enrolment only has to be done once, after which the software can be started as needed. The new pieces of software claim that the enrolment process is even easier then in previous versions.

Dictating and Correcting

When talking, people often hesitate, mumble or slur their words. One of the key skills in using voice recognition software is learning how to talk clearly so that the computer can recognize what you are saying. This means planning what to say and then delivering speech in complete phrases or sentences. The voice recognition software will misunderstand some of the words spoken and it is necessary to proofread and then correct your mistakes. Corrections can be made by using the mouse and keyboard or by using your voice. When corrections are made the voice recognition software will adapt and learn, so that the same mistake will not occur again. Accuracy should improve with careful dictation and correction.

Editing and Formatting Text

Text can be changed (edited) very easily. The text to be changed can be selected (highlighted) by using commands like «select line», «select paragraph» and then the changes can be spoken into the computer. These will then replace the selected text.

Typically, voice recognition systems with large vocabularies require training the computer to recognize your voice in order to achieve a high degree of accuracy. Training such systems involves repeating a variety of words and phrases in a training session and using the system extensively. Trained systems regularly achieve a 95 to 99 percent word recognition rate. Training to 95 percent accuracy takes only a few hours.

Two examples of continuous speech recognition software for word processing are Naturally Speaking by Dragon Systems and Via Voice by IBM.

Dragon Naturally Speaking

This program is distributed by Nuance. NaturallySpeaking is recognised as the market leader and is the alternative most frequently recommended by AbilityNet.

IBM ViaVoice

This is also distributed by Nuance. It offers good accuracy, but is not as easy to use as NaturallySpeaking.

Qpointer Qpointer

provides good command and control facilities, but is not so good for writing tasks as it makes more recognition errors. It operates differently to Naturally Speaking and Via Voice.

Minimum requirements are a 133 MHz Pentium class microprocessor, 32 MB of RAM, an industry standard sound card, and 50 MB of available hard disk capacity. The products have 30, 000-word vocabularies expandable to 60, 000 words.

Exercises to the subject:

I. Match the words from the text with their definitions.

Recognition; feasible, discrete, accuracy, to respond, pattern

a) separate, distinct (tech.);

b) a person, thing, or form that is example to copy;

c) to do something in answer;

d) the fact of knowing someone or something;

e) able to be carried out or done; possible;

f) the quality of being accurate, exactness or correctness.

II. Identify whether the following sentences are true or false. Use the model:

1) Student A: I’ve heard that IBM ViaVoice offers good accuracy, but is not
as easy to use as NaturallySpeaking.

Student B: Yes, that is quite right.

2) Student A: All voice-recognition systems are faultless.

Student B: No, that is false, because all voice-recognition systems make errors. Such noises as screaming children, barking dogs, and loud external conversations can produce false input.

1. Early voice recognition products used conversationally paced speech recognition.

2. Training to 95 percent accuracy takes only a few hours.

3. Voice recognition devices in work situations provide faster and more accurate input.

4. The sounds of a sequence of words are like a series of sound bites.

5. Trained systems regularly achieve a 95 to 99 percent word recognition rate.

6. Voice recognition systems work the best only on Windows 2000.

III. Complete the following sentences using words and expressions given below.

1. Voice recognition allows you to enter data and issue commands to the computer with... words.

2. The goal has always been to enable computers to understand our... speech.

3. Speech is the easiest and the most natural means of human...

4. Training voice recognition systems involves... a variety of words and
phrases in a training session.

5. The speech patterns are stored on the...

Spoken; hard drive; repeating; continuous; communication

IV. Complete the row.

1. to recognize — recognition — recognizer

2. to process — processing —...

3. to convert —... — converter

4.... — comparison — comparator

5. to install —... — installer

6. produce — production —...

7. to correct —... — corrector

V. Answer the questions.

1. What does voice input allow?

2. What is the goal of voice recognition?

3. What are the sounds of a sequence of words like?

4. Why does voice recognition promise to be the easiest method for data
entry, word processing, and conversational computing?

5. What speech recognition did early voice recognition products use?

6. What must the computer have to decipher the signal?

7. What makes a voice-recognition program run many times faster?

8. What pace does new continuous speech recognition (CSR) software recognize?

9. What do voice recognition systems do?

10. Why do voice recognition systems with large vocabularies require
training the computer?

11. What does training voice recognition systems involve?

12. What are the steps you follow beginning to talk?

13. What are the examples of continuous speech recognition software for
word processing?






© 2023 :: MyLektsii.ru :: Мои Лекции
Все материалы представленные на сайте исключительно с целью ознакомления читателями и не преследуют коммерческих целей или нарушение авторских прав.
Копирование текстов разрешено только с указанием индексируемой ссылки на источник.