Differences between revisions 15 and 16
Revision 15 as of 2011-07-04 20:30:01
Size: 5839
Editor: shoobe01
Comment:
Revision 16 as of 2011-07-25 22:21:35
Size: 6742
Editor: shoobe01
Comment:

Problem

You must allow certain classes of users, or any user in certain contexts, to consume content without reading the screen.

Practically all mobile devices have audio output of some sort, and it can be accessed by almost every application or website. There can be strict limits, such as devices which only output over headsets, or those which only send phone call audio over Bluetooth, that can limit the use of some tones.

Solution

Mobile devices must be able to read text displayed on the screen for the user, so it can be accessed and understood by those who cannot use the screen.

Because mobile devices are used in so many different contexts, there are numerous instances in which the user may be unable to, not be allowed to, or may choose not to read the screen.

The user may well choose to use Voice Readback to allow themselves to use their hands and eyes for other purposes. While working, or performing activities which do not themselves impose excessive cognitive load, such as most driving, people use radio and other audio output to gather information or for entertainment. Video is generally more entertaining, but is totally unsuitable for these situations.

Voice Readback can form an integral part of a complete voice UI for mobile devices. Readback is used to prompt for commands, and then confirms the user input, or declares how the system has interpreted the command. It will also read on-screen displays and options, to allow the user to select appropriate items without looking at the screen.

Variations

Voice Readback always works in broadly the same way, but what is being read varies:

  • Universal - The entire interface is read, to allow the device to be used without any view of the display. This is usually combined with the Voice Input pattern to create a complete voice UI, as an alternative to the conventional button (or touch) and screen UI native to the device. Even if used for only one section, action, or phrase, this same method is used for any readback of voice commands.

  • Elemental - An entire document, such as a PDF, email, or web page, is read until the user cancels the action or the end of the document is reached.

  • Selected - A selection the user has specified within any context -- for example by highlighting text in a web page -- is read in its entirety.
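
As a rough illustration of how the three variations differ only in what is handed to the speech engine, here is a minimal Python sketch. The `Variation` enum, the `text_to_read` function, and its parameters are hypothetical names invented for this example; they do not correspond to any platform API.

```python
from enum import Enum


class Variation(Enum):
    UNIVERSAL = "universal"   # the entire interface is read
    ELEMENTAL = "elemental"   # an entire document is read
    SELECTED = "selected"     # only the user's selection is read


def text_to_read(variation, interface_labels, document_text, selection):
    """Return the list of strings to pass to a speech engine (hypothetical helper)."""
    if variation is Variation.UNIVERSAL:
        # Every label on screen, in display order.
        return list(interface_labels)
    if variation is Variation.ELEMENTAL:
        # The whole document, regardless of what is currently visible.
        return [document_text]
    if variation is Variation.SELECTED:
        # Only the highlighted text; nothing is read without a selection.
        return [selection] if selection else []
    raise ValueError(f"unknown variation: {variation!r}")
```

The readback mechanism itself stays the same in every case; only the source of the text changes.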

Voice output that is presented based on conditions, such as position or time, is discussed under the Voice Notifications pattern.

Selections made by the user can be read. The play control is usually contextual and related to the selection, such as the Pop-Up menu shown above.

Interaction Details

Voice Readback can be turned on as a setting for the entire OS, or on a per-application basis. It will then be used automatically, whenever a change in the application is initiated.

Other input methods, such as keypads and Directional Controls, will still function. Buttons will generally be needed to unlock or refresh a screen so that the current condition is read aloud.

Readback can also be initiated, for the Elemental and Selected variations, within the application or as a contextual control such as a Pop-Up, menu, Annotation or other control.

Readback for single-use cases of UI control is the result of Voice Input. Initiation is discussed under that pattern.

When entire documents, long passages, or even just marquee text are being read, the document should scroll so that the text currently being read is always visible in the viewport. A cursor or highlight -- as shown above -- should be displayed to correspond to the word currently being read. An indicator of audio playback should be on the screen at all times, and a control should be provided to immediately mute or pause the audio.
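
The scrolling and highlighting behavior can be sketched in a few lines of Python. This is only an illustration under simple assumptions: text is laid out in lines with known pixel positions, and the functions `word_spans` and `scroll_to_keep_visible` are hypothetical helpers, not part of any toolkit.

```python
import re


def word_spans(text):
    """Return (start, end) character offsets of each word, for highlighting."""
    return [(match.start(), match.end()) for match in re.finditer(r"\S+", text)]


def scroll_to_keep_visible(line_top, line_height, viewport_top, viewport_height):
    """Return the new viewport top so the line being read is fully visible."""
    line_bottom = line_top + line_height
    if line_top < viewport_top:
        # The line is above the viewport: scroll up to it.
        return line_top
    if line_bottom > viewport_top + viewport_height:
        # The line is below the viewport: scroll down just far enough.
        return line_bottom - viewport_height
    # Already visible: do not move the content.
    return viewport_top
```

As the speech engine reports progress, the caller would highlight the current span and apply the returned offset only when it differs from the current one, so the display does not jump unnecessarily.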

Presentation Details

Audio should be played by default through the external speaker, or speakerphone. The last set in-call volume (or equivalent playback volume for non-phones) should be used. Whenever possible, detect the ambient noise level and adjust the volume accordingly, in order to make it audible.

When a headset is attached (either physically or by a link such as Bluetooth), the playback should default to this device, and use the last set in-call volume for this device.
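
The routing and volume rules above might be expressed as follows. This is a sketch, not a platform implementation: `AudioState`, `choose_output`, the decibel thresholds, and the maximum boost are all assumed values chosen for illustration, and the ambient adjustment is applied only to the speaker as one possible reading of the guidance.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class AudioState:
    headset_attached: bool
    last_speaker_volume: float                # 0.0 to 1.0, last set in-call volume
    last_headset_volume: float                # 0.0 to 1.0, last set for the headset
    ambient_noise_db: Optional[float] = None  # None when it cannot be measured


def choose_output(state, quiet_db=40.0, loud_db=80.0, max_boost=0.3):
    """Return (route, volume) for readback. Thresholds are illustrative assumptions."""
    if state.headset_attached:
        # An attached headset takes priority and keeps its own last volume.
        return "headset", state.last_headset_volume
    volume = state.last_speaker_volume
    if state.ambient_noise_db is not None:
        # Scale the boost linearly between the quiet and loud thresholds.
        fraction = (state.ambient_noise_db - quiet_db) / (loud_db - quiet_db)
        fraction = min(max(fraction, 0.0), 1.0)
        volume = min(1.0, volume + max_boost * fraction)
    return "speaker", volume
```

When no noise measurement is available, the last set volume is used unchanged, which matches the default described above.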

Content read must be identical to that printed on the screen. The condition that resulted in the user employing Voice Readback may be temporary and transient. Allow the user to switch between the screen and audio channels. They may even wish to read along with the voice output. Even for users with a vision deficit, others may be accompanying them, who may also wish to use the device.

Also be sure that content scrolls as the audio progresses, so that the item being read back is always in the viewport.

There may be delays between phrases, or before the start of the audio readback. To inform the user that audio is about to commence, and to prepare them for the volume level, play a subtle tone immediately beforehand.

Use a similar tone when Voice Readback has stopped for a significant time, or to communicate a selected volume setting, to confirm this condition to the user.
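
One way to think about where the tones fall is as a playback plan built before any audio is produced. The Python sketch below assumes phrases arrive with a known silence after each one; `playback_plan`, the event names, and the three-second threshold for a "significant" stop are hypothetical choices for this example.

```python
def playback_plan(phrases, gaps, long_gap=3.0):
    """Build an ordered list of audio events with start and stop tones.

    phrases: strings to speak; gaps: seconds of silence after each phrase.
    long_gap: silence length treated as a significant stop (an assumption).
    """
    if len(phrases) != len(gaps):
        raise ValueError("each phrase needs a matching gap")
    plan = []
    tone_due = True  # a start tone is owed before the next speech
    for index, (phrase, gap) in enumerate(zip(phrases, gaps)):
        if tone_due:
            plan.append(("tone", "start"))
            tone_due = False
        plan.append(("speak", phrase))
        is_last = index == len(phrases) - 1
        if is_last or gap >= long_gap:
            # Confirm to the user that readback has stopped.
            plan.append(("tone", "stop"))
            tone_due = True
        if not is_last and gap > 0:
            plan.append(("silence", gap))
    return plan
```

Short pauses between phrases pass without any tone, while a long pause is bracketed by a stop tone and a fresh start tone so the user is never left guessing.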

Antipatterns

Avoid mixing readback of commands and text. When the two must be used together, use delays, tones, changes in voice and clear syntax (such as "You said...") to indicate the difference.
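
When command readback and content readback cannot be kept apart, the separation cues might be assembled like this. The sketch is illustrative only: `build_segments`, the `"system"` and `"reader"` voice labels, and the half-second pause are assumptions, not names from any speech library.

```python
def build_segments(items, pause_seconds=0.5):
    """Turn (kind, text) pairs into audio segments with clear separation cues.

    kind is "command" (echo of user input) or "content" (text from the screen).
    """
    segments = []
    previous_kind = None
    for kind, text in items:
        if kind not in ("command", "content"):
            raise ValueError(f"unknown kind: {kind!r}")
        if previous_kind is not None and kind != previous_kind:
            # Mark the switch between commands and content with a delay and a tone.
            segments.append({"type": "pause", "seconds": pause_seconds})
            segments.append({"type": "tone"})
        if kind == "command":
            # Clear syntax plus a distinct voice for command confirmation.
            segments.append({"type": "speech", "voice": "system", "text": f"You said: {text}"})
        else:
            segments.append({"type": "speech", "voice": "reader", "text": text})
        previous_kind = kind
    return segments
```

Consecutive items of the same kind are read without interruption; the cues appear only at the boundary between a command and content.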

The voice you select must be as understandable as possible. Text-to-voice translation of names, especially, can be difficult to understand or improperly pronounced. If quality is too low with the available hardware and software, do not implement the solution.

Keep in mind that users may be wearing headsets. Some headsets will not accept all output, so if your application relies on Voice Readback, make sure the targeted devices support your application or service sending audio to all attached headsets or other audio devices.


Discuss & Add

Please do not change content above this line, as it's a perfect match with the printed book. Everything else you want to add goes down here.

Examples

If you want to add examples (and we occasionally do also) add them here.

Make a new section

Just like this. If, for example, you want to argue about the differences between, say, Tidwell's Vertical Stack, and our general concept of the List, then add a section to discuss. If we're successful, we'll get to make a new edition and will take all these discussions into account.

Voice Readback (last edited 2011-07-31 23:53:54 by shoobe01)