CoffeeS4

Microsoft Speech SDK

The Microsoft.com Speech website Microsoft Speech SDK SAPI 5.1

CoffeeS4

Introduction

CoffeeS4 is the fifth sample application in a tutorial series named Coffee. It uses a consistent coffee shop motif. Customers enter the shop, go to the service counter, speak to order drinks or to enter the front office.

The samples are intended to demonstrate speech recognition capabilities within an application. They are designed for the application-level (API) programmer and for those not familiar with speech technology. Each sample will progressively add new features and increase in complexity. The tutorial chapters explain in detail particulars of the code. You are encouraged to read each chapter. Writing engines such as speech recognition or text-to-speech, also called device driver programming, will be covered separately. The samples can use engines provided by the SAPI SDK or third party SAPI-compliant engines.

Using CoffeeS4

CoffeeS4 introduces the concepts of resources and resource management. SAPI stores information in the form of tokens. These tokens are used later to instantiate features such as voices and recognizers. However, programmers can query SAPI for the presence of tokens to learn more about available features. For example, each available voice is kept as a token.

CoffeeS4 displays available voices and may even speak using the currently active voice. Three new commands are used. To do so, enter the office by saying, “go to the office” or “enter office.” Once there, display the voice list by saying, “manage the employees.” A list of available voices will display on the right side of the screen. The active voice will be indicated in red.

To have the employee speak, say, “hear them speak.” The statement “I will be the best employee you've ever had. Let me work.” will be spoken in the current voice. The voice may be changed using Speech properties in Control Panel.

New Commands List

Choosing one word from each line of a category forms the command. Commands in parenthesis are optional and do need to be included. Words or phrases separated by slashes indicate any of the listed choices may be used although only one may be selected. Sections marked RULEREF indicate words or phrases may be chosen from the corresponding rule ID. Rule names are the same as listed in the corresponding XML configuration file.

 

XML rule ID: VID_Manage

  • (please)
  • manage
  • (the)
  • RULEREF: VID_ThingsToManage
 

XML rule ID: VID_ThingsToManage

  • employees
 

XML rule ID: VID_HearTheVoice

  • hear them speak