Table of contents
- Welcome to Microsoft Speech SDK
- Getting Started
- System Requirements
- End User License Agreement
- About This SDK
- SAPI 5 Introduction
-
Application-Level Interfaces
- Audio Interfaces
- Eventing Interfaces
- Grammar Compiler Interfaces
- Lexicon Interfaces
- Resource Interfaces
-
Speech Recognition Interfaces
- ISpRecoContext
-
ISpRecoGrammar
- ::GetGrammarId
- ::GetRecoContext
- ::LoadCmdFromFile
- ::LoadCmdFromObject
- ::LoadCmdFromResource
- ::LoadCmdFromMemory
- ::LoadCmdFromProprietaryGrammar
- ::SetRuleState
- ::SetRuleIdState
- ::LoadDictation
- ::UnloadDictation
- ::SetDictationState
- ::SetWordSequenceData
- ::SetTextSelection
- ::IsPronounceable
- ::SetGrammarState
- ::SaveCmd
- ::GetGrammarState
- ISpRecoResult
- ISpRecognizer
- ISpPhrase
- ISpPhraseAlt
- ISpProperties
-
Text-to-Speech Interfaces
- Overview
- TTS Engine Characteristics
- Text synthesis
-
ISpVoice
- ::SetOutput
- ::GetOutputObjectToken
- ::GetOutputStream
- ::Pause
- ::Resume
- ::SetVoice
- ::GetVoice
- ::Speak
- ::SpeakStream
- ::GetStatus
- ::Skip
- ::SetPriority
- ::GetPriority
- ::SetAlertBoundary
- ::GetAlertBoundary
- ::SetRate
- ::GetRate
- ::SetVolume
- ::GetVolume
- ::WaitUntilDone
- ::SetSyncSpeakTimeout
- ::GetSyncSpeakTimeout
- ::SpeakCompleteEvent
- ::IsUISupported
- ::DisplayUI
-
Engine-Level Interfaces
- Grammar Compiler Interfaces
- Resource Interfaces
- Speech Recognition Interfaces
-
Speech Recognition Engine Interfaces
- ISpPrivateEngineCall
-
ISpSREngine
- ::SetSite
- ::GetInputAudioFormat
- ::RecognizeStream
- ::SetRecoProfile
- ::OnCreateGrammar
- ::OnDeleteGrammar
- ::LoadProprietaryGrammar
- ::UnloadProprietaryGrammar
- ::SetProprietaryRuleState
- ::SetProprietaryRuleIdState
- ::LoadSLM
- ::UnloadSLM
- ::SetSLMState
- ::SetWordSequenceData
- ::SetTextSelection
- ::IsPronounceable
- ::OnCreateRecoContext
- ::OnDeleteRecoContext
- ::PrivateCall
- ::SetAdaptationData
- ::SetPropertyNum
- ::GetPropertyNum
- ::SetPropertyString
- ::GetPropertyString
- ::SetGrammarState
- ::WordNotify
- ::RuleNotify
- ::PrivateCallEx
- ::SetContextState
-
ISpSREngineSite
- ::Read
- ::DataAvailable
- ::SetBufferNotifySize
- ::ParseFromTransitions
- ::Recognition
- ::AddEvent
- ::Synchronize
- ::GetWordInfo
- ::SetWordClientContext
- ::GetRuleInfo
- ::SetRuleClientContext
- ::GetStateInfo
- ::GetResource
- ::GetTransitionProperty
- ::IsAlternate
- ::GetMaxAlternates
- ::GetContextMaxAlternates
- ::UpdateRecoPos
- ISpSRAlternates
- Text-to-Speech Engine Interface
-
Structures
- SPAUDIOBUFFERINFO
- SPAUDIOSTATUS
- SPBINARYGRAMMAR
- SPEVENT
- SPEVENTSOURCEINFO
- SPPARSEINFO
- SPPATHENTRY
- SPPHRASE
- SPPHRASEALT
- SPPHRASEALTREQUEST
- SPPHRASEELEMENT
- SPPHRASEPROPERTY
- SPPHRASEREPLACEMENT
- SPPHRASERULE
- SPPROPERTYINFO
- SPRECOCONTEXTSTATUS
- SPRECOGNIZERSTATUS
- SPRECORESULTINFO
- SPRECORESULTTIMES
- SPRULEENTRY
- SPSERIALIZEDEVENT
- SPSERIALIZEDEVENT64
- SPSERIALIZEDPHRASE
- SPSERIALIZEDRESULT
- SPSTATEINFO
- SPTEXTSELECTIONINFO
- SPTMTHREADINFO
- SPTRANSITIONENTRY
- SPTRANSITIONPROPERTY
- SPVCONTEXT
- SPVOICESTATUS
- SPVPITCH
- SPVSTATE
- SPVTEXTFRAG
- SPWORD
- SPWORDENTRY
- SPWORDLIST
- SPWORDPRONUNCIATION
- SPWORDPRONUNCIATIONLIST
- WAVEFORMATEX
-
Enumerations
- SPAUDIOOPTIONS
- SPAUDIOSTATE
- SPBOOKMARKOPTIONS
- SPCFGNOTIFY
- SPCFGRULEATTRIBUTES
- SPCONTEXTSTATE
- SPDATAKEYLOCATION
- SPDISPLAYATTRIBUTES
- SPEAKFLAGS
- SPENDSRSTREAMFLAGS
- SPEVENTENUM
- SPEVENTLPARAMTYPE
- SPFILEMODE
- SPGRAMMARSTATE
- SPGRAMMARWORDTYPE
- SPINTERFERENCE
- SPLEXICONTYPE
- SPLOADOPTIONS
- SPPARTOFSPEECH
- SPPHRASERNG
- SPPROPSRC
- SPRECOEVENTFLAGS
- SPRECOSTATE
- SPRESULTTYPE
- SPRULEINFOOPT
- SPRULESTATE
- SPRUNSTATE
- SPSTREAMFORMAT
- SPTRANSITIONTYPE
- SPVACTIONS
- SPVALUETYPE
- SPVESACTIONS
- SPVFEATURE
- SPVISEMES
- SPVLIMITS
- SPVPRIORITY
- SPVSKIPTYPE
- SPWAVEFORMATTYPE
- SPWORDINFOOPT
- SPWORDPRONOUNCEABLE
- SPWORDTYPE
-
Helper Functions
- CSpStreamFormat Class
- CSpDynamicString Class
-
CSpEvent
- Constructor
- Destructor
- AddrOf
- BookmarkName
- Clear
- CopyFrom
- CopyTo
- Deserialize
- Detach
- EndStreamResult
- GetFrom
- InputSentLen
- InputSentPos
- InputStreamReleased
- InputWordLen
- InputWordPos
- Interference
- IsEmulated
- IsPaused
- Object
- ObjectToken
- PersistVoiceChange
- Phoneme
- PropertyName
- PropertyNumValue
- PropertyStringValue
- RecoResult
- RecoState
- RequestTypeOfUI
- Serialize
- SerializeSize
- String
- Viseme
- VoiceToken
-
UI Helper Functions
- SpAddTokenToComboBox
- SpAddTokenToListBox
- SpDeleteCurSelComboBoxToken
- SpDeleteCurSelListBoxToken
- SpDestroyTokenComboBox
- SpDestroyTokenListBox
- SpGetComboBoxToken
- SpGetCurSelComboBoxToken
- SpGetCurSelListBoxToken
- SpGetListBoxToken
- SpInitTokenComboBox
- SpInitTokenListBox
- SpUpdateCurSelComboBoxToken
- SpUpdateCurSelListBoxToken
- SpCreateBestObject
- SPCreateDefaultObjectFromCategoryID
- SpCreateNewToken (by Category ID)
- SpCreateNewToken (by Token ID)
- SpCreateNewTokenEx (by Category ID)
- SpCreateNewTokenEx (by Token ID)
- SpCreateObjectFromSubToken
- SpCreateObjectFromToken
- SpCreateObjectFromTokenId
- SpCreatePhoneConverter
- SpEnumTokens
- SpFindBestToken
- SpGetCategoryFromId
- SpGetDefaultTokenFromCategoryId
- SpGetDefaultTokenIdFromCategoryId
- SpGetDescription
- SpGetSubTokenFromToken
- SpGetTokenFromId
- SpGetUserDefaultUILanguage
- SpSetCommonTokenData.
- SpSetDefaultTokenForCategoryId
- SpSetDefaultTokenIdForCategoryId
- SPFEI
- SPBindToFile
- SpClearEvent
- SpConvertStreamFormatEnum
- SpEventSerializeSize
- SpGetDescription
- SpInitEvent
- SpSetDescription
-
SAPI Object Classes
-
SAPI Application Object Classes
- SpNotifyTranslator
- SpObjectTokenCategory
- SpObjectTokenEnum
- SpObjectToken
- SpDataKey
- SpResourceManager
- SpStreamFormatConverter
- SpMMAudioEnum
- SpMMAudioIn
- SpMMAudioOut
- SpRecPlayAudio
- SpStream
- SpVoice
- SpSharedRecognizer
- SpInprocRecognizer
- SpRecoContext
- SpSharedRecoContext
- SpRecoGrammar
- SpRecoResult
- SpPhraseAlt
- SpLexicon
- SpUnCompressedLexicon
- SpCompressedLexicon
- SpPhoneConverter
- SAPI DDK Object Classes
-
SAPI Application Object Classes
- Error Codes
- Miscellanea
- SDK Samples, Tools, and Tutorials
-
White Papers
- SAPI 5.0 SR Properties White Paper
- TTS Engine Vendor Porting Guide White Paper
- SR Engine Vendor Porting Guide White Paper
- Object Tokens and Registry Settings White Paper
- VendorPreferred Attribute
- Simple TTS Applications
- SAPI 5.1 64-bit Issues
- Speech Telephony Application Guide
- Using Sample Audio Object (SpAudioPlug)
- Audio Object
- Compliance Tests White Paper
- Microsoft Speech SDK Setup 5.1
- XML Schema : Grammar
- XML Schema : SAPI
- XML TTS Tutorial
- Text Normalization
- Using MFC to Automate SAPI
- Persisting Recognized Wave Audio from the Speech Recognition Engine
- Using Wave File Input with the Speech Recognition Engine
-
Automation
- Automation Overview
- Objects And Classes Overview
- Events Overview
- Using the Code Examples
- Programming Notes For Visual Basic
- Sample Recognition Application Code (Shared)
- Sample C&C Recognition Application Code
- Sample Recognition Application Code (Inproc)
- Sample DLL Code
-
Enumerations
- SpeechAudioFormatType
- SpeechAudioState
- SpeechBookmarkOptions
- SpeechDataKeyLocation
- SpeechDiscardType
- SpeechDisplayAttributes
- SpeechEngineConfidence
- SpeechFormatType
- SpeechGrammarRuleStateTransitionType
- SpeechGrammarState
- SpeechGrammarWordType
- SpeechInterference
- SpeechLexiconType
- SpeechLoadOption
- SpeechPartOfSpeech
- SpeechRecoContextState
- SpeechRecoEvents
- SpeechRecognitionType
- SpeechRecognizerState
- SpeechRetainedAudioOptions
- SpeechRuleAttributes
- SpeechRuleState
- SpeechRunState
- SpeechSpecialTransitionType
- SpeechStreamFileMode
- SpeechStreamSeekPositionType
- SpeechTokenContext
- SpeechTokenShellFolder
- SpeechVisemeFeature
- SpeechVisemeType
- SpeechVoiceEvents
- SpeechVoicePriority
- SpeechVoiceSpeakFlags
- SpeechWordPronounceable
- SpeechWordType
- ISpeechAudio
- ISpeechAudioBufferInfo
- ISpeechAudioStatus
- ISpeechBaseStream
- ISpeechDataKey
- ISpeechGrammarRule
- ISpeechGrammarRules
- ISpeechGrammarRuleState
- ISpeechGrammarRuleStateTransition
- ISpeechGrammarRuleStateTransitions
- ISpeechLexiconPronunciation
- ISpeechLexiconPronunciations
- ISpeechLexiconWord
- ISpeechLexiconWords
- ISpeechObjectTokens
- ISpeechPhraseAlternate
- ISpeechPhraseAlternates
- ISpeechPhraseElement
- ISpeechPhraseElements
- ISpeechPhraseInfo
- ISpeechPhraseProperties
- ISpeechPhraseProperty
- ISpeechPhraseReplacement
- ISpeechPhraseReplacements
- ISpeechPhraseRule
- ISpeechPhraseRules
- ISpeechRecognizerStatus
- ISpeechRecoGrammar
- ISpeechRecoResult
- ISpeechRecoResultTimes
- ISpeechVoiceStatus
- SpAudioFormat
- SpCustomStream
- SpFileStream
- SpInProcRecoContext
-
SpInProcRecoContext (Events)
- Adaptation Event
- AudioLevel Event
- Bookmark Event
- EndStream Event
- EnginePrivate Event
- FalseRecognition Event
- Hypothesis Event
- Interference Event
- PhraseStart Event
- PropertyNumberChange Event
- PropertyStringChange Event
- Recognition Event
- RecognitionForOtherContext Event
- RecognizerStateChange Event
- RequestUI Event
- SoundEnd Event
- SoundStart Event
- StartStream Event
- SpInProcRecognizer
- SpLexicon
- SpMemoryStream
- SpMMAudioIn
- SpMMAudioOut
- SpObjectToken
- SpObjectTokenCategory
- SpPhoneConverter
- SpPhraseInfoBuilder
- SpSharedRecoContext
-
SpSharedRecoContext (Events)
- Adaptation Event
- AudioLevel Event
- Bookmark Event
- EndStream Event
- EnginePrivate Event
- FalseRecognition Event
- Hypothesis Event
- Interference Event
- PhraseStart Event
- PropertyNumberChange Event
- PropertyStringChange Event
- Recognition Event
- RecognitionForOtherContext Event
- RecognizerStateChange Event
- RequestUI Event
- SoundEnd Event
- SoundStart Event
- StartStream Event
- SpSharedRecognizer
- SpTextSelectionInformation
- SpUnCompressedLexicon
- SpVoice
- SpVoice (Events)
- SpWaveFormatEx