SPPHRASEELEMENT (Microsoft Speech Platform)

Microsoft Speech Platform SDK 11

Microsoft Speech Platform

SPPHRASEELEMENT

SPPHRASEELEMENT contains the information for a spoken word.

typedef struct SPPHRASEELEMENT
{
    ULONG             ulAudioTimeOffset;
    ULONG             ulAudioSizeTime;
    ULONG             ulAudioStreamOffset;
    ULONG             ulAudioSizeBytes;
    ULONG             ulRetainedStreamOffset;
    ULONG             ulRetainedSizeBytes;
    LPCWSTR          *pszDisplayText;
    LPCWSTR          *pszLexicalForm;
    const SPPHONEID  *pszPronunciation;
    BYTE              bDisplayAttributes;
    char              RequiredConfidence;
    char              ActualConfidence;
    BYTE              Reserved;
    float             SREngineConfidence;
} SPPHRASEELEMENT;

Members

ulAudioTimeOffset
This is the starting offset of the element in 100-nanosecond units of time relative to the start of the phrase.
ulAudioSizeTime
This is the length of the element in 100-nanosecond units of time.
ulAudioStreamOffset
This is the starting offset of the element in bytes relative to the start of the phrase in the original input stream.
ulAudioSizeBytes
This is the size of the element in bytes in the original input stream.
ulRetainedStreamOffset
This is the starting offset of the element in bytes relative to the start of the phrase in the retained audio stream
ulRetainedSizeBytes
This is the size of the element in bytes in the retained audio stream.
pszDisplayText
The display text for this element (e.g., ",").
pszLexicalForm
The lexical form of this element (e.g., "comma" for ",").
pszPronunciation
The pronunciation for this element as a null-terminated array of SPPHONEID.
bDisplayAttributes
A bit field of SPDISPLAYATTRIBUTES defining extra display information which the application should honor when displaying this word.
RequiredConfidence
The required confidence for this element (either SP_LOW_CONFIDENCE, SP_NORMAL_CONFIDENCE, or SP_HIGH_CONFIDENCE). If a word is prefixed with a '-' (minus), the RequiredConfidence is SP_LOW_CONFIDENCE, and '+' (plus) will set this field to SP_HIGH_CONFIDENCE (e.g., "This -is -a +test").
ActualConfidence
The actual confidence for this element (either SP_LOW_CONFIDENCE, SP_NORMAL_CONFIDENCE, or SP_HIGH_CONFIDENCE). This is always at least the RequiredConfidence.
Reserved
Reserved for future use.
SREngineConfidence
The confidence score computed by the SR engine. The value range is engine dependent. It can be used to optimize an application's performance with a specific engine. Using this value will improve the application with a particular speech engine but more than likely will make it worse with other engines and should be used with care. This value is more useful with speaker-independent engines because it allows a large corpus of recorded usage to correctly optimize the overall accuracy of the application.