SPPHRASEELEMENT (Microsoft Speech Platform)

Microsoft Speech Platform SDK 11

previous page next page

Microsoft Speech Platform

SPPHRASEELEMENT

SPPHRASEELEMENT contains the information for a spoken word.

typedef struct SPPHRASEELEMENT
{
    ULONG             ulAudioTimeOffset;
    ULONG             ulAudioSizeTime;
    ULONG             ulAudioStreamOffset;
    ULONG             ulAudioSizeBytes;
    ULONG             ulRetainedStreamOffset;
    ULONG             ulRetainedSizeBytes;
    LPCWSTR          *pszDisplayText;
    LPCWSTR          *pszLexicalForm;
    const SPPHONEID  *pszPronunciation;
    BYTE              bDisplayAttributes;
    char              RequiredConfidence;
    char              ActualConfidence;
    BYTE              Reserved;
    float             SREngineConfidence;
} SPPHRASEELEMENT;

Members

ulAudioTimeOffset: This is the starting offset of the element in 100-nanosecond units of time relative to the start of the phrase.
ulAudioSizeTime: This is the length of the element in 100-nanosecond units of time.
ulAudioStreamOffset: This is the starting offset of the element in bytes relative to the start of the phrase in the original input stream.
ulAudioSizeBytes: This is the size of the element in bytes in the original input stream.
ulRetainedStreamOffset: This is the starting offset of the element in bytes relative to the start of the phrase in the retained audio stream
ulRetainedSizeBytes: This is the size of the element in bytes in the retained audio stream.
pszDisplayText: The display text for this element (e.g., ",").
pszLexicalForm: The lexical form of this element (e.g., "comma" for ",").
pszPronunciation: The pronunciation for this element as a null-terminated array of SPPHONEID.
bDisplayAttributes: A bit field of SPDISPLAYATTRIBUTES defining extra display information which the application should honor when displaying this word.
RequiredConfidence: The required confidence for this element (either SP_LOW_CONFIDENCE, SP_NORMAL_CONFIDENCE, or SP_HIGH_CONFIDENCE). If a word is prefixed with a '-' (minus), the RequiredConfidence is SP_LOW_CONFIDENCE, and '+' (plus) will set this field to SP_HIGH_CONFIDENCE (e.g., "This -is -a +test").
ActualConfidence: The actual confidence for this element (either SP_LOW_CONFIDENCE, SP_NORMAL_CONFIDENCE, or SP_HIGH_CONFIDENCE). This is always at least the RequiredConfidence.
Reserved: Reserved for future use.
SREngineConfidence: The confidence score computed by the SR engine. The value range is engine dependent. It can be used to optimize an application's performance with a specific engine. Using this value will improve the application with a particular speech engine but more than likely will make it worse with other engines and should be used with care. This value is more useful with speaker-independent engines because it allows a large corpus of recorded usage to correctly optimize the overall accuracy of the application.

previous page start next page