grapheme Element (Microsoft.Speech)

Microsoft Speech Platform SDK 11


Contains the written representation of a word or a short phrase.

Syntax

<grapheme> string </grapheme>

Attributes

None.

Remarks

The grapheme element must not be empty: it must contain at least one character, which can be a letter, a digit, or another character.

There may be alternative textual representations for the same word or phrase, for example:

  • Regional spelling variations, such as "colour" and "color".

  • Free spelling variations, such as "judgment" and "judgement".

  • Alternate writing systems. For example, Japanese writes a word or phrase with a mixture of Han ideographs (Kanji) and phonemic scripts (Katakana or Hiragana), and that mixture can itself vary, as in the kana suffixes that follow kanji stems (Okurigana): "okonau" can be written 行なう or 行う.

  • Reformed spellings, for example in German some of the words which once were written with "ß" are now written with "ss".

To accommodate the variations described above, the lexeme element may contain more than one grapheme element.
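
As an illustration of the alternate-writing-system case, the following sketch lists both okurigana variants of "okonau" in a single lexeme. The xml:lang value "ja-JP" and the IPA transcription are assumptions made for this illustration (the transcription is approximate), and whether a given engine loads the lexicon depends on the languages it has installed.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<lexicon version="1.0"
      xmlns="http://www.w3.org/2005/01/pronunciation-lexicon"
      alphabet="ipa" xml:lang="ja-JP">
  <!-- Illustrative entry: two okurigana variants of the same word -->
  <lexeme>
    <grapheme>行なう</grapheme>
    <grapheme>行う</grapheme>
    <!-- Approximate IPA for "okonau", given as an assumption -->
    <phoneme>okonaɯ</phoneme>
  </lexeme>
</lexicon>
```

Both written forms sit in the same lexeme, so they resolve to the one phoneme entry instead of needing a separate lexeme for each spelling.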

Example

The following example uses two grapheme elements in a single lexeme to cover two spellings of a word that share the same pronunciation.

<?xml version="1.0" encoding="UTF-8"?>
<lexicon version="1.0" 
      xmlns="http://www.w3.org/2005/01/pronunciation-lexicon"
      xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
      xsi:schemaLocation="http://www.w3.org/2005/01/pronunciation-lexicon 
        http://www.w3.org/TR/2007/CR-pronunciation-lexicon-20071212/pls.xsd"
      alphabet="ipa" xml:lang="en-US">
  <!-- English entry showing how alternative spellings are handled -->

  <lexeme>
    <grapheme> colour </grapheme>
    <grapheme> color </grapheme>
    <phoneme> ˈkʌlər </phoneme>
  </lexeme>

</lexicon>