1      Definition

A Data Type consisting of characters, typically of a limited length.

2      Functional Requirements

  1. When the Text Word is the output of an ASR, the Text Segment may include:
    1. a Confidence Score.
    2. A Time stamp indicating the times the word starts and ends.
  2. The Confidence Score is a number comprised between 0 and 1.

3      Syntax

https://schemas.mpai.community/MMC/V2.2/data/TextWord.json

4      Semantics

Label Size Description
Header N1 Bytes Text Word Header
– Standard-TextWord 9 Bytes The characters “MMC-TXW-V”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters
MInstanceID N4 Bytes Identifier of M-Instance.
TextWordID N5 Bytes Identifier of Text Segment.
TextWordConfidenceScore N6 Bytes The Confidence of the ASR in the correctness of the Text Word.
TextWordData N7 Bytes Data of Text Word.
– String N8 Bytes String representing the Word.
– Time N9 Bytes Time indicating the start and end of the Word.
DescrMetadata N9 Bytes Descriptive Metadata