1     Definition 2     Functional Requirements 3     Syntax
4     Semantics 5    Conformance Testing 6     Performance Assessment

1      Definition

A Data Type consisting of characters, typically of a limited length.

2      Functional Requirements

  1. When the Text Word is the output of an ASR, the Text Segment may include:
    1. a Confidence Score.
    2. A Time stamp indicating the times the word starts and ends.
  2. The Confidence Score is a number comprised between 0 and 1.

3      Syntax

https://schemas.mpai.community/MMC/V2.3/data/TextWord.json

4      Semantics

Label Size Description
Header N1 Bytes Text Word Header
– Standard-TextWord 9 Bytes The characters “MMC-TXW-V”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters
MInstanceID N4 Bytes Identifier of M-Instance.
TextWordID N5 Bytes Identifier of Text Segment.
TextWordConfidenceScore N6 Bytes The Confidence of the ASR in the correctness of the Text Word.
TextWordData N7 Bytes Data of Text Word.
– String N8 Bytes String representing the Word.
– Time N9 Bytes Time indicating the start and end of the Word.
DescrMetadata N9 Bytes Descriptive Metadata

5     Conformance Testing

A Data instance Conforms with MPAI-MMC V2.3 Text Word (MMC-TXW) if:

  1. The Data validates against the Text Word’s JSON Schema.
  2. All Data in the  Text Word’s JSON Schema
    1. Have the specified type
    2. Validate against their JSON Schemas
    3. Conform with their Data Qualifiers if present.

6     Performance Assessment