1 Definition
A Data Type consisting of Text Words separated by spaces, typically of a limited length.
2 Functional Requirements
- A Text Segment may include the Time of the start and the end of the speech segment.
- When the Text Segment is the output of an ASR Implementation, a Confidence Score may be attached to the Text Segment.
- The Confidence Score is a number between 0 and 1.
3 Syntax
https://schemas.mpai.community/MMC/V2.2/data/TextSegment.json
4 Semantics
| Label | Size | Description |
| Header | N1 Bytes | Text Segment Header |
| – Standard-TextSegment | 9 Bytes | The characters “MMC-TXS-V” |
| – Version | N2 Bytes | Major version – 1 or 2 characters |
| – Dot-separator | 1 Byte | The character “.” |
| – Subversion | N3 Bytes | Minor version – 1 or 2 characters |
| MInstanceID | N4 Bytes | Identifier of M-Instance. |
| Time | N5 Bytes | Time of the start of Text Segment. |
| TextSegmentID | N6 Bytes | Identifier of Text Segment. |
| ConfidenceScore | N7 Bytes | The Confidence of the ASR in the correctness of the Text Segment. |
| TextSegmentData[] | N8 Bytes | Data of Text Segment. |
| – TextWord | N9 Bytes | Text represented by a string. |
| DescrMetadata | N10 Bytes | Descriptive Metadata |