1 Definition
A Data Type consisting of a sequence of Text Words separated by spaces; the sequence is typically of limited length.
2 Functional Requirements
- A Text Segment may include the Time of the start and the end of the speech segment.
- When the Text Segment is the output of an ASR Implementation, a Confidence Score may be attached to the Text Segment.
- The Confidence Score is a number between 0 and 1.
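The Confidence Score requirement above can be illustrated with a minimal sketch. It assumes that "between 0 and 1" includes both endpoints and that an absent score is represented as `None`; neither assumption is stated in the specification, and the function name `check_confidence_score` is hypothetical.

```python
from typing import Optional, Union

Number = Union[int, float]


def check_confidence_score(score: Optional[Number]) -> bool:
    """Return True if the score is absent or a number in [0, 1].

    Assumptions (not from the specification): the bounds are inclusive,
    and None stands for "no Confidence Score attached".
    """
    if score is None:
        # The score is optional, so its absence is acceptable.
        return True
    # bool is a subclass of int in Python; reject it explicitly.
    if isinstance(score, bool) or not isinstance(score, (int, float)):
        return False
    # NaN fails both comparisons and is therefore rejected.
    return 0.0 <= score <= 1.0
```

For example, `check_confidence_score(0.87)` is accepted, while `check_confidence_score(1.2)` and `check_confidence_score("0.5")` are rejected.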
3 Syntax
https://schemas.mpai.community/MMC/V2.4/data/TextSegment.json
4 Semantics
Label | Description |
Header | Text Segment Header |
– Standard-TextSegment | The characters “MMC-TXS-V” |
– Version | Major version – 1 or 2 characters |
– Dot-separator | The character “.” |
– Subversion | Minor version – 1 or 2 characters |
MInstanceID | Identifier of M-Instance. |
Time | Time of the start of Text Segment. |
TextSegmentID | Identifier of Text Segment. |
ConfidenceScore | The Confidence of the ASR in the correctness of the Text Segment. |
TextSegmentData[] | Data of Text Segment. |
– TextWord | Text represented by a string. |
DescrMetadata | Descriptive Metadata |
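The Header layout in the table above (the fixed string, a Major version, a dot, a Minor version) can be sketched as a pattern match. This is an illustration only: it assumes the version "characters" are decimal digits, which the table does not state, and the names `HEADER_RE` and `parse_header` are hypothetical.

```python
import re
from typing import Optional, Tuple

# Assumption: the 1 or 2 version "characters" are ASCII decimal digits.
HEADER_RE = re.compile(r"MMC-TXS-V([0-9]{1,2})\.([0-9]{1,2})")


def parse_header(header: str) -> Optional[Tuple[int, int]]:
    """Return (major, minor) if the header matches the layout, else None."""
    # fullmatch rejects any leading or trailing characters, including newlines.
    match = HEADER_RE.fullmatch(header)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))
```

Under these assumptions, `parse_header("MMC-TXS-V2.4")` yields the pair `(2, 4)`, and a string without the dot separator or with a three-character version yields `None`.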
5 Conformance Testing
A Data instance Conforms with MPAI-MMC V2.4 Text Segment (MMC-TXS) if:
- The Data validates against the Text Segment’s JSON Schema.
- All Data in the Text Segment’s JSON Schema:
  - Have the specified type.
  - Validate against their JSON Schemas.
  - Conform with their Data Qualifiers if present.
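The "have the specified type" condition can be sketched as a simple type walk over an instance. The type map below is a hypothetical stand-in chosen for illustration; the published JSON Schema referenced in the Syntax section remains the authoritative source of types, and a real conformance test would validate against it with a JSON Schema validator. The names `EXPECTED_TYPES` and `type_errors` are likewise assumptions.

```python
from typing import Any, Dict, List

# Hypothetical type expectations for a few top-level labels.
# These are NOT taken from the published JSON Schema.
EXPECTED_TYPES: Dict[str, type] = {
    "Header": str,
    "MInstanceID": str,
    "TextSegmentID": str,
    "TextSegmentData": list,
}


def type_errors(instance: Dict[str, Any]) -> List[str]:
    """List the labels whose values do not have the expected Python type."""
    errors = []
    for label, expected in EXPECTED_TYPES.items():
        if label not in instance:
            # Assumption: required-field checks are left to schema validation.
            continue
        value = instance[label]
        if not isinstance(value, expected):
            errors.append(
                f"{label}: expected {expected.__name__}, "
                f"got {type(value).__name__}"
            )
    return errors
```

An empty list means no type mismatch was found among the labels checked; it does not by itself establish conformance, since schema validation and Data Qualifier checks are not covered by this sketch.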