| 1 Definition | 2 Functional Requirements | 3 Syntax | 
| 4 Semantics | 5 Conformance Testing | 6 Performance Assessment | 
1 Definition
A Data Type including:
- Audio Data whose rendering is perceived by a hearing device or audible to a human with attached Qualifier.
 - Descriptive Data regarding Sub-Types, Formats and Attributes of the Audio Data (optionally).
 - Space-Time information.
 
2 Functional Requirements
An Audio Object includes:
- The ID of a Virtual Space (M-Instance) where it is or will be located.
 - The ID of the Audio Object.
 - The ID(s) of the Parent Object(s) supporting two cases:
- The Parent Object has spawned two (or more) Objects.
 - Two (or more) Parent Objects have merged into one.
 
 - The Space-Time information of all Parent Objects in the M-Instance.
 - The Space-Time information of the Visual Data in an M-Instance.
 - The Visual Data Qualifier.
 - The Audio Data Annotations, including:
- Annotation
 - Annotation Space-Time
 - Process Action IDs
 
 - The Audio Object-specific Data:
- Visual Data Qualifier.
 - Visual Data Annotation.
 - Visual Data length in Bytes.
 - Visual Data URI.
 
 
3 Syntax
https://schemas.mpai.community/CAE1/V2.3/data/AudioObject.json
4 Semantics
| Label | Size | Description | 
| Header | N1 Bytes | Audio Object Header | 
| – Standard-AudioObject | 9 Bytes | The characters “CAE-AUO-V” | 
| – Version | N2 Bytes | Major version – 1 or 2 characters | 
| – Dot-separator | 1 Byte | The character “.” | 
| – Subversion | N3 Bytes | Minor version – 1 or 2 characters | 
| MInstanceID | N4 Bytes | Identifier of M-Instance. | 
| AudioObjectID | N5 Bytes | Identifier of the Audio Object. | 
| ParentAudioObjects[] | N6 Bytes | Identifier(s) of Parent Audio Objects. | 
| – ParentAudioObjectID | N7 Bytes | ID of a Parent Audio Object | 
| – ParentAudioDataSpaceTime | N8 Bytes | Space Time info of Parent Audio Data | 
| AudioDataSpaceTime | N9 Bytes | Space-Time info of Audio Data. | 
| AudioDataQualifier | N10 Bytes | Audio Data Qualifier. | 
| SpeechDataAnnotations[] | N11 Bytes | Annotations of Speech Data | 
| – Annotation | N12 Bytes | ID of Annotation | 
| – AnnotationSpaceTime | N13 Bytes | Where/when Annotation is attached. | 
| – ProcessActionIDs | N14 Bytes | What is possible to do with the Annotation | 
| AudioDataLength | N15 Bytes | Number of Bytes of Audio Data | 
| AudioDataURI | N16 Bytes | URI of Data of Audio Data | 
| DescrMetadata | N17 Bytes | Descriptive Metadata | 
5 Conformance Testing
A Data instance Conforms with CAE-USC V2.3 Audio Object (CAE-AUO) if:
- JSON Data validate against the Audio Object’s JSON Schema.
 - All Data in the Audio Object’s JSON Schema
- Have the specified types.
 - Validate against their JSON Schemas.
 - Conform with their Data Qualifiers if present.