Definition

Audio Object is a Data Type digitally representing either

  1. An object in the real world that a human can perceive with the hearing sense, or
  2. A synthetically generated Object that a human can perceive with the hearing sense when rendered.

The Format of an AudioObject is signalled by FormatID.

Syntax

https://schemas.mpai.community/CAE/V2.1/data/AudioObject.json

Semantics

Table 1 provides the way the Audio Scene Descriptors Format conveys information on an Audio Object.

Table 1 – Audio Object signalling in Audio Scene Descriptors

Label Size Description
HEADER 9 Bytes
·       Standard 7 Bytes The string CAE-ASD
·       Version 1 Byte Major version
·       Subversion 1 Byte Minor version
AOBID 16 Bytes UUID Identifier of the Audio Object.
AudioObjectData N1 Bytes Data associated to each Audio Object.
·       SamplingRate 0-3 bits 0:8, 1:16, 2:24, 3:32, 4:44.1, 5:48, 6: 64, 7: 96, 8: 192 (all kHz)
·       SampleType 4-5 bits 0:16, 1:24, 2:32, 3:64 (all bits/sample)
·       Reserved 6-7 bits
·       AudioObject N3 Bytes
◦   FormatID 1 Byte Audio Object Format Identifier
◦   ObjectLength 4 Bytes Number of Bytes in Audio Object
◦   DataInObject N4 Bytes Data of Audio Object