1     Version

V2.1

2     Functions

Audio Object Identification:

  1. Receives:
    1. Audio Scene Geometry.
    2. Audio Objects.
  2. Produces an Audio Instance ID identifying an Audio Object in the Scene as belonging to a level in a taxonomy.
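
The data flow listed above can be illustrated with a minimal Python sketch. It is not part of the specification: the class names, field names, the `identify_audio_objects` function and the toy classifier are all hypothetical, and the actual data formats are those defined by the JSON Metadata referenced in this document. The sketch only shows the shape of the function: Audio Scene Geometry and Audio Objects in, one Audio Instance ID per identified Audio Object out.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple


@dataclass(frozen=True)
class AudioObject:
    # Hypothetical fields; the real Audio Object format is defined by MPAI.
    object_id: str
    samples: Tuple[float, ...]


@dataclass(frozen=True)
class AudioSceneGeometry:
    # Hypothetical: position of each Audio Object in the Scene, keyed by object ID.
    positions: Dict[str, Tuple[float, float, float]]


# Hypothetical classifier type: maps an Audio Object to an Audio Instance ID,
# written here as a path into an assumed taxonomy.
Classifier = Callable[[AudioObject], str]


def toy_classifier(obj: AudioObject) -> str:
    """Toy stand-in for a real model: label by mean absolute amplitude."""
    if not obj.samples:
        return "sound/unknown"
    level = sum(abs(s) for s in obj.samples) / len(obj.samples)
    return "sound/loud" if level > 0.5 else "sound/quiet"


def identify_audio_objects(
    geometry: AudioSceneGeometry,
    objects: Sequence[AudioObject],
    classify: Classifier,
) -> Dict[str, str]:
    """Return an Audio Instance ID for each Audio Object present in the Scene."""
    instance_ids: Dict[str, str] = {}
    for obj in objects:
        # Assumption: only objects placed in the Scene Geometry are identified.
        if obj.object_id not in geometry.positions:
            continue
        instance_ids[obj.object_id] = classify(obj)
    return instance_ids


geometry = AudioSceneGeometry(positions={"a": (0.0, 0.0, 0.0)})
objects = [AudioObject("a", (0.9, -0.8)), AudioObject("b", (0.1,))]
result = identify_audio_objects(geometry, objects, toy_classifier)
```

In this sketch, object `b` is skipped because it has no entry in the Scene Geometry. A real implementation would replace `toy_classifier` with a model that maps each Audio Object to a level in the taxonomy actually in use.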

3      Reference Architecture

Figure 1 depicts the Reference Architecture of the Audio Object Identification AIM.

Figure 1 – Audio Object Identification AIM

Note that the Audio Object Identification AIM can parse either an AV Scene Geometry or its Audio Scene Geometry subset.

4      I/O Data

Table 1 specifies the Input and Output Data of the Audio Object Identification AIM.

Table 1 – I/O Data of the Audio Object Identification AIM

Input | Description
Audio Scene Geometry | The digital representation of the spatial arrangement of the Audio Objects of the Scene.
Audio Objects | The Audio Objects in the Audio Scene Geometry that have an identifiable source and are the target of identification.
Output | Description
Audio Instance Identifier | The Identifier of the specific Audio Object belonging to a level in the taxonomy.

5      SubAIMs

No SubAIMs.

6     JSON Metadata

https://schemas.mpai.community/CAE/V2.1/AIMs/AudioObjectIdentification.json