1     Version

V1.0

2     Functions

Visual Object Identification (OSD-VOI):

  1. Receives
    1. Visual Scene Geometry
    2. Visual Objects
    3. Body Descriptors
  2. Produces a Visual Instance Identifier that identifies a Visual Object in the Scene as belonging to some level in a taxonomy.

3      Reference Architecture

Figure 1 depicts the Reference Architecture of the Visual Object Identification AIM.

Figure 1 – The Visual Object Identification Composite AIM

Note that the Visual Direction Identification AIM can parse either an AV Scene Geometry or its Visual Scene Geometry subset.

4      I/O Data

Table 17 specifies the Input and Output Data of the Visual Object Identification AIM.

Table 17 – I/O Data of the Visual Object Identification AIM

Input                       Description
Body Descriptors            The Descriptors of the Body Objects of Entities in the Visual Scene.
Visual Scene Geometry       The digital representation of the spatial arrangement of the Visual Objects of the Scene.
Visual Object               The Visual Objects in the Visual Scene that are not Entities.
Output                      Description
Visual Instance Identifier  The Identifier of the specific Visual Object belonging to a level in the taxonomy.
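
As an informal illustration of Table 17, the inputs and output could be modelled as plain data containers. This is a minimal sketch, not part of the specification: the class names VOIInput and VOIOutput, the helper missing_inputs, and the Python types are all hypothetical, and the actual data formats are those defined by the JSON Metadata of the AIM.

```python
from dataclasses import dataclass, fields
from typing import Any, List, Optional


@dataclass
class VOIInput:
    """Hypothetical container; field names mirror the Input rows of Table 17.

    The Python types are placeholders, since the real data formats are
    defined elsewhere in the specification.
    """
    body_descriptors: Optional[Any] = None
    visual_scene_geometry: Optional[Any] = None
    visual_object: Optional[Any] = None


@dataclass
class VOIOutput:
    """Hypothetical container for the single Output row of Table 17."""
    visual_instance_identifier: str


def missing_inputs(data: VOIInput) -> List[str]:
    """Return the names of the inputs that have not been supplied (are None)."""
    return [f.name for f in fields(data) if getattr(data, f.name) is None]
```

A caller could use missing_inputs to check that all three inputs are present before invoking an implementation of the AIM.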

5      SubAIMs

Visual Direction Identification
Visual Object Extraction
Visual Instance Identification
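
The following sketch suggests how a Composite AIM might chain the three SubAIMs listed above. The wiring is an assumption inferred from the SubAIM names and the I/O Data, not taken from Figure 1: a direction is derived from the Body Descriptors and the Scene Geometry, that direction selects a Visual Object, and the selected object is then identified. The function make_visual_object_identification, the callable signatures, and the toy stand-ins are all hypothetical.

```python
from typing import Any, Callable


def make_visual_object_identification(
    identify_direction: Callable[[Any, Any], Any],
    extract_object: Callable[[Any, Any, Any], Any],
    identify_instance: Callable[[Any], str],
) -> Callable[[Any, Any, Any], str]:
    """Compose three SubAIM callables into one composite callable.

    ASSUMPTION: the data flow below is illustrative only; the normative
    connections are those shown in the Reference Architecture.
    """

    def composite(body_descriptors: Any, visual_scene_geometry: Any,
                  visual_objects: Any) -> str:
        # Visual Direction Identification (assumed inputs: body + geometry)
        direction = identify_direction(body_descriptors, visual_scene_geometry)
        # Visual Object Extraction (assumed inputs: direction + geometry + objects)
        visual_object = extract_object(direction, visual_scene_geometry, visual_objects)
        # Visual Instance Identification (assumed input: the extracted object)
        return identify_instance(visual_object)

    return composite


# Toy stand-ins for the SubAIMs, operating on made-up data structures.
voi = make_visual_object_identification(
    identify_direction=lambda body, geometry: body["pointing_at"],
    extract_object=lambda direction, geometry, objects: objects[geometry[direction]],
    identify_instance=lambda obj: f"instance:{obj}",
)

result = voi({"pointing_at": "left"}, {"left": 0, "right": 1}, ["mug", "lamp"])
print(result)  # instance:mug
```

Passing the SubAIMs in as callables keeps the composite independent of any particular implementation, so each SubAIM can be replaced without touching the others.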

6     JSON Metadata

https://schemas.mpai.community/OSD/V1.0/AIMs/VisualObjectIdentification.json