Go to MPAI-OSD V1.5 AI Modules

1 Functions

The Visual Object Identification (OSD‑VOI) AIM produces the Identifier, based on a Taxonomy, of the Visual Object in a Visual Scene Geometry that is crossed by the line from a Point of View:

| | Data | Description |
| --- | --- | --- |
| Receives | Visual Scene Geometry | The arrangement of the objects in the Scene, a subset of Visual Scene Descriptors. |
| | Visual Objects | The Objects in the Scene. |
| | Body Descriptors | Descriptors of the Body indicating the object. |
| Produces | Visual Instance ID | Identifying a Visual Object in the Scene that belongs to some level in a taxonomy. |

2 Reference Model

Figure 1 depicts the Reference Model of the Visual Object Identification (OSD‑VOI) AIM.

Figure 1 – The Visual Object Identification (OSD‑VOI) AIM

3 I/O Data

Table 1 specifies the Input and Output Data of the Visual Object Identification (OSD‑VOI) AIM.

Table 1 – I/O Data of the Visual Object Identification (OSD‑VOI) AIM

| Input | Description |
| --- | --- |
| Body Descriptors Object | The Descriptors of the Body Objects of Entities in the Visual Scene. |
| Visual Scene Geometry | The digital representation of the spatial arrangement of the Visual Objects of the Scene. |
| Visual Object | The Visual Objects in the Visual Scene that are not Entities. |

| Output | Description |
| --- | --- |
| Visual Instance Identifier | The Identifier of the specific Visual Object belonging to a level in the taxonomy. |

4 SubAIMs

4.1 Functions of SubAIMs

The Visual Object Identification (OSD‑VOI) AIM is a Composite AIM depicted in Figure 2.

Note that the Visual Direction Identification AIM can parse either an AV Scene Geometry or its Visual Scene Geometry subset.

Figure 2 – The Visual Object Identification (OSD‑VOI) Composite AIM

4.2 Operation

Visual Direction Identification receives Body Descriptors and Visual Scene Geometry, from which it derives the Point of View of the human finger. Visual Object Extraction uses the Point of View to find which object in the scene lies on the line from the Point of View and produces the corresponding Visual Object. Visual Instance Identification then provides the Identifier of that Visual Object as output.
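The three-step operation can be illustrated with a minimal Python sketch. It is not part of the specification: the data classes, the reduction of Body Descriptors to two finger positions, the bounding-sphere test, and the `label/object_id` identifier format are all assumptions made for illustration only.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple

Vec3 = Tuple[float, float, float]


@dataclass
class PointOfView:
    origin: Vec3     # assumed: fingertip position
    direction: Vec3  # assumed: unit vector along the finger


@dataclass
class SceneObject:
    object_id: str   # hypothetical identifier of the object in the scene
    centre: Vec3
    radius: float    # hypothetical bounding sphere
    label: str       # hypothetical taxonomy label


def identify_direction(finger_base: Vec3, finger_tip: Vec3) -> PointOfView:
    """Stand-in for Visual Direction Identification."""
    d = tuple(t - b for t, b in zip(finger_tip, finger_base))
    norm = math.sqrt(sum(c * c for c in d))
    if norm == 0:
        raise ValueError("finger base and tip coincide")
    return PointOfView(finger_tip, tuple(c / norm for c in d))


def extract_object(pov: PointOfView, scene: List[SceneObject]) -> Optional[SceneObject]:
    """Stand-in for Visual Object Extraction: nearest object crossed by the line."""
    best, best_t = None, math.inf
    for obj in scene:
        to_centre = tuple(c - o for c, o in zip(obj.centre, pov.origin))
        # Distance along the pointing line to the point closest to the centre.
        t = sum(a * b for a, b in zip(to_centre, pov.direction))
        if t < 0:
            continue  # the object is behind the finger
        dist_sq = sum(c * c for c in to_centre) - t * t
        if dist_sq <= obj.radius ** 2 and t < best_t:
            best, best_t = obj, t
    return best


def identify_instance(obj: SceneObject) -> str:
    """Stand-in for Visual Instance Identification."""
    return f"{obj.label}/{obj.object_id}"


def visual_object_identification(finger_base: Vec3, finger_tip: Vec3,
                                 scene: List[SceneObject]) -> Optional[str]:
    pov = identify_direction(finger_base, finger_tip)
    obj = extract_object(pov, scene)
    return identify_instance(obj) if obj is not None else None
```

In this sketch the instance identifier is derived from the extracted object alone, whereas the Visual Instance Identification SubAIM also receives the Visual Scene Geometry; that input is omitted here for brevity.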

4.3 I/O Data of SubAIMs

Table 2 provides the Input and Output Data of the three OSD‑VOI SubAIMs.

Table 2 – Input and Output Data of the OSD‑VOI SubAIMs

| SubAIM | Input | Output |
| --- | --- | --- |
| Visual Direction Identification | Body Descriptors Object, Visual Scene Geometry | Point of View |
| Visual Object Extraction | Point of View, Visual Scene Geometry | Visual Object |
| Visual Instance Identification | Visual Scene Geometry, Visual Object | Visual Instance Identifier |
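The data flow in Table 2 can be checked mechanically. The following sketch, which is illustrative and not part of the specification, encodes the table as a Python dictionary and derives an execution order in which every SubAIM input is already available. It assumes that only Body Descriptors Object and Visual Scene Geometry are supplied from outside, so that the dependency of Visual Instance Identification on Visual Object Extraction is visible; the function name `execution_order` is hypothetical.

```python
# Table 2 transcribed as data: SubAIM name -> its inputs and outputs.
SUBAIMS = {
    "Visual Direction Identification": {
        "inputs": ["Body Descriptors Object", "Visual Scene Geometry"],
        "outputs": ["Point of View"],
    },
    "Visual Object Extraction": {
        "inputs": ["Point of View", "Visual Scene Geometry"],
        "outputs": ["Visual Object"],
    },
    "Visual Instance Identification": {
        "inputs": ["Visual Scene Geometry", "Visual Object"],
        "outputs": ["Visual Instance Identifier"],
    },
}

# Assumption for this sketch: data supplied to the Composite AIM from outside.
EXTERNAL_INPUTS = {"Body Descriptors Object", "Visual Scene Geometry"}


def execution_order(subaims, external_inputs):
    """Return SubAIM names in an order where all inputs are already available."""
    available = set(external_inputs)
    pending = dict(subaims)
    order = []
    while pending:
        ready = [name for name, io in pending.items()
                 if set(io["inputs"]) <= available]
        if not ready:
            raise ValueError(f"unsatisfied inputs for: {sorted(pending)}")
        for name in ready:
            order.append(name)
            available.update(pending.pop(name)["outputs"])
    return order


if __name__ == "__main__":
    for step, name in enumerate(execution_order(SUBAIMS, EXTERNAL_INPUTS), 1):
        print(step, name)
```

Under the stated assumption, the derived order matches the sequence described in 4.2 Operation.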

4.4 AIMs and JSON Metadata

Table 3 provides the links to the AIM specifications and JSON schemas. AIM1 indicates the Composite AIM and AIM2 its SubAIMs.

Table 3 – AIMs and JSON Metadata of the Visual Object Identification (OSD‑VOI)

| AIM1 | AIM2 | Name | JSON |
| --- | --- | --- | --- |
| OSD‑VOI | | Visual Object Identification | X |
| | OSD‑VDI | Visual Direction Identification | X |
| | OSD‑VOE | Visual Object Extraction | X |
| | OSD‑VII | Visual Instance Identification | X |

5 JSON Metadata

https://schemas.mpai.community/OSD/V1.5/AIMs/VisualObjectIdentification.json

6 Profiles

No Profiles.

7 Reference Software

Not part of this specification.

8 Conformance Testing

Table 4 provides the Conformance Testing Method for the OSD‑VOI AIM. Conformance Testing of the individual SubAIMs is given by the individual AIM specifications.

If a schema contains references to other schemas, conformance of data for the primary schema implies that any data referencing a secondary schema shall also validate against the relevant schema, if present, and conform with the Qualifier, if present.

Table 4 – Conformance Testing Method for the OSD‑VOI AIM

| | Data | Conformance Testing Method |
| --- | --- | --- |
| Receives | Visual Scene Geometry | Shall validate against Visual Scene Geometry schema. |
| | Visual Objects | Shall validate against Visual Object schema. Visual Data shall conform with Visual Qualifier. |
| | Body Descriptors Object | Shall validate against Body Descriptors schema. |
| Produces | Visual Instance ID | Shall validate against Instance ID schema. |
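The rule on referenced schemas can be illustrated with a small Python sketch. It is not a conformance tool: the two schemas below are hypothetical, heavily simplified stand-ins (required keys plus references), and the field names `id`, `data`, and `format` are invented for illustration. An actual test would run a JSON Schema validator against the published MPAI schemas.

```python
# Hypothetical stand-ins for real schemas: required keys plus, for some keys,
# the name of the secondary schema that the nested data must also satisfy.
SCHEMAS = {
    "VisualObject": {"required": ["id", "data"], "refs": {"data": "VisualData"}},
    "VisualData": {"required": ["format"], "refs": {}},
}


def conformance_errors(data, schema_name, schemas, path="$"):
    """Collect errors for data against a primary schema and referenced schemas."""
    if not isinstance(data, dict):
        return [f"{path}: expected an object"]
    schema = schemas[schema_name]
    errors = [f"{path}.{key}: missing"
              for key in schema["required"] if key not in data]
    for key, ref in schema["refs"].items():
        # Referenced data is checked only if both it and its schema are present.
        if key in data and ref in schemas:
            errors.extend(
                conformance_errors(data[key], ref, schemas, f"{path}.{key}"))
    return errors
```

Returning a list of errors rather than a boolean makes it possible to report which nested item failed, for example `$.data.format: missing` when the secondary data lacks a required key.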

9 Performance Assessment

Not part of this specification.
