1. Definition 2. Functional Requirements 3. Syntax 4. Semantics

1. Definition

The focused status data obtained by observing the scene and individual and collective behaviour of Performers in the Real Environment by interpreting the Descriptors, per the current Cue Point, including behaviours, Personal Status (Emotion, Cognitive State, and Social Attitude), as needed by the Action Descriptor Generation AIM to service the current cue point.

2. Functional Requirements

  1. Visual (performers and scene)
    1. Visual status data
      1. From Proximity Performer A
      2. Gaze: performer A looking at Performer B
    2. Scene attributes
      1. Position and Orientation (Spatial Attitude) of set pieces
      2. The scene as volume (Volumetric, e.g., point cloud)
  2. Audio behaviour
    1. Performers are captured by one or more microphones (on body/stage).
    2. Status on a per-performer basis:
      1. Text uttered by performer.
      2. Performer’s audio activity:
        1. Performer doing a certain activity, e.g., Laughing, Clapping, Booing, Shouting, Singing
        2. Intensity of the Performer’s activity
        3. Particular phrase/text uttered
  3. Personal Status.
  4. Object data:
    1. The scene as volume or mesh (Volumetric, e.g., point cloud).
    2. Position and Orientation (Spatial Attitude) of set pieces.
    3. Other data (encoders, triggers, measuring devices, motion/proximity sensors, etc.).

3. Syntax

4. Semantics

Label Size Description
Header N1 Bytes Header
– Standard- 9 Bytes The characters “XRV-PFS”
– Version N2 Bytes Major version – 1 or 2 characters
– Dot-separator 1 Byte The character “.”
– Subversion N3 Bytes Minor version – 1 or 2 characters
MInstanceID N4 Bytes Identifier of M-Instance.
ID N5 Bytes Identifier of
SpaceTime N7 Bytes  Space-Time info of CogState.