PGM-AUA V1.0 Data Types - Visual Scene Enhancers

(Tentative)

Definition

Visual Scene Enhancers is a domain-aware augmentation of Visual Scene Descriptors. It captures refined spatial semantics, object relations, constraints, and directive-aligned hints so downstream AIMs (e.g., Prompt Creation, Rendering, Action Planning) can reason and act with higher reliability. VSE is produced through an iterative enrichment loop—typically between Visual Spatial Reasoning (VSR) and Domain Access (DAC)—that normalises raw descriptors, applies domain knowledge, encodes feasibility/occlusion/affordances, and binds resolution hints consistent with current Visual Action Directives.

Functional Requirements

Semantic Enrichment
- - Must transform baseline Visual Scene Descriptors (VSD0) into enriched structures (VSD1→VSD2→VSD3) that add relations, salience, occlusion, affordances, and reachability needed by downstream AIMs.
  - Must preserve and normalise original object attributes (IDs, types, bounding data) while appending domain-aware annotations.
- Directive Alignment
  - Must encode the active Visual Action Directive (VAD)—including priorities, constraints, and scope—so enrichment remains goal-aligned across the loop with DAC and VSR.
  - Must support conditional logic that binds enrichment to directive classes and parameters, mirroring the intent/domain-class consistency pattern in the Intent data-type page.
- Spatial Reasoning Features
  - - Must provide refined localisation (pose, depth/occlusion state, proximity zones), affordance inference, and salience mapping usable by Prompt Creation and A-User Formation AIM.
    - Should expose confidence scores per inference (e.g., occlusion, affordance) to enable downstream filtering and fallback, following the confidence pattern used in Intent.
- Object & Relation Graph
  - Must represent inter-object relations (e.g., adjacency, containment, support, collision risk) with time-stamped updates for dynamic scenes.
  - Should include semantic tags (domain-specific labels) for objects and zones to improve reasoning fidelity, consistent with other PGM data-type semantics.
- Constraints & Feasibility
  - Must report feasibility flags (e.g., reachable, valid, occluded levels) and residual constraints not satisfied during enrichment, enabling conformance decisions.
  - Must capture directive- and domain-derived constraints injected by DAC (e.g., prohibited zones, safety margins, governance limits).
- Resolution Hints & Rendering Guidance
  - Must provide Resolution Hints (e.g., preferred viewpoints, LOD prioritisation, update cadence) to assist rendering and prompt planning.
  - Should include alternative viewpoints and fallback hint sets when confidence or feasibility drops below thresholds.
- Traceability & Governance
  - Must carry a Trace block (originating module, timestamps aligned to M-Instance time base, version), supporting audit and provenance across enrichment cycles.
  - Should encode governance metadata (policy references, compliance status, decision path) to satisfy PGM conformance and potential cross-standard governance needs.
- Identity & Referencing
  - Must include globally unique IDs for the VSE record, referenced objects/zones, and linkage to M-Instance/M-Environment identifiers to enable chaining and logging.
  - Should support cross-referencing to other data types (e.g., Intent, Context, Prompt Plans) for end-to-end workflow integration.
- Performance Hooks
  - Should expose metrics fields for enrichment latency, coverage (percentage of scene enriched), and uncertainty summaries to feed performance sections of the spec.
Syntax

https://schemas.mpai.community/PGM1/V1.0/data/VisualSceneEnhancers.json

Semantics

Label	Description
Header	Visual Scene Enhancers Header
– Standard-VSE	The characters ‘PGP-VSE-V’
– Version	Major version – 1 or 2 characters
– Dot-separator	The character ‘.’ separating version components
– Subversion	Minor version – 1 or 2 characters
MInstanceID	Identifier of the M-Instance
VSDID	Array referencing baseline VSD1 descriptors
EnhancersTime	Timestamp of enrichment
VSEData	Array of enrichment entries
– Scope	Target scope: global \| object \| zone
– ObjectID	Required when Scope=object; must resolve to VSD1 ObjectID
– ZoneID	Required when Scope=zone; must resolve to a known Scene Zone
– Roles	Array of role annotations for visual actors/entities
–– visualRole	Role label (system, service, narrator, player, etc.)
–– authorityLevel	Authority level of the role
–– affiliation	Affiliation or organisation tag
–– roleConfidence	Confidence score [0..1] for role assignment
– Rights	Rights structure (MMM4/V2.1/data/Rights.json)
– Rule	Rule structure (MMM4/V2.1/data/Rule.json)
– ComplianceStatus	Enum: compliant \| violating \| unknown
– RequiredDelegation	Boolean flag indicating delegation requirement
– Events	Array of event objects
–– EventType	Enum: gesture, display_change, notification, alarm, etc.
–– EventSubType	Optional sub-type string
–– Urgency	Enum: low \| medium \| high
–– Channel	Event channel (e.g., visual, system)
–– Intensity	Number [0..1] indicating event intensity
– DomainContext	Domain-specific context information
–– locationSemantic	Semantic label of location
–– activityPhase	Enum: lobby, warm-up, match, quest, etc.
–– zoneID	Zone link (MMM4/V2.1/data/MLocation.json)
– Constraints	Reachability and visibility constraints
–– reachable_from	PointOfView reference (OSD/V1.4/data/PointOfView.json)
–– visible_from	PointOfView reference (OSD/V1.4/data/PointOfView.json)
–– exclude_occluded	Boolean to exclude occluded elements
–– distance_limit_m	Number ≥0 indicating max distance in meters
– ResolutionHints	Array of rendering guidance hints
–– HintType	Enum: prioritise_gaze, prioritise_proximity, etc.
–– HintValue	Associated value or rationale
– Confidence	Overall confidence score [0..1] for enrichment
– Justifications	Array of URI strings providing evidence
– Trace	Provenance metadata
–– Origin	Module or AIM name producing this entry
–– Timestamp	Timestamp aligned to SpaceTime schema
Versioning	Version lineage and validity interval
– supersedes	Previous VSE ID superseded by this instance
– validityInterval	Interval of validity (OSD/V1.4/data/SpaceTime.json)
DescrMetadata	Free-form descriptive metadata (title, summary, tags)

Cookie	Duration	Description
cookielawinfo-checkbox-necessary	1 year	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Technical".
CookieLawInfoConsent	1 year	The cookie is set by the GDPR Cookie Consent plug-in and is used to store whether the user has consented to the use of cookies or not. It does not store any personal data.
viewed_cookie_policy	1 year	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
_pk_id.6.08a8	13 months	Used to store a few details about the user such as the unique visitor ID
_pk_ses.6.08a8	30 minutes	Short lived cookies used to temporarily store data for the visit

PGM-AUA V1.0 Data Types – Visual Scene Enhancers

(Tentative)

Definition

Functional Requirements

Syntax

Semantics

Notice