(Tentative)
Definition
Visual Scene Enhancers (VSE) is a domain-aware augmentation of Visual Scene Descriptors (VSD). It captures refined spatial semantics, object relations, constraints, and directive-aligned hints so that downstream AIMs (e.g., Prompt Creation, Rendering, Action Planning) can reason and act more reliably. VSE is produced through an iterative enrichment loop, typically between Visual Spatial Reasoning (VSR) and Domain Access (DAC), that normalises raw descriptors, applies domain knowledge, encodes feasibility, occlusion, and affordances, and binds resolution hints consistent with the current Visual Action Directives.
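The enrichment loop described above can be pictured as a fixed sequence of steps repeated until the descriptors stop changing. The following is a minimal sketch only, not the normative behaviour of VSR or DAC: the four step functions, the in-memory dictionary layout (`objects`, `distance_m`, `tags`, `reachable`, `resolution_hints`), the directive keys (`domain`, `distance_limit_m`), and the `max_rounds` cut-off are all hypothetical names chosen for illustration.

```python
import copy


def normalise(vsd, directive):
    # Hypothetical normalisation: canonicalise object type labels.
    out = copy.deepcopy(vsd)
    for obj in out["objects"]:
        obj["type"] = obj["type"].strip().lower()
    return out


def apply_domain_knowledge(vsd, directive):
    # Hypothetical domain step: tag each object with the directive's domain.
    out = copy.deepcopy(vsd)
    for obj in out["objects"]:
        tags = obj.setdefault("tags", [])
        if directive["domain"] not in tags:
            tags.append(directive["domain"])
    return out


def encode_feasibility(vsd, directive):
    # Hypothetical feasibility rule: reachable if within the distance limit.
    out = copy.deepcopy(vsd)
    for obj in out["objects"]:
        obj["reachable"] = obj["distance_m"] <= directive["distance_limit_m"]
    return out


def bind_resolution_hints(vsd, directive):
    # Hypothetical hint: prioritise the objects currently flagged reachable.
    out = copy.deepcopy(vsd)
    reachable_ids = [o["id"] for o in out["objects"] if o["reachable"]]
    out["resolution_hints"] = [
        {"HintType": "prioritise_proximity", "HintValue": reachable_ids}
    ]
    return out


STEPS = (normalise, apply_domain_knowledge, encode_feasibility, bind_resolution_hints)


def enrich(vsd, directive, max_rounds=3):
    """Repeat all steps until a round changes nothing or max_rounds is hit.

    Returns the enriched descriptors and the number of rounds executed.
    """
    current = vsd
    for round_no in range(1, max_rounds + 1):
        updated = current
        for step in STEPS:
            updated = step(updated, directive)
        if updated == current:
            return current, round_no
        current = updated
    return current, max_rounds


vsd0 = {
    "objects": [
        {"id": "cup-1", "type": " Cup ", "distance_m": 0.4},
        {"id": "shelf-2", "type": "Shelf", "distance_m": 3.0},
    ]
}
directive = {"domain": "kitchen", "distance_limit_m": 1.0}
enriched, rounds = enrich(vsd0, directive)
```

Each step returns a new structure rather than editing its input, so the baseline descriptors remain available for comparison and audit after enrichment.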
Functional Requirements
- Semantic Enrichment
- Must transform baseline Visual Scene Descriptors (VSD0) into enriched structures (VSD1→VSD2→VSD3) that add relations, salience, occlusion, affordances, and reachability needed by downstream AIMs.
- Must preserve and normalise original object attributes (IDs, types, bounding data) while appending domain-aware annotations.
- Directive Alignment
- Must encode the active Visual Action Directive (VAD)—including priorities, constraints, and scope—so enrichment remains goal-aligned across the loop with DAC and VSR.
- Must support conditional logic that binds enrichment to directive classes and parameters, mirroring the intent/domain-class consistency pattern in the Intent data-type page.
- Spatial Reasoning Features
- Must provide refined localisation (pose, depth/occlusion state, proximity zones), affordance inference, and salience mapping usable by the Prompt Creation and A-User Formation AIMs.
- Should expose confidence scores per inference (e.g., occlusion, affordance) to enable downstream filtering and fallback, following the confidence pattern used in Intent.
- Object & Relation Graph
- Must represent inter-object relations (e.g., adjacency, containment, support, collision risk) with time-stamped updates for dynamic scenes.
- Should include semantic tags (domain-specific labels) for objects and zones to improve reasoning fidelity, consistent with other PGM data-type semantics.
- Constraints & Feasibility
- Must report feasibility flags (e.g., reachable, valid, occlusion level) and any residual constraints left unsatisfied during enrichment, enabling conformance decisions.
- Must capture directive- and domain-derived constraints injected by DAC (e.g., prohibited zones, safety margins, governance limits).
- Resolution Hints & Rendering Guidance
- Must provide Resolution Hints (e.g., preferred viewpoints, LOD prioritisation, update cadence) to assist rendering and prompt planning.
- Should include alternative viewpoints and fallback hint sets when confidence or feasibility drops below thresholds.
- Traceability & Governance
- Must carry a Trace block (originating module, timestamps aligned to M-Instance time base, version), supporting audit and provenance across enrichment cycles.
- Should encode governance metadata (policy references, compliance status, decision path) to satisfy PGM conformance and potential cross-standard governance needs.
- Identity & Referencing
- Must include globally unique IDs for the VSE record, referenced objects/zones, and linkage to M-Instance/M-Environment identifiers to enable chaining and logging.
- Should support cross-referencing to other data types (e.g., Intent, Context, Prompt Plans) for end-to-end workflow integration.
- Performance Hooks
- Should expose metrics fields for enrichment latency, coverage (percentage of scene enriched), and uncertainty summaries to feed performance sections of the spec.
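The performance and confidence requirements above can be illustrated with a short computation over a list of enrichment entries. This is a sketch under stated assumptions: the output keys (`latency_ms`, `coverage_pct`, `mean_confidence`, `min_confidence`) and the threshold-based split are hypothetical and are not defined by this page. Coverage is interpreted here as the share of baseline objects that have at least one entry with Scope=object, which is only one possible reading of "percentage of scene enriched".

```python
from statistics import mean


def enrichment_metrics(baseline_object_ids, vse_data, latency_ms):
    """Summarise coverage and uncertainty for one enrichment cycle."""
    baseline = set(baseline_object_ids)
    # Distinct baseline objects that received at least one object-scoped entry.
    enriched = {
        e["ObjectID"]
        for e in vse_data
        if e.get("Scope") == "object" and e.get("ObjectID") in baseline
    }
    coverage_pct = 100.0 * len(enriched) / len(baseline) if baseline else 0.0
    confidences = [e["Confidence"] for e in vse_data if "Confidence" in e]
    return {
        "latency_ms": latency_ms,
        "coverage_pct": coverage_pct,
        "mean_confidence": mean(confidences) if confidences else None,
        "min_confidence": min(confidences) if confidences else None,
    }


def split_by_confidence(vse_data, threshold):
    """Separate entries a consumer may use from those needing a fallback.

    Entries without a Confidence value are treated as confidence 0.0.
    """
    accepted = [e for e in vse_data if e.get("Confidence", 0.0) >= threshold]
    rejected = [e for e in vse_data if e.get("Confidence", 0.0) < threshold]
    return accepted, rejected


entries = [
    {"Scope": "object", "ObjectID": "o1", "Confidence": 0.9},
    {"Scope": "object", "ObjectID": "o2", "Confidence": 0.5},
    {"Scope": "zone", "ZoneID": "z1", "Confidence": 0.7},
    {"Scope": "object", "ObjectID": "o1", "Confidence": 0.3},
]
metrics = enrichment_metrics(["o1", "o2", "o3", "o4"], entries, latency_ms=12.5)
usable, needs_fallback = split_by_confidence(entries, threshold=0.6)
```

A downstream AIM could apply the split to decide when to request alternative viewpoints or a fallback hint set. The threshold itself is left to the consumer in this sketch.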
Syntax
https://schemas.mpai.community/PGM1/V1.0/data/VisualSceneEnhancers.json
Semantics
| Label | Description |
| Header | Visual Scene Enhancers Header |
| – Standard-VSE | The characters ‘PGP-VSE-V’ |
| – Version | Major version – 1 or 2 characters |
| – Dot-separator | The character ‘.’ separating version components |
| – Subversion | Minor version – 1 or 2 characters |
| MInstanceID | Identifier of the M-Instance |
| VSDID | Array referencing baseline VSD1 descriptors |
| EnhancersTime | Timestamp of enrichment |
| VSEData | Array of enrichment entries |
| – Scope | Target scope: global, object, or zone |
| – ObjectID | Required when Scope=object; must resolve to VSD1 ObjectID |
| – ZoneID | Required when Scope=zone; must resolve to a known Scene Zone |
| – Roles | Array of role annotations for visual actors/entities |
| –– visualRole | Role label (system, service, narrator, player, etc.) |
| –– authorityLevel | Authority level of the role |
| –– affiliation | Affiliation or organisation tag |
| –– roleConfidence | Confidence score [0..1] for role assignment |
| – Rights | Rights structure (MMM4/V2.1/data/Rights.json) |
| – Rule | Rule structure (MMM4/V2.1/data/Rule.json) |
| – ComplianceStatus | Enum: compliant, violating, unknown |
| – RequiredDelegation | Boolean flag indicating delegation requirement |
| – Events | Array of event objects |
| –– EventType | Enum: gesture, display_change, notification, alarm, etc. |
| –– EventSubType | Optional sub-type string |
| –– Urgency | Enum: low, medium, high |
| –– Channel | Event channel (e.g., visual, system) |
| –– Intensity | Number [0..1] indicating event intensity |
| – DomainContext | Domain-specific context information |
| –– locationSemantic | Semantic label of location |
| –– activityPhase | Enum: lobby, warm-up, match, quest, etc. |
| –– zoneID | Zone link (MMM4/V2.1/data/MLocation.json) |
| – Constraints | Reachability and visibility constraints |
| –– reachable_from | PointOfView reference (OSD/V1.4/data/PointOfView.json) |
| –– visible_from | PointOfView reference (OSD/V1.4/data/PointOfView.json) |
| –– exclude_occluded | Boolean to exclude occluded elements |
| –– distance_limit_m | Number ≥ 0 indicating the maximum distance in metres |
| – ResolutionHints | Array of rendering guidance hints |
| –– HintType | Enum: prioritise_gaze, prioritise_proximity, etc. |
| –– HintValue | Associated value or rationale |
| – Confidence | Overall confidence score [0..1] for enrichment |
| – Justifications | Array of URI strings providing evidence |
| – Trace | Provenance metadata |
| –– Origin | Module or AIM name producing this entry |
| –– Timestamp | Timestamp aligned to SpaceTime schema |
| Versioning | Version lineage and validity interval |
| – supersedes | Previous VSE ID superseded by this instance |
| – validityInterval | Interval of validity (OSD/V1.4/data/SpaceTime.json) |
| DescrMetadata | Free-form descriptive metadata (title, summary, tags) |
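Several rows of the table imply checks that a consumer may want to run before using a record: the header layout, the conditional ObjectID and ZoneID requirements, the enumerated values, and the numeric ranges. The sketch below shows one way to express them; it is illustrative and does not replace validation against the JSON schema referenced under Syntax. It assumes that the Header is a single string formed by concatenating its four components and that Version and Subversion are decimal digits (the table only says "1 or 2 characters"). The function names are hypothetical.

```python
import re

# Assumptions: the header is one concatenated string and both version parts
# are decimal digits. The prefix is copied as given in the table.
HEADER_RE = re.compile(r"PGP-VSE-V([0-9]{1,2})\.([0-9]{1,2})")

SCOPES = {"global", "object", "zone"}
COMPLIANCE_STATUSES = {"compliant", "violating", "unknown"}
URGENCIES = {"low", "medium", "high"}


def parse_header(header):
    """Return (major, minor) from a header string, or raise ValueError."""
    match = HEADER_RE.fullmatch(header)
    if match is None:
        raise ValueError(f"malformed VSE header: {header!r}")
    return int(match.group(1)), int(match.group(2))


def _in_unit_interval(value):
    return isinstance(value, (int, float)) and 0.0 <= value <= 1.0


def check_entry(entry):
    """Return a list of problems found in one VSEData entry (empty if none)."""
    problems = []
    scope = entry.get("Scope")
    if scope not in SCOPES:
        problems.append("Scope must be global, object, or zone")
    if scope == "object" and not entry.get("ObjectID"):
        problems.append("ObjectID is required when Scope=object")
    if scope == "zone" and not entry.get("ZoneID"):
        problems.append("ZoneID is required when Scope=zone")
    if "Confidence" in entry and not _in_unit_interval(entry["Confidence"]):
        problems.append("Confidence must lie in [0..1]")
    status = entry.get("ComplianceStatus")
    if status is not None and status not in COMPLIANCE_STATUSES:
        problems.append("ComplianceStatus must be compliant, violating, or unknown")
    for i, role in enumerate(entry.get("Roles", [])):
        if "roleConfidence" in role and not _in_unit_interval(role["roleConfidence"]):
            problems.append(f"Roles[{i}].roleConfidence must lie in [0..1]")
    for i, event in enumerate(entry.get("Events", [])):
        if "Urgency" in event and event["Urgency"] not in URGENCIES:
            problems.append(f"Events[{i}].Urgency must be low, medium, or high")
        if "Intensity" in event and not _in_unit_interval(event["Intensity"]):
            problems.append(f"Events[{i}].Intensity must lie in [0..1]")
    limit = entry.get("Constraints", {}).get("distance_limit_m")
    if limit is not None and limit < 0:
        problems.append("distance_limit_m must be >= 0")
    return problems


version = parse_header("PGP-VSE-V1.0")
issues = check_entry({"Scope": "zone", "Confidence": 1.5})
```

The check only confirms that ObjectID or ZoneID is present. Resolving the identifier against the referenced descriptors or a known Scene Zone needs access to those records and is outside this sketch.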