(Tentative)

Definition

Visual Scene Enhancers (VSE) is a domain-aware augmentation of Visual Scene Descriptors (VSD). It captures refined spatial semantics, object relations, constraints, and directive-aligned hints so that downstream AIMs (e.g., Prompt Creation, Rendering, Action Planning) can reason and act more reliably. VSE is produced through an iterative enrichment loop, typically between Visual Spatial Reasoning (VSR) and Domain Access (DAC), that normalises raw descriptors, applies domain knowledge, encodes feasibility, occlusion, and affordances, and binds resolution hints consistent with the current Visual Action Directives.
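
One pass of the enrichment loop described above can be sketched as follows. This is a minimal illustration, not the normative behaviour of VSR or DAC: the dict layout, the function names (`normalise`, `apply_domain_knowledge`, `encode_feasibility`, `enrich`), the `stage` field, and the mapping of each step to VSD1, VSD2, and VSD3 are all assumptions made for the example.

```python
# Hypothetical sketch: descriptors are plain dicts; none of these names come from the schema.

def normalise(vsd: dict) -> dict:
    """Assumed VSR step: tidy raw object attributes without dropping any of them."""
    objects = [{**obj, "type": str(obj.get("type", "")).lower()} for obj in vsd["objects"]]
    return {**vsd, "objects": objects, "stage": "VSD1"}


def apply_domain_knowledge(vsd: dict, prohibited_zones: set) -> dict:
    """Assumed DAC step: attach domain-derived constraints to the descriptor."""
    constraints = {"prohibited_zones": sorted(prohibited_zones)}
    return {**vsd, "constraints": constraints, "stage": "VSD2"}


def encode_feasibility(vsd: dict) -> dict:
    """Assumed VSR step: mark objects inside a prohibited zone as not reachable."""
    prohibited = set(vsd["constraints"]["prohibited_zones"])
    objects = [{**obj, "reachable": obj.get("zone") not in prohibited} for obj in vsd["objects"]]
    return {**vsd, "objects": objects, "stage": "VSD3"}


def enrich(vsd0: dict, prohibited_zones: set) -> dict:
    """Run one pass of the loop; the input descriptor is left unmodified."""
    vsd1 = normalise(vsd0)
    vsd2 = apply_domain_knowledge(vsd1, prohibited_zones)
    return encode_feasibility(vsd2)
```

Each step returns a new dict, so the baseline descriptor stays available for comparison or for a further pass of the loop.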

Functional Requirements

    • Semantic Enrichment
      • Must transform baseline Visual Scene Descriptors (VSD0) into enriched structures (VSD1→VSD2→VSD3) that add relations, salience, occlusion, affordances, and reachability needed by downstream AIMs.
      • Must preserve and normalise original object attributes (IDs, types, bounding data) while appending domain-aware annotations.
    • Directive Alignment
      • Must encode the active Visual Action Directive (VAD), including priorities, constraints, and scope, so that enrichment remains goal-aligned across the loop with DAC and VSR.
      • Must support conditional logic that binds enrichment to directive classes and parameters, mirroring the intent/domain-class consistency pattern in the Intent data-type page.
    • Spatial Reasoning Features
      • Must provide refined localisation (pose, depth/occlusion state, proximity zones), affordance inference, and salience mapping usable by the Prompt Creation and A-User Formation AIMs.
      • Should expose confidence scores per inference (e.g., occlusion, affordance) to enable downstream filtering and fallback, following the confidence pattern used in Intent.
    • Object & Relation Graph
      • Must represent inter-object relations (e.g., adjacency, containment, support, collision risk) with time-stamped updates for dynamic scenes.
      • Should include semantic tags (domain-specific labels) for objects and zones to improve reasoning fidelity, consistent with other PGM data-type semantics.
    • Constraints & Feasibility
      • Must report feasibility flags (e.g., reachable, valid, occlusion level) and any residual constraints left unsatisfied during enrichment, enabling conformance decisions.
      • Must capture directive- and domain-derived constraints injected by DAC (e.g., prohibited zones, safety margins, governance limits).
    • Resolution Hints & Rendering Guidance
      • Must provide Resolution Hints (e.g., preferred viewpoints, LOD prioritisation, update cadence) to assist rendering and prompt planning.
      • Should include alternative viewpoints and fallback hint sets when confidence or feasibility drops below thresholds.
    • Traceability & Governance
      • Must carry a Trace block (originating module, timestamps aligned to M-Instance time base, version), supporting audit and provenance across enrichment cycles.
      • Should encode governance metadata (policy references, compliance status, decision path) to satisfy PGM conformance and potential cross-standard governance needs.
    • Identity & Referencing
      • Must include globally unique IDs for the VSE record, referenced objects/zones, and linkage to M-Instance/M-Environment identifiers to enable chaining and logging.
      • Should support cross-referencing to other data types (e.g., Intent, Context, Prompt Plans) for end-to-end workflow integration.
    • Performance Hooks
      • Should expose metrics fields for enrichment latency, coverage (percentage of scene enriched), and uncertainty summaries to feed performance sections of the spec.
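
The fallback behaviour in the Resolution Hints requirement can be illustrated with a short sketch. It assumes an enrichment entry is a dict carrying the Confidence and ResolutionHints fields; the function name `select_hints`, the separate `fallback_hints` argument, and the 0.5 threshold are hypothetical and not defined by the specification.

```python
def select_hints(entry: dict, fallback_hints: list, min_confidence: float = 0.5) -> list:
    """Return the entry's ResolutionHints, or the fallback set when Confidence is too low.

    The threshold and the separate fallback list are assumptions for this sketch.
    A missing Confidence is treated as 0.0, which always selects the fallback.
    """
    confidence = entry.get("Confidence", 0.0)
    if confidence >= min_confidence:
        return entry.get("ResolutionHints", [])
    return fallback_hints


# Usage with illustrative values only.
primary = {
    "Confidence": 0.9,
    "ResolutionHints": [{"HintType": "prioritise_gaze", "HintValue": "object in focus"}],
}
fallback = [{"HintType": "prioritise_proximity", "HintValue": "low-confidence fallback"}]

chosen = select_hints(primary, fallback)
```

A downstream AIM could apply the same pattern to per-inference confidence scores, for example dropping an affordance whose score falls below its own threshold.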

Syntax

https://schemas.mpai.community/PGM1/V1.0/data/VisualSceneEnhancers.json

Semantics

Label                   Description
Header                  Visual Scene Enhancers Header
– Standard-VSE          The characters ‘PGM-VSE-V’
– Version               Major version – 1 or 2 characters
– Dot-separator         The character ‘.’ separating version components
– Subversion            Minor version – 1 or 2 characters
MInstanceID             Identifier of the M-Instance
VSDID                   Array referencing baseline VSD1 descriptors
EnhancersTime           Timestamp of enrichment
VSEData                 Array of enrichment entries
– Scope                 Target scope: global | object | zone
– ObjectID              Required when Scope=object; must resolve to VSD1 ObjectID
– ZoneID                Required when Scope=zone; must resolve to a known Scene Zone
– Roles                 Array of role annotations for visual actors/entities
–– visualRole           Role label (system, service, narrator, player, etc.)
–– authorityLevel       Authority level of the role
–– affiliation          Affiliation or organisation tag
–– roleConfidence       Confidence score [0..1] for role assignment
– Rights                Rights structure (MMM4/V2.1/data/Rights.json)
– Rule                  Rule structure (MMM4/V2.1/data/Rule.json)
– ComplianceStatus      Enum: compliant | violating | unknown
– RequiredDelegation    Boolean flag indicating delegation requirement
– Events                Array of event objects
–– EventType            Enum: gesture, display_change, notification, alarm, etc.
–– EventSubType         Optional sub-type string
–– Urgency              Enum: low | medium | high
–– Channel              Event channel (e.g., visual, system)
–– Intensity            Number [0..1] indicating event intensity
– DomainContext         Domain-specific context information
–– locationSemantic     Semantic label of location
–– activityPhase        Enum: lobby, warm-up, match, quest, etc.
–– zoneID               Zone link (MMM4/V2.1/data/MLocation.json)
– Constraints           Reachability and visibility constraints
–– reachable_from       PointOfView reference (OSD/V1.4/data/PointOfView.json)
–– visible_from         PointOfView reference (OSD/V1.4/data/PointOfView.json)
–– exclude_occluded     Boolean to exclude occluded elements
–– distance_limit_m     Number ≥ 0 giving the maximum distance in metres
– ResolutionHints       Array of rendering guidance hints
–– HintType             Enum: prioritise_gaze, prioritise_proximity, etc.
–– HintValue            Associated value or rationale
– Confidence            Overall confidence score [0..1] for enrichment
– Justifications        Array of URI strings providing evidence
– Trace                 Provenance metadata
–– Origin               Module or AIM name producing this entry
–– Timestamp            Timestamp aligned to SpaceTime schema
Versioning              Version lineage and validity interval
– supersedes            Previous VSE ID superseded by this instance
– validityInterval      Interval of validity (OSD/V1.4/data/SpaceTime.json)
DescrMetadata           Free-form descriptive metadata (title, summary, tags)
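
Several rules in the table above are conditional or range-based, and a small checker makes them concrete. The sketch below is illustrative only and does not replace validation against the JSON schema. It assumes a VSEData entry is a plain dict keyed by the labels in the table and that every field other than Scope is optional, which the table does not state. The function name `validate_entry` is hypothetical, and the check that ObjectID or ZoneID resolves to a known object or zone is left out.

```python
def validate_entry(entry: dict) -> list:
    """Collect rule violations for one VSEData entry; an empty list means none were found."""
    errors = []

    # Scope is one of three targets and decides which identifier is required.
    scope = entry.get("Scope")
    if scope not in {"global", "object", "zone"}:
        errors.append(f"Scope must be global, object or zone, got {scope!r}")
    if scope == "object" and not entry.get("ObjectID"):
        errors.append("ObjectID is required when Scope=object")
    if scope == "zone" and not entry.get("ZoneID"):
        errors.append("ZoneID is required when Scope=zone")

    # Enumerated and bounded fields are checked only when present (assumed optional).
    status = entry.get("ComplianceStatus")
    if status is not None and status not in {"compliant", "violating", "unknown"}:
        errors.append(f"ComplianceStatus has unexpected value {status!r}")

    confidence = entry.get("Confidence")
    if confidence is not None and not 0.0 <= confidence <= 1.0:
        errors.append("Confidence must lie in [0..1]")

    limit = entry.get("Constraints", {}).get("distance_limit_m")
    if limit is not None and limit < 0:
        errors.append("distance_limit_m must be >= 0")

    return errors


# Usage with illustrative values only.
good = {"Scope": "object", "ObjectID": "obj-1", "Confidence": 0.8,
        "Constraints": {"distance_limit_m": 2.5}}
bad = {"Scope": "zone", "Confidence": 1.5, "Constraints": {"distance_limit_m": -1}}

good_errors = validate_entry(good)
bad_errors = validate_entry(bad)
```

Returning a list of messages rather than raising on the first failure lets a producer report every problem in an entry at once, which fits the residual-constraint reporting asked for in the functional requirements.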