(Tentative)

Function Reference Model Input/Output Data
SubAIMs JSON Metadata Profiles

1. Function

Prompt Creation AIM (PGM-PRC)

  1. Receives
    1. Context
    2. Audio and Visual Scene Descriptors from Audio and Visual Spatial Reasoning AIMs and, indirectly, Domain Access (PGM-DAC).
    3. Prompt Creation Directive from A-User Control.
  2. Synthesises these multimodal and contextual signals into coherent natural language prompts that describe the User’s interaction, clarify intent, and confirm inferred goals.
  3. Provides
    1. Textual PC-Prompt to Basic Knowledge LLM.
    2. Prompt Creation Status to A-User Control.

The resulting outputs enable A-User Control, Personality Alignment, and Rendering AIMs to operate with full awareness of the User’s communicative intent, supporting expression coherence, goal-driven orchestration, and context-sensitive interaction.

2. Reference Model

Figure 1 gives the Reference Model of Prompt Creation (PGM-PRC).

Figure 1 – The Reference Model of Prompt Creation (PGM-PRC)

3. Input/Output Data

Table 1 – Input/Output Data of PGM-PRC

| Input | Description |
|---|---|
| Context | A structured, time-stamped snapshot representing the A-User’s initial understanding of the environment and of the User’s posture. |
| Audio Scene Descriptors | A Data Type conveying spatially grounded semantic audio data from the Audio Spatial Reasoning AIM, enabling the Prompt Creation AIM to generate context-aware and referentially precise prompts. |
| Visual Scene Descriptors | A Data Type conveying spatially grounded semantic visual data from the Visual Spatial Reasoning AIM, enabling the Prompt Creation AIM to generate context-aware and referentially precise prompts. |
| Prompt Creation Directive | Trigger from PGM-AUC that initiates prompt generation or refinement. |
| Output | Description |
| PC-Prompt | Textual prompt to the Basic Knowledge LLM. |
| Prompt Creation Status | Prompt readiness, alignment status, and semantic goal framing, sent to PGM-AUC. |
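
The inputs and outputs in Table 1 can be pictured as plain data containers. The following is only a minimal sketch: every field name, the `directive` values, and the `create_prompt` function are hypothetical illustrations, since the specification names the data items but this page does not define their internal structure.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class PromptCreationInputs:
    # Field names are hypothetical; the specification only names the data items.
    context: dict[str, Any]
    audio_scene_descriptors: dict[str, Any]
    visual_scene_descriptors: dict[str, Any]
    directive: str  # e.g. "generate" or "refine" (assumed values)


@dataclass
class PromptCreationStatus:
    # Mirrors the three aspects listed for Prompt Creation Status in Table 1.
    prompt_ready: bool
    alignment_status: str
    goal_framing: str


@dataclass
class PromptCreationOutputs:
    pc_prompt: str
    status: PromptCreationStatus


def create_prompt(inputs: PromptCreationInputs) -> PromptCreationOutputs:
    """Toy stand-in for PGM-PRC: joins a few descriptor fields into one sentence."""
    sounds = inputs.audio_scene_descriptors.get("sources", [])
    objects = inputs.visual_scene_descriptors.get("objects", [])
    ready = bool(sounds or objects)
    prompt = (
        f"At {inputs.context.get('timestamp', 'an unknown time')}, "
        f"the scene contains {', '.join(objects) or 'no visible objects'} "
        f"and {', '.join(sounds) or 'no audible sources'}. "
        f"Directive: {inputs.directive}."
    )
    status = PromptCreationStatus(
        prompt_ready=ready,
        alignment_status="unchecked",
        goal_framing="describe scene",
    )
    return PromptCreationOutputs(pc_prompt=prompt, status=status)


example = create_prompt(
    PromptCreationInputs(
        context={"timestamp": "2025-01-01T12:00:00Z"},
        audio_scene_descriptors={"sources": ["speech"]},
        visual_scene_descriptors={"objects": ["table", "cup"]},
        directive="generate",
    )
)
print(example.pc_prompt)
```

A real implementation would follow the JSON Metadata referenced in Section 5 rather than these ad hoc containers.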

4. SubAIMs

Table 2 describes the informative SubAIMs that have been identified for PGM-PRC.

Table 2 – Informative SubAIMs

| SubAIM | Function | Inputs | Outputs | To |
|---|---|---|---|---|
| CIP – Context Integration & Parsing | Assemble multimodal context and Entity State for prompt; select exact referents/relations in prompt scope. | ASD1; VSD1; ASD3; VSD3; Entity State | Item Zone Map; Validation Flags | CVV |
| CVV – Context & Validation Verifier (Prompt‑Scoped) | Final verification of consistency wrt input-text-referenced items/relations; apply policy gates; check zone feasibility. | Item Zone Map; ASD1; VSD1; ASD3; VSD3; Entity State; Validation Flags | Validation Status; Zone Feasibility Tag; Scene Consistency Flag | BSL |
| BSL – Behaviour Selection & Logic | Retrieve and rank candidate behaviours based on validated context and Entity State. | Entity State; Validation Status; Scene Consistency Flag | Behaviour Candidates List | EFR |
| EFR – Execution Feasibility & Recovery | Assess feasibility and produce fallback or adjusted prompt when needed. | Behaviour Candidates List; Validation Status; Zone Feasibility Tag | Execution Feasibility Status; Fallback Action | PCP |
| PCP – Prompt Composition & Planning | Prepare structured PC-Prompt Plan (JSON) integrating validated context, constraints, and behaviours. | Execution Feasibility Status; Behaviour Candidates List; Scene/Entity Tags | PC-Prompt Plan | PFT |
| PFT – Prompt Formatter (new) | Convert PC-Prompt Plan (JSON) into natural language PC-Prompt for BKN. | PC-Prompt Plan | PC-Prompt | BKN |
| LTM – Lifecycle Trace Metadata | Record provenance and traceability for audit/lifecycle management. | ASD1 ID; ASD3 ID; VSD1 ID; VSD3 ID; Execution Feasibility Status | Trace Log | AUC |
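
The "To" column of Table 2 implies a processing chain among the SubAIMs. As a rough illustration, the sketch below transcribes that column into a mapping and follows the links from a starting SubAIM until it reaches a destination outside the table (BKN or AUC). The names `ROUTES` and `chain_from` are hypothetical helpers introduced here, not part of the specification.

```python
# "To" column of Table 2 transcribed as a mapping.
# BKN and AUC are destinations outside PGM-PRC, so they have no entry.
ROUTES = {
    "CIP": "CVV",
    "CVV": "BSL",
    "BSL": "EFR",
    "EFR": "PCP",
    "PCP": "PFT",
    "PFT": "BKN",
    "LTM": "AUC",
}


def chain_from(start: str, routes: dict[str, str]) -> list[str]:
    """Follow "To" links from `start` until a destination outside the table."""
    path = [start]
    seen = {start}
    while path[-1] in routes:
        nxt = routes[path[-1]]
        if nxt in seen:
            raise ValueError(f"cycle detected at {nxt}")
        path.append(nxt)
        seen.add(nxt)
    return path


print(" -> ".join(chain_from("CIP", ROUTES)))
# prints: CIP -> CVV -> BSL -> EFR -> PCP -> PFT -> BKN
```

Starting from LTM instead yields the short chain LTM -> AUC, which reflects that the Trace Log is reported to A-User Control rather than passed along the prompt path.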

Table 3 defines the terms used.

Table 3 – Terminology used in this AIM

| Term | Definition |
|---|---|
| ASD1 / VSD1 | Audio Scene Descriptor (ASD1) and Visual Scene Descriptor (VSD1) generated by Context Capture. These represent the initial scene understanding before refinement. |
| ASD3 / VSD3 | Audio Scene Descriptor (ASD3) and Visual Scene Descriptor (VSD3) produced by ASR/VSR after their final interaction cycle with DAC. These are the most refined descriptors available to PRC. |
| Entity State | A structured representation of the current state of relevant entities (e.g., position, orientation, status flags) in the scene. Replaces the older concept of “User State.” |
| Item Zone Map | A mapping of referenced items (from the input text) to their spatial zones and relationships in the scene, derived from ASD1/VSD1 and ASD3/VSD3. Used to ground prompt content. |
| Validation Flags | Indicators generated by CIP to highlight potential ambiguities or risks (e.g., unresolved referents, missing zone info) for CVV to resolve. |
| Validation Status | The result of CVV’s checks (OK / WARN / FAIL) including reasons and confidence scores for multimodal consistency and policy compliance. |
| Zone Feasibility Tag | A tag indicating whether the spatial zone referenced in the prompt is physically/logically feasible for the intended action. |
| Scene Consistency Flag | A graded indicator confirming that the scene context assumed by the prompt matches the actual scene descriptors. |
| Behaviour Candidates List | A ranked list of possible behaviours/actions retrieved by BSL based on validated context and Entity State. |
| Execution Feasibility Status | A status object indicating whether the selected behaviour(s) can be executed under current constraints (e.g., zone reachability, operational limits). |
| Fallback Action | An alternative behaviour or adjusted prompt generated by EFR when execution feasibility fails. |
| PC-Prompt Plan | A structured JSON plan integrating validated context, constraints, and behaviours for natural language generation. Used internally by PRC to produce the PC-Prompt. |
| PC-Prompt | The natural-language prompt assembled by PRC and sent to BKN (LLM). |
| Trace Log | Provenance and lifecycle metadata recorded by LTM for audit and orchestration purposes. |
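
To make two of the terms above more concrete, the sketch below models a Validation Status (OK / WARN / FAIL with reasons and a confidence score) and shows how a PC-Prompt Plan in JSON might be turned into a natural-language PC-Prompt, as PFT is described as doing. The plan keys (`context`, `constraints`, `behaviours`), the class names, and the wording of the generated prompt are all assumptions made for illustration; the actual structure is not given on this page.

```python
import json
from dataclasses import dataclass, field
from enum import Enum


class Verdict(Enum):
    # The three outcomes named in the Validation Status definition.
    OK = "OK"
    WARN = "WARN"
    FAIL = "FAIL"


@dataclass
class ValidationStatus:
    # Hypothetical shape: verdict plus reasons and a confidence score.
    verdict: Verdict
    reasons: list[str] = field(default_factory=list)
    confidence: float = 1.0


def format_pc_prompt(plan_json: str) -> str:
    """Convert an assumed PC-Prompt Plan (JSON text) into a plain-text prompt."""
    plan = json.loads(plan_json)
    sentences = [f"Context: {plan['context']}."]
    if plan.get("constraints"):
        sentences.append("Constraints: " + "; ".join(plan["constraints"]) + ".")
    if plan.get("behaviours"):
        sentences.append(
            "Candidate behaviours, best first: " + ", ".join(plan["behaviours"]) + "."
        )
    return " ".join(sentences)


status = ValidationStatus(Verdict.WARN, reasons=["unresolved referent"], confidence=0.7)
plan = json.dumps(
    {
        "context": "the User points at the cup in zone A",
        "constraints": [f"validation {status.verdict.value}: {status.reasons[0]}"],
        "behaviours": ["confirm referent", "describe cup"],
    }
)
pc_prompt = format_pc_prompt(plan)
print(pc_prompt)
```

Keeping the plan as structured JSON and formatting it only in the last step means the same plan could be logged or inspected before any natural-language text is produced.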

5. JSON Metadata

https://schemas.mpai.community/PGM1/V1.0/AIMs/PromptCreation.json

6. Profiles

No Profiles.