(Tentative)
| Definition | Functional Requirements | Syntax | Semantics |
Definition
The Audio Action Directive (PGM-AAD) is a structured instruction set issued by the A-User Control AIM to guide the Audio Spatial Reasoning AIM (PGM-ASR) in interpreting and acting upon the audio scene. It encapsulates interaction goals, focus parameters, refinement logic, and temporal constraints.
Functional Requirements
- Interaction Intent: Specifies the type of audio interaction (e.g., listen, respond, ignore).
- Source Focus: Identifies priority sound sources or regions.
- Directional Constraints: Limits interpretation to specific azimuth/elevation ranges.
- Temporal Scope: Defines time windows for action relevance.
- Salience Override: Allows manual override of salience rankings.
- Refinement Logic: Specifies algorithms or heuristics to apply.
Syntax
Semantics
| Label | Description |
|---|---|
| Header | Schema identifier and version tag |
| ├─ Standard-AAD | The characters “PGM-AAD-V” |
| ├─ Version | Major version – 1 or 2 characters |
| ├─ Dot-separator | The character “.” separating version components |
| └─ Subversion | Minor version – 1 or 2 characters |
| AudioActionDirectiveID | Unique identifier for this Audio Action Directive instance |
| AudioSpatialOutputID | Identifier of the Audio Spatial Output instance being acted upon |
| MInstanceID | Metaverse instance ID |
| MEnvironmentID | Environment ID |
| InteractionIntent | Operational goal for audio interaction |
| TargetSources | List of sound sources with priority and confidence |
| ├─ SourceID | Identifier of the sound source |
| ├─ Priority | Relative importance: high, medium, or low |
| └─ Confidence | Confidence score (range: 0 to 1) |
| DirectionalConstraints | Azimuth and elevation bounds for spatial filtering |
| ├─ AzimuthRange | Horizontal angle range in degrees |
| └─ ElevationRange | Vertical angle range in degrees |
| TemporalScope | Time window for directive relevance |
| ├─ StartTime | Start of time window (ISO 8601 format) |
| └─ EndTime | End of time window (ISO 8601 format) |
| SalienceOverride | Manual override of salience scores |
| ├─ SourceID | Identifier of the sound source |
| └─ OverrideScore | New salience score (range: 0 to 1) |
| RefinementLogic | Algorithmic guidance for interpretation refinement |
| ├─ Algorithm | Name of the algorithm or heuristic |
| └─ Parameters | Key-value pairs for algorithm configuration |
| Trace | Provenance metadata |
| ├─ Origin | Module or subsystem that issued the directive |
| └─ Timestamp | Time of creation (refers to MPAI Time schema) |
| DescrMetadata | Descriptive metadata for audit, indexing, or semantic linking |