1 Function 2 Reference Model 3 Input/Output Data
4 SubAIMs 5 JSON Metadata 6 Profiles
7 Reference Software 8 Conformance Texting 9 Performance Assessment

1 Functions

  1. Receives the Spherical Harmonic Decomposition coefficients of the sound field
  2. Detects and directions of active sound sources (either be a speech or a non-speech signal) and to separate them.
  3. Produces the Transformed Speech and Audio Scene Geometry..

2 Reference Model

3 Input/Output Data

Input data Semantics
Spherical Harmonics Decomposition Coefficients Result of the transformation of Transform Multichannel Audio into the spherical frequency domain.
Source Model KB info Discrete-time and discrete-valued simple acoustic source models used in source separation.
Output data Semantics
Transform Audio Audio in the Transform domain.
Audio Scene Geometry Geometry of the Audio Scene.

4 SubAIMs

No SubAIMs.

5 JSON Metadata

https://schemas.mpai.community/CAE1/V2.4/AIMs/SpeechDetectionAndSeparation.json