This standard specifies a concept and data format for representation of the human voice at the raw-data level with optional inclusion of nonstandardized extended data. it does not address handling of data that has been processed to the feature or voice model levels.