Audio remains AAC 256k; choosing .mp3 only changes the file extension.
Leave blank to use server-configured token