Sound coding refers to methods for storing and transmitting audio data. The following article describes how such encodings work.
Note that this is a rather complicated topic - "Sound coding depth." The definition of this concept will also be given in our article. The concepts presented in this article are for general review only. We will reveal the concepts of sound coding depth. Some of this reference data may be helpful in understanding how the API works, as well as how to formulate and process audio in your applications.
How to find audio encoding depth
. , , WAV, , . WAV- , PCM.
, FLAC , , . Speech API FLAC — , , . . FLAC Speech API, . FLAC, «.FLAC».
WAV FLAC. , API WAV FLAC . , , API .
— ?
, . , , , . ( ) .
, . . — , , , .
. , . , ( ). .
-, , . , (20-20000 ), 40000 ( , CD 44100 ).
— . . , , .
"/" . 16 , CD . , , , . DVD Audio 24 , 8 .
( ) . ( -) ( PCM). , .
PCM ( , ) , - LINEAR16 Speech API. , , 16 . PCM 44 100 -, . 16000 .
PCM (LINEAR16) , . , Linear PCM, 16 (2 ) . — Linear PCM , - . .
, , , . , . , . , .
, , . .
, ? . , FLAC 0 ( ) 8 ( ). FLAC . .
API Speech : FLAC LINEAR16. LINEAR16 « », . , FLAC .
. Speech API , , .
MP3- . MP3 , MP3 .
, CD PCM 16 . :
441000 * 2 * 16 = 1411200 (/) = 1411 /
, MP3 , , 320 /, 128 / 96 /, . MP3 , . . , MP3- 96 / 128 /.
MULAW — 8- PCM, , . uLaw . uLaw , 16- LINEAR16 ( PCM) - 8- uLaw.
AMR and AMR_WB modulate the encoded audio cash register by introducing a variable bit rate into the original audio sample.
Although the Speech API supports several lossy formats, you should avoid them if you have control over the original audio. Although deleting such data through lossy compression may not have a noticeable effect on the sound heard by the human ear, the loss of such data for the speech recognition mechanism can significantly impair accuracy.