Sound coding depth - what is it? Definition, formula

Sound coding refers to methods for storing and transmitting audio data. The following article describes how such encodings work.

Sound coding depth is a fairly involved topic. This article defines the concept and outlines the ideas around it; the material is intended as a general overview only. Some of this reference information may help you understand how the Speech API works and how to prepare and process audio in your applications.

How to find audio encoding depth

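An audio format is not the same thing as an audio encoding. A popular file format such as WAV, for example, defines the layout of an audio file's header but is not itself an audio encoding. WAV files often, but not always, use linear PCM encoding.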

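FLAC, however, is both a file format and an encoding, which sometimes causes confusion. In the Speech API, FLAC is the only encoding that requires the audio data to include a header; all other encodings expect headerless audio. When this article mentions FLAC in the context of the Speech API, it means the codec. When it means the file format, it says "a .FLAC file".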

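You do not have to specify the encoding and sample rate for WAV or FLAC files. If they are omitted, the API determines the encoding and sample rate of a WAV or FLAC file from its header. If you specify a value that does not match the header, the API returns an error.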
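As a rough illustration of how encoding details can be read from a file header, the sketch below uses Python's standard `wave` module to build a tiny WAV file in memory and then read back its channel count, bit depth and sample rate. The helper names `describe_wav` and `make_silent_wav` are made up for this example, and this is not how the Speech API itself inspects headers.

```python
import io
import wave


def describe_wav(data: bytes) -> dict:
    """Return basic header fields of a PCM WAV file held in memory."""
    with wave.open(io.BytesIO(data), "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "bit_depth": wav.getsampwidth() * 8,  # bytes per sample -> bits
            "sample_rate_hz": wav.getframerate(),
            "frames": wav.getnframes(),
        }


def make_silent_wav(sample_rate_hz: int = 16000, frames: int = 1600) -> bytes:
    """Build a short mono 16-bit WAV containing silence (demo data only)."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes = 16 bits per sample
        wav.setframerate(sample_rate_hz)
        wav.writeframes(b"\x00\x00" * frames)
    return buffer.getvalue()


if __name__ == "__main__":
    print(describe_wav(make_silent_wav()))
```

The same `describe_wav` call works on the bytes of a WAV file read from disk, which is a quick way to check what a file actually contains before sending it anywhere.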

— ?

, . , , , . ( ) .

, . . — , , , .

. , . , ( ). .

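First, the sample rate must be high enough: as a rule, audio has to be sampled at more than twice the highest frequency you want to capture. To represent the full range of human hearing (20-20000 Hz), a digital format must therefore take at least 40000 samples per second (which is part of the reason CD audio uses a sample rate of 44100 Hz).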
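The relationship between the highest frequency to capture and the sample rate can be written as a short calculation. This is a minimal sketch of the "twice the highest frequency" rule of thumb; the function names are chosen for this example only.

```python
def min_sample_rate_hz(highest_frequency_hz: float) -> float:
    """Lower bound on the sample rate needed to capture a given frequency."""
    if highest_frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    return 2 * highest_frequency_hz


def highest_representable_hz(sample_rate_hz: float) -> float:
    """Nyquist frequency: half of the sample rate."""
    if sample_rate_hz <= 0:
        raise ValueError("sample rate must be positive")
    return sample_rate_hz / 2


if __name__ == "__main__":
    print(min_sample_rate_hz(20000))        # 40000
    print(highest_representable_hz(44100))  # 22050.0
    print(highest_representable_hz(16000))  # 8000.0
```

A 16000 Hz recording can therefore carry frequencies up to about 8000 Hz, which covers most of the energy in speech.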

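Second is bit depth, which determines the dynamic range of an audio sample. A higher bit depth lets you represent amplitudes more precisely. If a recording contains both very loud and very quiet sounds, it needs more bits per sample to represent them correctly.

Bit depth also affects the signal-to-noise ratio of a recording: the more bits per sample, the lower the quantization noise. CD audio uses a bit depth of 16 bits. DVD Audio uses 24 bits, while most telephony equipment uses 8 bits.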

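Most digital audio processing uses these two techniques (sampling and bit depth) to store audio data in a straightforward way. One of the most popular techniques (popularized by the compact disc) is pulse-code modulation (PCM). The audio is sampled at set intervals, and the amplitude at each point is stored as a digital value.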

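Linear PCM (in which the amplitude response is linear across the sample range) is the standard used on CDs and by the LINEAR16 encoding in the Speech API. Both produce an uncompressed stream of bytes that corresponds directly to the audio data, and both use a bit depth of 16 bits. On CDs, Linear PCM uses a sample rate of 44100 Hz, which suits music. A sample rate of 16000 Hz is better suited to speech.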
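The usual formula for coding depth is N = 2^i, where i is the number of bits per sample and N is the number of distinct amplitude levels that can be stored. The sketch below evaluates it for the common depths of 8, 16 and 24 bits and adds the approximate theoretical dynamic range, 20 * log10(N). The function names are illustrative assumptions.

```python
import math


def quantization_levels(bit_depth: int) -> int:
    """Number of distinct amplitude values: N = 2 ** i."""
    if bit_depth <= 0:
        raise ValueError("bit depth must be positive")
    return 2 ** bit_depth


def dynamic_range_db(bit_depth: int) -> float:
    """Approximate theoretical dynamic range in decibels."""
    return 20 * math.log10(quantization_levels(bit_depth))


if __name__ == "__main__":
    for bits in (8, 16, 24):
        levels = quantization_levels(bits)
        print(f"{bits:2d} bits: {levels:>10d} levels, "
              f"about {dynamic_range_db(bits):.1f} dB")
```

Each extra bit doubles the number of levels and adds roughly 6 dB of dynamic range.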

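Linear PCM (LINEAR16) is an example of uncompressed audio: the digital data is stored exactly as described above. When reading a single-channel stream of bytes encoded with Linear PCM, you can take every 16 bits (2 bytes) to get the next amplitude value of the waveform. Almost any device can handle such data natively, and you can even trim Linear PCM audio in a text editor, but uncompressed audio is not the most efficient way to transmit or store digital sound. For that reason, most audio is compressed.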
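Reading a 16-bit Linear PCM stream two bytes at a time can be sketched with Python's standard `struct` module. The example assumes mono, signed, little-endian samples with no header; `decode_linear16` is a hypothetical helper name, not part of any API.

```python
import struct


def decode_linear16(raw: bytes) -> list[int]:
    """Decode mono little-endian signed 16-bit PCM into integer amplitudes."""
    if len(raw) % 2 != 0:
        raise ValueError("data must contain a whole number of 2-byte samples")
    count = len(raw) // 2
    # "<" = little-endian, "h" = signed 16-bit integer
    return list(struct.unpack(f"<{count}h", raw))


if __name__ == "__main__":
    # Four hand-made samples packed the same way the decoder expects.
    raw = struct.pack("<4h", 0, 1000, -1000, 32767)
    print(len(raw))              # 8 bytes for 4 samples
    print(decode_linear16(raw))  # [0, 1000, -1000, 32767]
```

Each value falls between -32768 and 32767, which is the full range that 16 bits can express.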

, , , . , . , . ​​ , .

, , . .

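Why, then, do lossless compression methods sometimes have tuning parameters? These parameters usually trade file size against compression and decompression time. FLAC, for example, uses a compression level from 0 (fastest) to 8 (smallest file). A higher FLAC compression level does not lose any information compared with a lower one. The codec simply spends more computation when encoding or decoding the audio.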

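The Speech API supports two lossless encodings: FLAC and LINEAR16. Strictly speaking, LINEAR16 is not "lossless compression", because no compression is involved at all. If file size or transfer volume matters to you, choose FLAC.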

. Speech API , , .

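The popular MP3 codec is an example of lossy encoding. MP3 compression discards audio information that most listeners are unlikely to hear, and the amount of compression is controlled by the MP3 codec's effective bit rate, that is, the number of bits per second used to store the audio.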

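For example, stereo CD audio using Linear PCM at 16 bits has an effective bit rate of:

44100 * 2 channels * 16 bits = 1411200 bits per second (bps) = 1411 kbps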
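The bit rate of uncompressed PCM is the product of sample rate, channel count and bit depth. The sketch below computes it for CD audio (44100 Hz, stereo, 16 bits) and for a typical speech recording (16000 Hz, mono, 16 bits); the function names are assumptions made for this example.

```python
def pcm_bit_rate_bps(sample_rate_hz: int, channels: int, bit_depth: int) -> int:
    """Bits per second of uncompressed PCM audio."""
    return sample_rate_hz * channels * bit_depth


def pcm_size_bytes(sample_rate_hz: int, channels: int, bit_depth: int,
                   seconds: int) -> int:
    """Size in bytes of the raw audio data (any file header not included)."""
    return pcm_bit_rate_bps(sample_rate_hz, channels, bit_depth) * seconds // 8


if __name__ == "__main__":
    print(pcm_bit_rate_bps(44100, 2, 16))    # 1411200 bps, about 1411 kbps
    print(pcm_size_bytes(44100, 2, 16, 60))  # 10584000 bytes per minute
    print(pcm_bit_rate_bps(16000, 1, 16))    # 256000 bps
```

One minute of CD-quality audio therefore takes roughly 10.6 MB before any compression.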

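MP3 compression reduces this by using bit rates such as 320 kbps, 128 kbps, or 96 kbps, with a corresponding loss of audio quality. MP3 also supports variable bit rates, which can compress the audio further. Both approaches lose information and can affect quality. Most people, for example, can hear the difference between MP3 music encoded at 96 kbps and at 128 kbps.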

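MULAW is an 8-bit PCM encoding in which the sample amplitude is modulated logarithmically rather than linearly. As a result, uLaw reduces the effective dynamic range of the audio it compresses. Although uLaw was introduced specifically to optimize the encoding of speech, 16-bit LINEAR16 (uncompressed PCM) still gives much better quality than 8-bit uLaw.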
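To show what "logarithmic rather than linear" means in practice, here is a minimal sketch of the continuous mu-law companding curve with mu = 255. It works on amplitudes normalized to the range -1 to 1 and is not a bit-exact implementation of the telephony codec; the function names are chosen for this example only.

```python
import math

MU = 255  # companding constant commonly used with 8-bit mu-law


def mulaw_compress(x: float) -> float:
    """Map an amplitude in [-1, 1] onto the logarithmic mu-law curve."""
    x = max(-1.0, min(1.0, x))
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mulaw_expand(y: float) -> float:
    """Inverse of mulaw_compress."""
    y = max(-1.0, min(1.0, y))
    return math.copysign(((1 + MU) ** abs(y) - 1) / MU, y)


if __name__ == "__main__":
    for amplitude in (0.01, 0.1, 0.5, 1.0):
        compressed = mulaw_compress(amplitude)
        restored = mulaw_expand(compressed)
        print(f"{amplitude:5.2f} -> {compressed:.4f} -> {restored:.4f}")
```

Quiet sounds receive a disproportionately large share of the output range, which helps speech fit into 8 bits but leaves less precision than 16-bit linear samples.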

AMR and AMR_WB modulate the encoded audio sample by introducing a variable bit rate into the source audio sample.

Although the Speech API supports several lossy formats, avoid them if you have control over the source audio. Removing data through lossy compression may have no noticeable effect on the sound as heard by the human ear, but losing that data can significantly reduce the accuracy of a speech recognition engine.



