Topic: Which codec for speech? OGG or MP3? (Read 12503 times)

Which codec for speech? OGG or MP3?

I want to encode speech. Which codec should I use: OGG or MP3? What quality setting (Ogg) or bitrate (MP3) do you think I should use?

Thank you

Reply #1
I think you should try Ogg Vorbis, specifically the aoTuV beta 4 version. I'm not sure which quality level to pick, so try it yourself and find the one that suits you.

Reply #2
I tried the original OggDropXPd. I am extremely satisfied with q -1.00 (nominal bitrate 45 kbps). Can it go even lower?

Reply #3
OggDropXPd based on aoTuV beta 4 has a -q-2 mode (~32 kbps).
You can also reduce the sample rate to lower the bitrate; according to some people, it also increases the quality (aoTuV at 32 kbps and 44100 Hz is not really pleasant, IMO).

Reply #4
Quote
OggDropXPd based on aoTuV beta 4 has a -q-2 mode (~32 kbps).
You can also reduce the sample rate to lower the bitrate; according to some people, it also increases the quality (aoTuV at 32 kbps and 44100 Hz is not really pleasant, IMO).


OK, tell me more. I don't have time to test; what sample rate should I use?

Reply #5
I am interested in getting the smallest size with decent quality for speech/voice. I have three formats available: MP3, OGG and WMA. I don't have time to test, so I will trust whatever you audio experts recommend.

Thank you all for your ultra fast replies.

Reply #6
I can't help you. I'm only repeating what some other people have said about Vorbis and ultra-low-bitrate encoding. Sorry. But at such a low bitrate, I would say it's neither very hard nor very time-consuming to test it on your side.

Reply #7
Quote
I can't help you. I'm only repeating what some other people have said about Vorbis and ultra-low-bitrate encoding. Sorry. But at such a low bitrate, I would say it's neither very hard nor very time-consuming to test it on your side.


That would do! What sample rate do you recommend for 43 kbps Ogg?

Reply #8
As I already said, I haven't tested it. Therefore, it's very hard for me to recommend anything (apart from testing it yourself).

Reply #9
Since the bloke is desperate: a wireline phone has 4 kHz of bandwidth, so a sampling rate of 8 kHz should be enough to understand the voice. You might want to choose a bit higher. I do not know what choices are provided.
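
The 4 kHz to 8 kHz step above is the Nyquist rule: the sample rate has to be at least twice the highest frequency you want to keep. A minimal Python sketch of that arithmetic follows; the function names are made up for illustration, and the list of rates is just an assumption about what an encoder might offer.

```python
def min_sample_rate_hz(bandwidth_hz: float) -> float:
    """Nyquist criterion: sample at least twice the highest frequency kept."""
    return 2.0 * bandwidth_hz


def usable_bandwidth_hz(sample_rate_hz: float) -> float:
    """Highest frequency that a given sample rate can represent."""
    return sample_rate_hz / 2.0


# Telephone-style speech: 4 kHz of bandwidth needs an 8 kHz sample rate.
print(min_sample_rate_hz(4000))

# Assumed candidate rates; check what your encoder actually accepts.
for rate in (8000, 11025, 16000, 22050, 44100):
    print(f"{rate} Hz sampling keeps audio up to {usable_bandwidth_hz(rate):.0f} Hz")
```

Going a bit above 8 kHz, as suggested, simply leaves more room for the consonant detail that sits above the telephone band.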

 

Reply #11
16 kHz, 16 bit, mono are perfectly OK for pure speech.
"ONLY THOSE WHO ATTEMPT THE IMPOSSIBLE WILL ACHIEVE THE ABSURD"
        - Oceania Association of Autonomous Astronauts

Reply #12
Quote
Why not use speex? http://speex.org/

It is designed for voice encoding.


Yes, and it has three modes: narrowband (8 kHz), wideband (16 kHz) and ultra-wideband (32 kHz).
budding I.T professional

Reply #13
Originally posted in this thread.

Quote
Quote
Apart from encoding to mono, can anyone suggest space saving parameters for speech only files?
I recently did a project similar to this.  It took a little trial and error, but the settings I ended up with seem to be the best for me.

I started with 16-bit mono WAV files at 22050Hz and converted them to MP3 using LAME 3.96.1 with the following parameters:

-V3 --vbr-new --lowpass 8

These settings create an MP3 file with a bit-rate around 48kbps and an 8kHz low-pass filter, which seems fine for speech.

A typical 45 minute speech will reduce from ~115M (WAV) to ~15M (MP3) in about 35 seconds on my computer (P4 2.8GHz, 1G RAM, Windows XP).

Hope this helps...
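
As a rough cross-check of the sizes quoted above, here is a minimal Python sketch of the arithmetic. The function names are illustrative only. It ignores the WAV header and MP3 framing overhead, and it treats 48 kbps as a constant average even though -V3 is a VBR setting, so the real files will differ somewhat.

```python
def pcm_size_bytes(seconds: int, sample_rate_hz: int,
                   bits_per_sample: int = 16, channels: int = 1) -> int:
    """Size of uncompressed PCM audio, header not included."""
    return seconds * sample_rate_hz * (bits_per_sample // 8) * channels


def encoded_size_bytes(seconds: int, avg_bitrate_kbps: int) -> int:
    """Approximate size of a compressed stream at a given average bitrate."""
    return seconds * avg_bitrate_kbps * 1000 // 8


MIB = 1024 * 1024
duration_s = 45 * 60  # the 45 minute speech from the post

wav_bytes = pcm_size_bytes(duration_s, 22050)   # 16-bit mono at 22050 Hz
mp3_bytes = encoded_size_bytes(duration_s, 48)  # about 48 kbps on average

print(f"WAV: {wav_bytes / MIB:.1f} MiB")
print(f"MP3: {mp3_bytes / MIB:.1f} MiB")
print(f"Ratio: {wav_bytes / mp3_bytes:.2f} : 1")
```

Both estimates land close to the ~115M and ~15M figures reported in the post.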

Reply #14
I would have looked into Speex or aoTuV b4, and maybe also HE-AAC, before going with MP3 on this. But if it's to be played on regular DAPs, DVD players or similar devices that only do MP3 and WMA, there have been several threads on the forum about the best settings for audiobooks and the like.


EDIT: Typo.

Reply #15
The reason I am not going with other codecs is that they are not supported by my MP3 player. Unfortunately, such low Ogg bitrates are not supported either. I played a bit with MP3 and found that 24 kbps at 11 kHz is enough for my needs. The files are a bit bigger than the Ogg ones, but I guess I'll have to live with that.

Thank you all for your fast replies.

Reply #16
Here are some very impressive (in my opinion) mp3 settings for voice:

--abr 16 -a --resample 11 --lowpass 5 --athtype 2 -X3

--alt-preset 24 -a --resample 22 --lowpass 7

The second line is obviously better than the first, but the first line has very small file sizes.

Reply #17
Quote
Here are some very impressive (in my opinion) mp3 settings for voice:

--abr 16 -a --resample 11 --lowpass 5 --athtype 2 -X3

--alt-preset 24 -a --resample 22 --lowpass 7

The second line is obviously better than the first, but the first line has very small file sizes.


I get this error from the second one:

Command: C:\Program Files\Music\MP3\Lame 3.90.3\lame.exe --alt-preset 24 -a --resample 22 --lowpass 7 "C:\Documents " "C:\Documents "
LAME version 3.90.3 MMX  (http://www.mp3dev.org/)
-- Compiled at http://www.hydrogenaudio.org
-- Check this website for up to date information on the --alt-presets
Error: The bitrate specified is out of the valid range for this preset
When using this mode you must enter a value between "80" and "320"
For further information try: "C:\Program Files\Music\MP3\Lame 3.90.3\lame.exe --alt-preset help"
RazorLame encountered an unknown message from LAME while trying to encode "C:\Documents "!

Encoded 0 files in 0:00:00
There was an unexpected LAME message for one file, please check log for error messages.

I used 16 kbps and 11 kHz and I get an output of 294. The same file is 360 with the first switch.

Reply #18
Here are the full parameter lines I am using in foobar's CLI:

--abr 16 -a --resample 11 --lowpass 5 --athtype 2 -X3 - %d
--alt-preset 24 -a --resample 22 --lowpass 7 - %d

I am using them with LAME 3.97a11.

I just tried them and they worked perfectly. It must be because you are using LAME 3.90.3.
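
The error message quoted in the previous reply states the range that LAME 3.90.3 accepts for a numeric --alt-preset: "80" to "320". A minimal Python sketch of a pre-check based only on that message is below. The helper is hypothetical and not part of LAME, and the limits are an assumption that applies to 3.90.3 only; the 3.97a11 build mentioned here accepted 24.

```python
# Range copied from the LAME 3.90.3 error message quoted in this thread.
# Assumption: these limits apply to that version only.
ALT_PRESET_MIN_KBPS = 80
ALT_PRESET_MAX_KBPS = 320


def alt_preset_bitrate_ok(kbps: int) -> bool:
    """True if LAME 3.90.3 would accept '--alt-preset <kbps>'."""
    return ALT_PRESET_MIN_KBPS <= kbps <= ALT_PRESET_MAX_KBPS


for kbps in (16, 24, 128):
    verdict = "accepted" if alt_preset_bitrate_ok(kbps) else "rejected"
    print(f"--alt-preset {kbps}: {verdict} by 3.90.3")
```

With 3.90.3, the first parameter line (plain --abr 16) avoids the preset range check altogether, which fits the report that it produced output while the second line did not.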

Reply #19
Wouldn't HE-AAC offer no advantage over LC-AAC, since speech has no frequencies high enough to benefit from SBR? Or did I miss something here?
Veni Vidi Vorbis.