IPB

Welcome Guest ( Log In | Register )

 
Reply to this topicStart new topic
Which codec for speech? OGG or MP3?
alfa156
post Sep 5 2005, 14:22
Post #1





Group: Members
Posts: 126
Joined: 19-December 02
From: Athens, Greece
Member No.: 4152



I want to encode speech. Which coded should i use? OGG or MP3? What quality setting (ogg) and bitrate (mp3) do you think i should use?

Thank you
Go to the top of the page
+Quote Post
cyborg
post Sep 5 2005, 14:40
Post #2





Group: Members
Posts: 76
Joined: 21-January 03
From: Earth
Member No.: 4663



I think you should try Ogg Vorbis, and the aoTuV beta 4 version. I'm not sure about the quality, but you should try it yourself and find out the quality level that suits you.
Go to the top of the page
+Quote Post
alfa156
post Sep 5 2005, 14:44
Post #3





Group: Members
Posts: 126
Joined: 19-December 02
From: Athens, Greece
Member No.: 4152



i tried the original ogg drop xp. I am extremely satisfied with q -1,00 (nominal bitrate 45kbps). Can it go even lower?
Go to the top of the page
+Quote Post
guruboolez
post Sep 5 2005, 14:50
Post #4





Group: Members (Donating)
Posts: 3474
Joined: 7-November 01
From: Strasbourg (France)
Member No.: 420



OggDropXPd based on aoTuV beta 4 has a -q-2 mode (~32 kbps).
You can also reduce the samplerate to lower the bitrate: according to some people, it also increase the quality (aoTuV at 32 kbps and 44100 Hz is not really pleasant IMO).
Go to the top of the page
+Quote Post
alfa156
post Sep 5 2005, 14:53
Post #5





Group: Members
Posts: 126
Joined: 19-December 02
From: Athens, Greece
Member No.: 4152



QUOTE (guruboolez @ Sep 5 2005, 03:50 PM)
OggDropXPd based on aoTuV beta 4 has a -q-2 mode (~32 kbps).
You can also reduce the samplerate to lower the bitrate: according to some people, it also increase the quality (aoTuV at 32 kbps and 44100 Hz is not really pleasant IMO).
*


ok tell me more. I dont have time to test, what sample rate should i use?
Go to the top of the page
+Quote Post
alfa156
post Sep 5 2005, 14:59
Post #6





Group: Members
Posts: 126
Joined: 19-December 02
From: Athens, Greece
Member No.: 4152



i am interested in gettin the smallest size with a decent quality for speech/voice. I have 3 formats available MP3/OGG/WMA. Don't have time to test, i will trust whatever you audio experts recommend me.

Thank you all for your ultra fast replies.

This post has been edited by alfa156: Sep 5 2005, 15:00
Go to the top of the page
+Quote Post
guruboolez
post Sep 5 2005, 15:02
Post #7





Group: Members (Donating)
Posts: 3474
Joined: 7-November 01
From: Strasbourg (France)
Member No.: 420



I can't help you. I'm only repeating what some other people said about vorbis and ultra-low bitrate encoding. Sorry. But at such low bitrate I would say that's it's not very hard nor very long to test it on your side smile.gif
Go to the top of the page
+Quote Post
alfa156
post Sep 5 2005, 15:03
Post #8





Group: Members
Posts: 126
Joined: 19-December 02
From: Athens, Greece
Member No.: 4152



QUOTE (guruboolez @ Sep 5 2005, 04:02 PM)
I can't help you. I'm only repeating what some other people said about vorbis and ultra-low bitrate encoding. Sorry. But at such low bitrate I would say that's it's not very hard nor very long to test it on your side smile.gif
*


that would do! what sample rate do you recommend for 43kbps ogg?
Go to the top of the page
+Quote Post
guruboolez
post Sep 5 2005, 15:07
Post #9





Group: Members (Donating)
Posts: 3474
Joined: 7-November 01
From: Strasbourg (France)
Member No.: 420



As I already said it, I haven't tested it. Therefore, it's very hard for me to recommand you anything (apart testing by yourself).

This post has been edited by guruboolez: Sep 5 2005, 15:07
Go to the top of the page
+Quote Post
Triza
post Sep 5 2005, 15:15
Post #10





Group: Members
Posts: 367
Joined: 16-November 03
Member No.: 9867



Since the bloke is desperate: A wireline phone has 4 kHz bandwidth. So sampling rate at 8 kHz should be enough to understand the voice. You might want to choose a bit higher. I do not know what choices are provided.
Go to the top of the page
+Quote Post
legg
post Sep 5 2005, 15:49
Post #11





Group: Members
Posts: 175
Joined: 5-March 05
From: Morelia, Mexico
Member No.: 20386



Why not use speex? http://speex.org/

It is designed for voice encoding.


--------------------
Home page: http://lc.fie.umich.mx/~legg/indexen.php
Go to the top of the page
+Quote Post
Mr_Rabid_Teddybe...
post Sep 5 2005, 15:56
Post #12





Group: Members
Posts: 1197
Joined: 3-September 03
From: Bergen, Norway
Member No.: 8667



16 kHz, 16 bit, mono are perfectly OK for pure speech.


--------------------
"ONLY THOSE WHO ATTEMPT THE IMPOSSIBLE WILL ACHIEVE THE ABSURD"
- Oceania Association of Autonomous Astronauts
Go to the top of the page
+Quote Post
HotshotGG
post Sep 5 2005, 16:27
Post #13





Group: Members
Posts: 1593
Joined: 24-March 02
From: Revere, MA
Member No.: 1607



QUOTE
Why not use speex? http://speex.org/

It is designed for voice encoding.


Yes and it has three modes including wide-band (16 kHz) and ultra-wideband (32kHz) wink.gif


--------------------
College student/IT Assistant
Go to the top of the page
+Quote Post
esa372
post Sep 5 2005, 16:29
Post #14





Group: Members (Donating)
Posts: 429
Joined: 5-September 04
From: Los Angeles
Member No.: 16796



Originally posted in this thread.

QUOTE (esa372 @ Jun 30 2005, 09:22 AM)
QUOTE (ardea @ Jun 30 2005, 07:28 AM)
Apart from encoding to mono, can anyone suggest space saving parameters for speech only files?
I recently did a project similar to this. It took a little trial and error, but the settings I ended up with seem to be the best for me.

I started with 16-bit mono WAV files at 22050Hz and converted them to MP3 using LAME 3.96.1 with the following parameters:

-V3 --vbr-new --lowpass 8

These settings create an MP3 file with a bit-rate around 48kbps and an 8kHz low-pass filter, which seems fine for speech.

A typical 45 minute speech will reduce from ~115M (WAV) to ~15M (MP3) in about 35 seconds on my computer (P4 2.8GHz, 1G RAM, Windows XP).

Hope this helps...


--------------------
Clowns love haircuts; so should Lee Marvin's valet.
Go to the top of the page
+Quote Post
Mr_Rabid_Teddybe...
post Sep 5 2005, 16:39
Post #15





Group: Members
Posts: 1197
Joined: 3-September 03
From: Bergen, Norway
Member No.: 8667



I would have looked into Speex or aoTuVb4, maybe also HE-AAC before going mp3 on this. But if it's to be played by the regular DAP's, DVD players or such that only do mp3 and wma, there's been several threads on the forum on best settings for audiobooks and such.


EDIT: Typo.

This post has been edited by Mr_Rabid_Teddybear: Sep 5 2005, 16:40


--------------------
"ONLY THOSE WHO ATTEMPT THE IMPOSSIBLE WILL ACHIEVE THE ABSURD"
- Oceania Association of Autonomous Astronauts
Go to the top of the page
+Quote Post
alfa156
post Sep 5 2005, 18:42
Post #16





Group: Members
Posts: 126
Joined: 19-December 02
From: Athens, Greece
Member No.: 4152



the reason i'm am not going to other codec is that they are not supported by my mp3 player. Unforunatelly, so low bitrates of ogg also are not supported. I played a bit with mp3 and i found that 24 kbps 11kHz are enough for my needs. They are a bit bigger as files than ogg but i guess i'll have to live with than

Thank you all for your fast replies.
Go to the top of the page
+Quote Post
AlexanderTG
post Sep 5 2005, 20:31
Post #17





Group: Members
Posts: 216
Joined: 26-March 05
Member No.: 20965



Here are some very impressive (in my opinion) mp3 settings for voice:

--abr 16 -a --resample 11 --lowpass 5 --athtype 2 -X3

--alt-preset 24 -a --resample 22 --lowpass 7

The second line is obviously better than the first, but the first line has very small file sizes.

This post has been edited by AlexanderTG: Sep 5 2005, 20:35
Go to the top of the page
+Quote Post
alfa156
post Sep 5 2005, 20:45
Post #18





Group: Members
Posts: 126
Joined: 19-December 02
From: Athens, Greece
Member No.: 4152



QUOTE (AlexanderTG @ Sep 5 2005, 09:31 PM)
Here are some very impressive (in my opinion) mp3 settings for voice:

--abr 16 -a --resample 11 --lowpass 5 --athtype 2 -X3

--alt-preset 24 -a --resample 22 --lowpass 7

The second line is obviously better than the first, but the first line has very small file sizes.
*


i get this error from the second one

Command: C:\Program Files\Music\MP3\Lame 3.90.3\lame.exe --alt-preset 24 -a --resample 22 --lowpass 7 "C:\Documents " "C:\Documents "
LAME version 3.90.3 MMX (http://www.mp3dev.org/)
-- Compiled at http://www.hydrogenaudio.org
-- Check this website for up to date information on the --alt-presets
Error: The bitrate specified is out of the valid range for this preset
When using this mode you must enter a value between "80" and "320"
For further information try: "C:\Program Files\Music\MP3\Lame 3.90.3\lame.exe --alt-preset help"
RazorLame encountered an unknown message from LAME while trying to encode "C:\Documents "!

Encoded 0 files in 0:00:00
There was an unexpected LAME message for one file, please check log for error messages.

I used 16kbps and 11kHz and i get an output of 294. Same file is 360 with the first switch

This post has been edited by alfa156: Sep 5 2005, 20:47
Go to the top of the page
+Quote Post
AlexanderTG
post Sep 5 2005, 20:55
Post #19





Group: Members
Posts: 216
Joined: 26-March 05
Member No.: 20965



Here are the full parameter lines I am using in foobars cli

--abr 16 -a --resample 11 --lowpass 5 --athtype 2 -X3 - %d
--alt-preset 24 -a --resample 22 --lowpass 7 - %d

and am using them with lame 3.97a11

I just tried them and they worked perfectly. Must be because you are using 3.90.3 lame.

This post has been edited by AlexanderTG: Sep 5 2005, 21:07
Go to the top of the page
+Quote Post
HbG
post Sep 6 2005, 01:08
Post #20





Group: Members
Posts: 289
Joined: 12-May 03
From: The Hague
Member No.: 6555



Wouldn't HE-AAC offer no advantage over LC-AAC because speech has no frequencies high enough to benefit from the SBR? or did i miss something here?


--------------------
Veni Vidi Vorbis.
Go to the top of the page
+Quote Post

Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 



RSS Lo-Fi Version Time is now: 18th September 2014 - 19:09