Skip to main content

Notice

Please note that most of the software linked on this forum is likely to be safe to use. If you are unsure, feel free to ask in the relevant topics, or send a private message to an administrator or moderator. To help curb the problems of false positives, or in the event that you do find actual malware, you can contribute through the article linked here.
Topic: Multiformat Listening Test @ 64 kbps - FINISHED (Read 112805 times) previous topic - next topic
0 Members and 1 Guest are viewing this topic.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #50
It is possible to get that data using chunky on the complete test results which are available in .rar.

I already did that, but I wasn't sure if any corrections were needed. Here's what Chunky calculated (please correct me if the values are not right):

Code: [Select]
% Sample Averages:
WMA    High    Vorbis    Low    Nero
3.32    4.78    2.09    1.81    4.21
3.39    4.58    2.21    1.50    3.93
3.84    4.61    3.91    1.22    3.88
3.87    4.70    3.81    1.80    4.27
3.45    4.84    3.25    1.61    3.92
3.21    4.69    2.94    1.37    3.37
2.79    4.55    3.20    1.31    2.78
3.55    4.80    3.36    1.86    4.01
3.30    4.60    3.80    1.47    3.76
4.25    4.47    4.22    1.59    4.33
3.84    4.71    3.73    1.48    3.92
2.92    4.13    2.94    1.45    2.74
3.90    4.47    3.34    1.34    3.85
3.54    4.26    3.29    1.30    3.84
3.16    4.50    3.50    1.50    3.36
3.67    4.86    2.78    2.03    3.63
3.54    4.49    3.78    1.85    3.49
3.87    4.58    3.60    1.41    3.96

% Codec averages:
3.52    4.59    3.32    1.55    3.74


and here's the chart:

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #51
This looks much better now. Nero is at its place
The only way to check values is to compare them to those that Mares published.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #52
This one goes to the experts:

How would you rank codecs in such a situation, where A=B and B=C, but C<A?

That is actually not a contradiction as such (though further expert opinion on the actual statistical metric used is needed).

You think that is a contradiction, because such situation doesn't happen in "normal" number systems, like integers, reals, etc. What you noticing is the property of total order breaking. However not all valuations have that property.

Take integers, if you take 2 integers at random there is the way to count from one to the other, precisely because there is a total order and you know what is less/greater than what, what equals what and what follows what.
On the other hand take Complex numbers, this is the first number system students usually exposed to in school that doesn't have total order on its elements (though school teachers don't usually mention that). Given 2 random Complex numbers there isn't "the" way to count from one to the other, in fact there are infinitely many ways, all correct in some sense.

So while I don't know if the underlying statistical measure produces set of values that has total order, your example (if not subject to some freaky error) shows that it doesn't, and should be read as raking:

1) HE = WMA
2) Ogg

My immediate intuition would be to use equivalence classes to solve this problem.
1) Make individual comparison between every possible pair
2) Look at the ones with strict inequalities
3) Pick the largest of them all (strictly greater, not >=)
4) Rank that first
5) Add all who are directly equal to it to its equivalence class (not ones that are equal by some chain of equalities)
6) Removing them from further consideration
7) From remaining, rank the next largest as number 2
8) Go to 5 and repeat for the rest of the ranking.
The Plan Within Plans

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #53
How about use approximation instead of equality.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #54

This one goes to the experts:

How would you rank codecs in such a situation, where A=B and B=C, but C<A?

That is actually not a contradiction as such...

Same opinion for me.
For clearly defined objects A, B, C, a clearly defined identy and a clearly defined <-relation  it would be a contradiction.
Here A and B correspond to the quality of Nero HE AAC and that of WMA pro, and C corresponds to that of Vorbis. Quality as measured with this test. 
The problem is in the meaning of '=' and '<' as these are rough quality comparison operators which can easily make up for such a pseudo-contradiction.

The zoomed view is a major evil to me as it overestimates such a rough '<' comparison.
From absolute view ranging from 1.0 to 5.0 it's easy to say 'all these three encoders yield roughly the same quality with vorbis being a tiny bit behind.'
This is what is most important in practice, cause with these codecs you usually don't have the choice which one to use on a mobile device. No matter which one you use: you get state of the art 64 kbps technology regarding quality.

If it's up to elaborating differences between the encoders the very personal preferences are much more of concern than the overall small quality differences according to the test.
lame3995o -Q1.7 --lowpass 17


Multiformat Listening Test @ 64 kbps - FINISHED

Reply #56
Interesting results. I only use Vorbis for high bitrates (my own music collection on my PC's hard disk), since I expected low bitrates (for a small, portable mp3 player) to kill Vorbis. Seems like I was wrong.

With those results, low bitrate Vorbis internet streams make more sense now!

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #57
Wouldn't it make more sense just to go with one of the winners for low bitrates?  (Also note that on some of the subtests, ogg did much more poorly relative to the others.)  Or are there other tradeoffs here? 



Interesting results. I only use Vorbis for high bitrates (my own music collection on my PC's hard disk), since I expected low bitrates (for a small, portable mp3 player) to kill Vorbis. Seems like I was wrong.

With those results, low bitrate Vorbis internet streams make more sense now!

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #58
Wouldn't it make more sense just to go with one of the winners for low bitrates?  (Also note that on some of the subtests, ogg did much more poorly relative to the others.)  Or are there other tradeoffs here?

Well, for streaming purposes you're right, 64 kbit/s WMA10 Pro/Nero AAC seem to be better choices than Vorbis here. For low bitrates in general this isn't implicitly the case, because Vorbis, unlike the other two codecs, doesn't rely on tricks to artificially improve the quality. HE-AAC includes Spectral Band Replication, WMA 10 Pro makes use of a very similar approach. For portable players these techniques can be quite a burden, since they heavily drain batteries. Of course, Vorbis isn't a saint concerning its power hunger as well, but judging from what I've read so far it isn't as demanding as HE-AAC. I haven't seen any figures about WMA 10 Pro's decoding performance on low bitrates yet, hence I'll abstain from further commenting it. A Zune owner might be able to shed some light on this matter.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #59
Thanks, I forgot about that whole dimension to the problem. It's like comparing compression utilities by size reduction and forgetting about speed. It would be useful to have some power consumption index, but I guess that may vary greatly by device.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #60
Out of curiousity, why is there no castinettes in this test?  In my recollection pre-echo was a large problem with several of the codecs, and not with some others. It would seem unreasonable to suppress this issue.
-----
J. D. (jj) Johnston

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #61
Out of curiousity, why is there no castinettes in this test?  In my recollection pre-echo was a large problem with several of the codecs, and not with some others. It would seem unreasonable to suppress this issue.


AFAIR, it's "castanets", not castinettes, and there are other samples in it that show that issue. You'll definitely find comments about preecho in several test comments (i know, i wrote some)

Edit: typos.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #62
Quote
' date='Aug 27 2007, 14:46' post='512918']

Out of curiousity, why is there no castinettes in this test?  In my recollection pre-echo was a large problem with several of the codecs, and not with some others. It would seem unreasonable to suppress this issue.


AFAIR, it's "castanets", not castinettes, and there are other samples in it that show that issue. You'll definitely find comments about preecho in several test comments (i know, i wrote some)

Edit: typos.


Indeed there are some other sources, but these other sources also have ring tones and such, which mask a lot of pre-echo. I don't doubt you could still hear some. Oh, and thank you for offering the spelling corrections, I am elucidated. Yeah.
-----
J. D. (jj) Johnston


Multiformat Listening Test @ 64 kbps - FINISHED

Reply #64
How many of you that took test did use Headphones? How many did use in ear phones?

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #65
headphones for me

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #66
How many of you that took test did use Headphones? How many did use in ear phones?

Cheap Sennheiser HD 60 TV Headphones an AC97 onboard Soundcard.

I was recently thinking of an additional questionnaire in future listening-tests, e.g. for listening environment, age, ...

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #67
Headphones for 17 samples.  Loudspeakers for White America samples because I couldn't hear the diff in headphones. Masking, noise canceling?

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #68
good headphones (mid-price Beyerdynamic)

with some generic on-board sound on a laptop.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #69
Headphones: Koss PortaPro
davidnaylor.org

 

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #70
I too used the Koss PortaPro headphones, which are quite suitable for testing encoders at this quality level.

...and as I wrote before:
Out of curiosity, I played some of the samples through my big & good Hi-Fi speakers. I did know that only headphones can reveal codec problems properly, but I was still surprised about how much better the encoded samples sounded through a standard stereo speaker system in a casual listening situation. I suppose that the normal room echoes get mixed with pre-echo and other codec faults and the listener's brain "calculates" subconsciously a new "combined acoustic space", which does not sound completely wrong.

These speakers have a price tag of about EUR 2000. The PortaPros are about EUR 50 or less.


Headphones for 17 samples.  Loudspeakers for White America samples because I couldn't hear the diff in headphones. Masking, noise canceling?



My comments about White America:

"I hate this sample. My previous sample was the quiet bibilolo and I had set the volume level louder than normal. I didn't remember to reduce the level before starting this. It was like an explosion inside my eardrums. I hope I didn't damage my hearing...

In general the sample is overcompressed and very distorded. It does not have much that encoders could hide or alter. A bit more pre-echo & distortion does not change the ugly nature of this sample."

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #71
It seems to me that in most cases headphones reveal more differences than speakers and that most of participant in public listening tests use headphones.
Also while doing this listening test, I discovered that in some cases I could hear more differences with cheap earphones Creative EP 630 than with Sennheiser HD 650. I guess it is because of blocking of outside noise (though there was not much outside noise since I was in a room with doors and windows closed, and using quiet HTPC).
Or maybe it has to do something with neutralizing effects of head and pinnae related filtering.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #72
It seems to me that in most cases headphones reveal more differences than speakers and that most of participant in public listening tests use headphones.
Also while doing this listening test, I discovered that in some cases I could hear more differences with cheap earphones Creative EP 630 than with Sennheiser HD 650. I guess it is because of blocking of outside noise (though there was not much outside noise since I was in a room with doors and windows closed, and using quiet HTPC).
Or maybe it has to do something with neutralizing effects of head and pinnae related filtering.


Could it be because the cheaper ones don't produce all frequencies as well/evenly, which reveals high frequency artifacts better...?
davidnaylor.org

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #73
Could it be because the cheaper ones don't produce all frequencies as well/evenly, which reveals high frequency artifacts better...?

It is true that these headphones have different frequency responses:
http://www.headphone.com/technical/product...ild-a-graph.php
http://www.hydrogenaudio.org/forums/index....ost&id=3535
But it is question if this is the only thing that causes different perception of differences.

And difference between discovering artifacts with speakers or headphones IMO is beyond just different frequency responses of speakers/headphones. There are examples of double talk artifacts that sound terrible on headphones and are hard to perceive on speakers. One for example is sample 16 in this test.

Multiformat Listening Test @ 64 kbps - FINISHED

Reply #74
I used a pair of Sennheiser PX 100s for the test.


"I hate this sample. My previous sample was the quiet bibilolo and I had set the volume level louder than normal. I didn't remember to reduce the level before starting this. It was like an explosion inside my eardrums. I hope I didn't damage my hearing...

In general the sample is overcompressed and very distorded. It does not have much that encoders could hide or alter. A bit more pre-echo & distortion does not change the ugly nature of this sample."

I composed something similar but deleted it prior to submission due to unprofessionalism. Of all samples I spent significant time on I had the most difficultly with that one, precisely because the sample itself was so offensive to my ears. Even with volume levels down the song remains a screeching blob.