I still don't get the relations between digital information and reproduced sound. I do for some part, but then I get a strange feeling that certain measures are being used both to describe the workings in the digital domain and then later to describe the sound.
Like the other day, I saw a video where someone said something about a sampling frequency and then added how it is unnecessary as humans don't hear above 20kHz. I wanted to say; but you're not listening to the sampling rate, that's just the speed of "reading" a digital file, it seems as if you can't really have high audible frequency without a fast sampling rate (?).
EDIT: later confirmed by
@yepimonfire
Then there's dynamics which imply both frequency span and dB and I hope in this case dB refers to those SPL dB's, otherwise I'm committing myself. This means that you need sufficient bit depth for music to be loud (or is it just in case it has both loud and silent parts?) and for music to go from low fq to high fq?
EDIT: later confirmed by
@yepimonfire
Because everyone keeps saying we don't really need hi-res files and then comes Mark Walter in that lecture linked by
@Auditor55 and says; sure, we don't need it until a certain instrument comes and hit a certain note (he was referring to a high pitch of some instrument). But, couldn't you then simply cut out the bottom part and still make it fit to a CD (which is not hi-res for him)
And then there's vinyl, which is said to have the equivalent of 8 bits, so it couldn't take a hi-res recording and yet someone raised this doubt just a couple of posts before. If it's known that vinyl is equivalent to 8 bits, and you need 16 for hi-res (or 24 for Walter), why raise this question? But you could cut digital info into record grooves, right? Like having a bump in the record groove for 1 and silence for 0, is it theoretically possible to record hi-res music onto vinyl this way? No matter how short the song might be, but could it be hi-res? Just make the "1 bump" more pronounced than record "clicks" to make it clear.
Sorry for trying your patience, I'm injured lying in bed, hence the intensified activity.
EDIT: I'm sorry, it's Waldrep. I never heard this surname and couldn't make it out from the video.
I actually greatly enjoy these “theoretical” tech discussions.
Bit depths higher than 16bit aren’t necessary for playback, only recording and processing (i.e. your receiver oversamples and increases the bit depth to 32bit/384khz before applying processing such as bass management, eq, Dolby Surround upmixing, time delay etc for a higher precision), oversampling also gets rid of the issues associated with the transform filter in the pass band. For playback purposes, bit depth only increases the range of the softest sound to the loudest sound. Sound below the maximum dynamic range, for example, in 16bit, which is -96dB, are lost due to quantization noise. In almost all cases, a dynamic range greater than 96dB is entirely unnecessary, let’s say you listen to a classical recording at an average level of 70dB, the average signal is -30dBfs, so the loudest peaks in the recording are 90dB, and the absolute quietest sounds that can possibly be captured are less than 1dB, i.e. inaudible. it’s highly unlikely that you will be able to even hear anything captured below 10dB due to the noise floor and auditory masking, even in an ultra quiet room. My own testing demonstrates that the quietest sound I can hear in a quiet room with a 30dBA noise floor using a 4khz sine wave is 1 dB, using full range pink noise, this falls to about 5dB. The 96dB dynamic range of 16bit is more than adequate. The only reason I mentioned needing 24bit for movies was because reference level is 105dB per channel, so a 5dB sound requires 100dB of dynamic range. Even so, you’d really only need 20bit, which is the equivalent bit depth resolution of the old Dolby Digital.
Still, if a film soundtrack were reduced to 16bit, you probably wouldn’t be able to tell any difference.
In the time domain, a 48khz sampling rate allows a maximum impulse of 20 microseconds, the rise time of a transient such as a cymbal strike can’t be properly recorded at this sampling rate, and humans can resolve a temporal difference of 6 microseconds. Yamaha has a good article about this. In addition, one study has shown the addition of ultrasonics to music elicits an emotional response
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0095464
You are correct that digital audio requires double the sample rate of the maximum frequency. Frequencies above half the sampling rate are not properly captured and fold down into the audio band as lower frequencies, which is why a very steep low pass must be used.
Although digital audio looks like a stair stepped waveform, it comes out as a continuous waveform after passing through the transform filter. To get an demonstrate how this works, you could create a 120hz square wave signal using rew and hook the output directly into the lowpassed input of your sub, rather than coming out as the irritating clicking sound of a square wave, it would sound like a relatively smooth bass sound, this is actually similar to how DSD works. The 60dB low pass used in the transform filter essentially removes the stair steps, since those stair steps are extremely high frequencies.
I have no idea what you’re getting at with recording a digital signal on vinyl. Due to the problems involved with the medium, I’d imagine the jitter during playback would be ridiculously high, as each successive playback wears the bumps down, 1s would eventually be interpreted as 0s.
Vinyl is an inferior format compared to hi res audio or even regular redbook. The only reason some vinyl releases sound better than digital is because it’s impossible to cut a record with a signal that has had a stupid amount of dynamic range compression applied, however, a digital release with less compression at 176khz would likely surpass the vinyl in every way.
Another interesting discussion is the DSD vs PCM debate. From a purely technical standpoint, I’d say DSD is worse than PCM. A good article discussing this viewpoint can be found here
http://www.mojo-audio.com/blog/dsd-vs-pcm-myth-vs-truth/
The only way DSD would sound better than pcm is if it were played back on a 1bit delta sigma dac, since the conversion from pcm to pulse density modulation introduces noise and distortion into the signal (many cheap dacs such as those in a $30 DVD player use this type of dac), however, modern high quality oversampling dacs such as those found in our avrs don’t use this, it’s worth noting every study done using blind test comparisons between the two found no difference.
Sent from my iPhone using Tapatalk