Inthepipeline.net is written primarily as a resource for inthepipeline recording school.



Introducing the 'K' System: Metering for the Digital Age

The digital age of recording requires a far better and uniform system of metering than the purely analog days.

It is now possible to make recordings that have so little background noise that there is very little point in trying to record as loud as possible. Now we have such dynamic range available to us that it would be possible to record the sound of a pistol shot with perfect clarity, play it back at its original level (at 130dBSPL, beyond the threshold of pain) and still be unable to hear the background noise from the recording before the shot was fired.

On the other hand, recording as loudly as possible has become very competetive in the pop world, where material is often played over poor quality media such as night club P.A. systems, In-car systems, FM radio, Myspace, MP3 players and portable equipment. Despite the fact that better quality of sound is available to us, some parts of the recording industry lean toward supplying us with low quality audio equipment and low quality recordings to play. 


How Loud?

There is a fundamental question which you may ask, and may have already asked yourself in relation to recording. That question is how loud do I mix? The sound pressure level in different environments will vary greatly for a given amount of amplification, so how is it possible to have some reference level for recording, mastering and mixing when we already know that a listner may very well change the level of the sound on reproduction?

In this section I would like to bring to your attention the importance of the amount of sound in your control room being relative to the level displayed by your meters.
The actual sound pressure level (SPL) in a listening environment is something that each listener will adjust to taste. This can and should vary, but you may have noticed that the loudness of individual recordings can vary to a great degree. So how do we apply a standard for loudness which will satisfy musicians, recordists, mastering engineers, radio listeners, broadcasters, theatre-goers and domestic listeners?

 Many people hook up a cd player and set it to read 0dB on their master meters, adjust the volume of their control room monitors to be comfortable at that level and proceed with recording and mixing down their material. In theory this would appear to be practical, but there are many further considerations. It would be quite easy to set your monitors to a louder level because you like the tracks you play and enjoy hearing them perhaps a little louder than is ideal. It is also more than likely that the recording levels on the tracks will vary, making the decision very difficult.

Even before you start recording you will benefit from acknowledging that it is important to monitor at a certain loudness in the control room.  Above a certain loudness our perception of dynamic content in audio material tends to diminish. We become less aware of compression because the Human brain starts to compress the sound we hear. This phenomenon varies depending upon not only the loudness of the sound, but also the amount of time the listener is in the environment. Anyone leaving a loud rock concert or night-club will have noticed that it takes some time for the brain to re-adjust to a 'normal' level. To a lesser degree this effect can easily happen to an engineer in a control room. In fact it is no less important to give your ears a rest when mixing down than it is to take your eyes away from a computer screen. Clearly, to encourage any sort of insensitivity to sound is of no advantage to a sound engineer and to be avoided. Adversely, attempting to adjust sound whilst listening at a lower than ideal level can tempt us to make recordings which are too "hot" and lack dynamic range because we attempt to bring the sound level up to an SPL which our brains consider ultimately comfortable. This can result in a flat, un-interesting mix with little dynamic range. Clearly, somewhere between these two extremes lies a relative listening level and recording level which should allow us to achieve the best results.

Recordings of popular music have been getting "louder" for many years. That is to say, the average level of a recording has become closer to the maximum available recording level. So the perceived loudness is greater, but the depth of the dynamic, the "liveliness" has decreased. A higher average level means less available headroom and so a flatter mix, but compare the actual comfortable listening level of an older recording and you are likely to find a far more dynamic sounding mix, but at a lower average level. In other words, the consumer is likely to adjust the "volume" upwards to achieve the same comfortable listening level and to perceive a greater dynamic.
Broadcast engineers have to balance the transmitted output from TV and radio stations to provide a regular relative transmission level. Moving picture theatres need to provide a sound level which has a far greater dynamic range than a Pop CD. The sound is likely to have a far lower average level and greater headroom available for special effects etc. In other words, it is necessary to consider the circumstances under which the media will be reproduced as well as the content of the media itself.
Certain commercial entities have taken advantage of the fact that no standard exists. It has become widespread practice to make recordings with a very "hot" average level to make advertisements which sound louder than an average film track. In the Pop world a "loudness war" has been going on for some time. Despite the fact that as digital recording technology has improved, higher quality recording has become possible, certain parts of the industry have actively attempted to reduce the quality of output in favour of greater average level and as greater bit depth and sample rate increase the quality of digital recording and yet greater dynamic range becomes possible, the industry may seek to continue the loudness war and make the problem of relative sound levels even worse.

So how can a recording engineer decide on a relative listening level which will result in a recording acceptable to the industry and the domestic listener, given that the end-user will adjust the level of their system to taste? how can we produce appropriate content for delivery via different media?
Currently there is not an absolute standard, but much work has been done in considering the problem and certain standards are widely accepted thanks to a proposal made by the experienced mastering engineer Bob Katz.

It is clear that to provide an outline for an "average comfortable listening level" much experimenting would need to be done. In practice, engineers in the film industry have already carried out detailed subjective listening tests in this area and regularly use a mixing environment containing a fixed, calibrated monitoring system set for an SPL of 83dB. This is considered too loud as a domestic listening level because most home listening situations would not accommodate such a great dynamic range, so many authorities recommend lowering the gain by 6dB. Bob Katz has experimented over the production of many masters to develop a consistent perceived loudness when creating CDs. He has found this level to be consistent with the recommendations and has proposed not only a standard for an average monitoring level, but also "K Metering" a system of metering which refers to program level depending upon the intended use of the medium.


The 'K' system and Listening Levels

The wonderful thing about the K system is that it doesn't stop at metering.

Consider for a moment the different sorts of dynamic information which could characterize a recording. They fall broadly into three groups.

Bob Katz, a leading mastering engineer proposed that there should be not one but three standard levels for recording in the digital domain, each of which could best reflect the needs for effective delivery of a particular media group.

In recording  movie sound-tracks for example, the spoken dialog needs to be intelligable, but reproduced at a level whereby it is in harmony with the backing track and the sound effects. Preseving the reality of the sound effects will often require that they are recorded with a great deal of headroom available so that the dynamic can be preserved. 

A classical or operatic concert is a large scale acoustic concert  with a dynamic which will vary between the sound of a single flute and the whole orchestra in unison.

In the case of Pop and Rock music compression often constitutes some of the character of the music. The sounds are very complex and are often changed so much that any solo'ed component would not sound as though it belonged to the whole project. The average level of the  music is lightly to be higher, but the requirement for headroom is lightly to be much lower.

In the case of media for broadcast, the stations themselves compress the transmissions to obtain a constant, clear signal. In these circumstances, by decreasing the dynamic range of the original recording it may be possible to ensure that the master is better represented across the transmission.

The learned Mr Katz cites research which indicates that over a large representative audience in a theatre (over 1000), a certain reference SPL is considered not to loud and not to soft for the audience. This magic number is 83dB SPL.  He then goes further to cite references from respected audio engineers who recommend reducing this level by 6dB for domestic listening.

With reference to 0dBfs the 'K' system recommends are average recording levels of:

The image below demonstrates how the meters appear compared to a normal digital meter

k12, k14,k20 meters compared to odBFS


Bob Katz' full explanation of the K system is available here.
His original proposal and the reasoning behind it is here.

The K metering system is of great benefit to an engineer. It provides:

A consistent monitoring level
The opportunity to create mixes which will contain the desired dynamic content when reproduced and distributed using specific digital media.
A consistent recording level for a "comfortable" listening level
A consistant metering system to comply with the required loudness for CD, DVD, Film and Broadcast production

It is not complicated to calibrate a monitoring system to comply with the K system.
To set up the SPL of a monitoring system:

Play a standard pink noise calibration signal at a level of -20dB FS RMS according to the digital peak meters through one monitor at a time.
Adjust each one to a sound pressure level of 83dB using a calibrated sound pressure meter set to C-weighted, slow response.
This -20dB point  on the peak meter now represents an average level of 83dB, the standard for the film industry.
Now you can set the "0" level at 6dB lower on the meter to represent the suggested level for domestic monitoring at 77dB.

There is a free VST plug in named MDA test tone which provides pink and white noise, test tones and a sweep generator. It is available at Smartelectronix.com


Recording Levels for Tracking

The 'K' system is designed primarily for allowing us to create our final mix-downs at a a suitable level for their intended use, but what about the level at which we record each track in a multi-track mix?

There is much discussion about the marvels of recording with software that uses 32bit-float and 64bit-float audio engines. It is often presumed that this enables us to record at any level and that it is impossible to obtain distortion, so one might as well record as loudly as possible. This is not the case.

Digital recording software of this nature will unlikely distort your recordings. It is almost impossible, but as with a "real" studio, it is important to ensure that the amount of gain, or loss in the audio chain remains relatively constant through the various stages of your DAW software. Many FX plugins may not have the same large bit-depth as the application which is hosting them. This being the case it is quite easy to cause a plugin to distort the signal, especially at the point between one effect and the next in a chain.

Since most recording software will provide unity gain between a channel and the main output with no adjustment this also allows the user to make good value judgements about the qualities of each sound with the same accuracy which would be applied to a final mix-down. This can also be applied to sub-mixes.

It is quite common to render tracks and submixes to an audio file in order to free up processor power for more complicated mixdowns and allow the use of additional plugins. These audio files may be re-used as stand-alone mixes for some other purpose or inserted into tracks with different FX applied, so recording them at a sensible level is just as important. Remeber that the noise floor for each audio file will be very low. This is of course subject to your recording skills in the use of the analog equipment connected to the input of the sound card. Many a bad, muddy sounding recording is blamed on one or another DAW system when the user has not paid enough attention to this critical point in the recording chain. Digital recording is very powerful, but it's not magic.

In real terms, I would suggest that the best approach is to treat each and every track as if it were a master track, using the lessons of the 'K' system to your advantage. Pay as much attention to clarity and dynamic range as you can. This way you can quickly build a sample library of outstanding quality, the flexibility available to you in the mixing process will be greater. You will have greater control over your mixes and they will sound sharper and more apparent. You will far more easily be able to create the required amount of space in your final mixes if you place the audio comfortably in the space available, considering both the space between the noise floor and the quietest detail of each sound and leaving plenty of room for the highest peaks.

This site is under on-going development. More pages will be added soon. Thanks for visiting inthepipeline.net