De-essing

De-essing (also desibilizing) is any technique intended to reduce or eliminate the excessive prominence of sibilant consonants, such as the sounds normally represented in English by "s", "z", "ch", "j" and "sh", in recordings of the human voice.[1] Sibilance lies in frequencies anywhere between 2–10 kHz, depending on the individual voice.

Causes

Excess sibilance can be caused by compression, microphone choice and technique, and even simply the way a person's mouth anatomy is shaped. Ess sound frequencies can be irritating to the ear, especially with earbuds or headphones, and interfere with an otherwise modulated and pleasant audio stream.

Process of de-essing

Broadband de-essing

There are several time and frequency based algorithms that can reduce sibilance or de-ess the sound. Time-domain based approaches, such as bandpass filters, are more suited to real-time applications such as live radio due to less constraint on Digital signal processor. Playback or offline applications incorporate Fast Fourier Transform (FFT) based methods.

Using a dedicated de-essing plugin

In the current digital stronghold of audio production, the most commonly used tool for reducing sibiliance is a de-esser plugin. A dynamic equalizer can be used to achieve the same effects as a de-esser, however, plugin manufacturers have tailored these tools to operate efficiently within the mid-high to high frequencies.

A de-essing plugin will compress the desired signal according to the amplitude of the selected frequency as it passes over a preset threshold. In the case of excessive sibilance anywhere from 4-10k will often be where the problem resides. Certain plugins will shape the envelope of the compression to achieve a more musical effect.

Over de-essing can result in the over manipulation of transients resulting in the softening or hardening of certain consonants, yielding undesirable effects.

Split-band de-essing

Dynamic-equalization de-essing

De-essing is a dynamic audio editing process, only working when the level of the signal in the sibilant range (the ess sound) exceeds a set threshold. De-essing temporarily reduces the level of high frequency content in the signal when a sibilant ess sound is present.[1] De-essing differs from equalization, which is a static change in level among many frequencies. However, equalization of the ess frequencies alone can be manipulated to reduce the level of sibilance.

Side-chain compression or broadband de-essing

With this technique, the signal feeding the side-chain of a dynamic range compressor is equalized or filtered so that the sibilant frequencies are most prominent. As a result, the compressor only reduces the level of the signal when there is a high level of sibilance. This reduces the level over the entire frequency range. Because of this, attack and release times are extremely important, and threshold settings cannot be placed as low as with other types of de-essing techniques without experiencing more blatant sound artifacts.

Split-band compression

Here, the signal is split into two frequency ranges, a range that contains the sibilant frequencies, and a range that does not. The signal containing the sibilant frequencies is sent to a compressor. The other frequency range is not processed. Finally the two frequency ranges are combined back into one signal.

The original signal can either be split into high (sibilant) and low frequencies, or split so that the frequencies both below and above the sibilance are untouched. This technique is similar to multi-band compression.

Dynamic equalization

The gain of a parametric equalizer is reduced as the level of the sibilance increases. The frequency range of the equalizer is centered on the sibilant frequencies.

De-essing with automation

A more recent method of de-essing involves automation of the vocal level in a digital audio workstation (DAW). Whenever problematic sibilance occurs the level can be set to follow automation curves that are manually drawn in by the user.

This method is made feasible by editing automation points directly, as opposed to programming by manipulating gain sliders in a write-mode. An audio engineer would not be able to react fast enough to precisely reduce and restore vocal levels for the brief duration of sibilants during real-time playback.

De-essing without automation or with manual equalization

Highlighted the frequency of the ess in the spoken word "instantly" on an audio editing timeline

Equalization curve lowering the decibels of an ess frequency range for a human voice

Audio editing software, whether professional or amateur software such as Audacity, can use the built-in equalization effects to reduce or eliminate sibilance ess sounds that interfere with a recording. Described here is a common method with Audacity. The process is in two phases: 1) analyze the frequency of the voice's ess sound by sampling several instances and calculating the range of ess frequencies. Male voices sibilance range in 3000Hz to 6000Hz while female voice's typically range in 6000-8000Hz(sourced from plug-in guide). Next, 2) apply an equalization filter to quiet the determined frequency band by -4 dB to -11db during ess frequency time events. The rise and fall time of filter should be fast (less than 10ms) in order to clip the sibilance specific instances only[2]

References

Jeffs, Holden, and Bohn (September 2005). "Chapter 4 -- Specialized Compressors". Dynamics Processors -- Technology & Applications. Retrieved 2020-10-20.CS1 maint: multiple names: authors list (link)
Reiss, Joshua D. (2014). Audio Effects; Theory, Implementation and Application. CRC Press. pp. 300–301. ISBN 978-1-4665-6028-4.

"What are the best de-essers".

Music technology
Music technology	Mechanical Electrical Electronic and digital
Sound recording	Audio channel Mixing console Binaural recording Digital audio workstation (DAW) Effects unit Equalizer Headphones Microphone Microphone preamplifier Monitor speaker Multitrack recording Music production Music sequencer Outboard gear
Recording media	Phonograph record Magnetic tape Compact cassette Compact disc DAT Hard disk MiniDisc MP3 Opus
Analog recording	8-track cartridge Amplifier Cassette deck Comparison of analog and digital recording Experimental musical instrument Phonograph Player piano Reel-to-reel audio tape recording Tape recorder
Playback transducers	Loudspeaker Headphones Monitor speaker PA system Sound reinforcement system Speaker enclosure Subwoofer
Digital audio	Digital recording Digital signal processing
Live music	Mixing console Bass amplifier Effects unit Foldback Guitar amplifier Keyboard amplifier PA system* Reverberation Sound reinforcement system
Electronic music	Chiptune Circuit bending Drum machine Electronic drums Electronic musical instrument MIDI MIDI controller Music workstation Sampler Sequencer Sound module Synthesizer Theremin
Software	Digital audio editor Digital audio workstation GarageBand ProTools Scorewriter Software effect processor Software sampler Software synthesizer
Professions	Audio engineer DJ Guitar technician Mixing engineer Monitor engineer Piano tuner Record producer Re-recording mixer Sound designer Sound follower Sound operator Sound recording engineer Tape op
People and organizations	Audio Engineering Society Goji Electronics Institute of Broadcast Sound Lejaren Hiller IRCAM Max Mathews Musical Electronics Library Professional Lighting and Sound Association Robert Moog SMPTE STEIM
Related topics	Audiophile High fidelity Home audio Home cinema Music store Professional audio store New Interfaces for Musical Expression (NIME) Vehicle audio