sillybit AT (raw audio + a lesson)
i have never posted about the boring editing part that takes me literally hours. it's clean-up work. but it's also the reason i fell in love with making music and singing.
so, first, let's do a small cute simple sound sciency lesson.
sound is a wave: pressure changes in air. sounds simple, no? when you scream or hear a thick bass drop, you can sometimes feel the physical wave in your body.
however, the sound of a human voice is one of the most complex waves you can encounter; there are so many variables that influence the sound you make.
you make sound by pushing air past your vocal chords which are basically fleshy flappy things that vibrate in your throat. important note: you only actually make vowel sounds with the vocal chords! (AAA, EEE, IIII, etc.) consonants come later.
then the air travels through your throat into your mouth and nose cavity. there, the first pressure waves from the chords are modified by basically anything: your mouth shape, nose shape, throat shape, tongue position, etc. the position of your mouth modifies the sound that is pushed through and this is how you make consonants and words.
the air comes out. in real life situations, the sound wave you make travels everywhere and reverberates (is reflected) on objects around you. in the shower, for example, there is strong reverberation which creates this fullness/echo-like effect.
in digital situations (like for voice audio) this is different; a microphone captures essentially the main waves (if you are lucky). it cannot reproduce the real life environment, which is why a live singer always sounds better, more visceral, than on audio. the sound is literally more 'alive' because it 'adapts' to the real environment.
your produced sound wave is super complex like I said before. it is not a simple 'wave' like ~~~~~~. a wave like that is called a sine wave and actually sounds like a long and annoying microwave 'ding'.
your voice is more of an extremely chaotic wave that is actually made up of millions of mini sine-like waves. the final result looks like below:
you can see it's a wave, but it's made up of many many smaller waves. and these are not regular. this is about 20 milliseconds of an 'A' sound (very small fragment of human sound)
bigger waves -> lower frequency, more 'heavy'. think: bass, rumble, low voice
smaller waves -> higher frequency, more 'bright'. think: crystalline, bright, high pitched voice
below you see a spectral view of sound, the 2nd post-chorus of alien tower. a spectral view of sound (simple explanation) is something that allows you to see all the details of your produced sound wave.
a spectrogram 'breaks down' that chaotic wave into all the sine-like waves that are in it. this requires complicated math that i don't understand lol (Fourier wave transform equations).
on the bottom are the lower frequencies and on the top are the higher frequencies. the horizontal axis is just time (so, first sound -> last sound). the brighter orange the wave, the 'louder' it was present in the sound.
just look at the beauty of this. just contemplate.
zoom in. see how fucking beautiful a human voice is. this is mine, but yours looks exactly similar to this. you make millions of waves that come together as a unified sound.
the bottom part is more yellow/orange (im colourblind). i have a relatively heavy voice, but all voices are thicker towards the bottom, even female voices. this is nice -- otherwise, we would all sound like screeching birds.
the upper part (purple/blue) is more calm. normal -- again, humans are not screeching ultrasonic birds. except -- there are some parts where there is like a 'pillar'. there is no pretty waves in these pillars. these are called sibilants -- your S, Z, CH, F, V, wet T, wet K ... sibilants are closer to random noise (white noise) than any other sounds humans make. just try to say "SSSSSS". now, you can now 'see' and 'feel' that chaotic randomness in your mouth -- magic :)
there are long stretched parts that look beautiful. inside these parts are waves. these are my vowel sounds. the waves inside are the changing pitch. when the wave goes up, i sing higher. the wobble inside the wave itself is 'vibrato' -- every human voice trembles a bit.
in the long stretched vowel parts, sometimes the shape changes abruptly. these are the consonants. you can actually 'read' this sound spectrum to know what i say. "from the Lake - to the River - to the Sky - I am - touched - by something strange - not gonna lie"
the edited version is much cleaner than the unedited. and it's brighter overall because I made it louder.
Now listen to the (edited version) of this spectrum. (no autotune and false note trigger warning :p)
What I edited out in this bit:
i removed noise -> the general cloud of 'noise' visible in the silences becomes pitch black
i removed breaths in this song -> those puffy lower clouds before/between/after words
i removed clicks -> those vertical bright stripes randomly appearing everywhere
there were no gargles here, but these are hardest to see -> like a spotty cloud hidden in the upper parts, usually at the beginning of a sound.
i removed the room's rumble of my fridge and ventilation system (yeah, home studio vibes) -> the long orange stripe at the very bottom.
i removed my room's reverb and pauses between words -> you can see 'tails' or smears of my vowels into nothing. some bleed into the sibilants which is why there are these strange black rectangles.
I think I am a bit of perfectionist when it comes to clean-up. i know many use 1-click tools to remove all the things above. i prefer to do it by hand. then i have 100% control over every micro sound. also, i don't have a studio, so I am forced to be less lazy when it comes to editing. i don't think many people manually edit out the reverb in their takes.
editing is the very first step after recording in my workflow. after this comes Melodyne, which i already showed you guys many times. Melodyne is advanced software that can tighten and shape those waves inside the vowel sounds.
i hope you made it till here and liked this lesson! this is why i love singing/music. because it's just physics becoming poetry -- right at the boundary of science and magic.