Is Game Music All It Can Be?

By Andrew High

This is a love song. A love song to video game music. A love song to video game music that spends a lot of time pointing out that video game music would do well to iron its shirt, shower every day, and would it kill it to maybe shave every once in a while?

This piece is directed toward those who make, compose for, and/or enjoy a cinematic game experience common to most triple-A and an increasing number of indie titles. It touches on elements common to all video games in many places, but the purpose is not to play the nagging Jewish mother to two-man developers about how they should be more like their big brother who graduated summa cum laude and landed a big contract with Activision and will probably cure cancer someday.

The purpose is to help producers communicate with their composers, help composers hone their craft, and help the end consumer become more educated about the potential value of game music.

Why Take a Cinematic Approach to Game Music?

Too long has video game music been relegated to a dusty corner of gamers' minds. Sure, we all have fond memories of chip-tunes and our favorite melodies, but video game music has typically been viewed as a background soundtrack, not something that plays directly into the visual elements. Just look at all the games that allow you to import or stream your own music while you play.

This is a shame. Music can have a tremendous impact on the mood, feel, and emotion of any visual elements a game can try to convey. A shift in the music can take the exact same visual scene in two completely different directions. (I've always liked this example to show how a different score can change things up:)

Video games come in many forms and serve many purposes as far as the type of entertainment -- Ninja Gaiden in hard mode clearly scratches a different itch than FarmVille -- but I think it is safe to say that the majority of triple-A and otherwise popular games are trying to take a more cinematic, story-focused approach. What was the last FPS you played that didn't have a story component, regardless of how preposterous the premise? The visual techniques reflect this -- effects that emulate real camera patterns like light bloom, lens flare, focal shift and even film grit are all very common in the modern game.

Video games are unique to this A/V field in a number of ways -- one of the most obvious being that the pacing and even the order of events can be dictated by the player. Writing for this sort of uncertainty definitely present problems that any video game developer needs to consider. However, as games become more scripted, planned, and emotionally impactful, game composers would do well to study the centuries of experience other mediums can provide them. Re-inventing the wheel is not something we want to do here.

The focus on cinematic visuals and storytelling becomes increasingly obvious as we look into just how much straight-up non-interactive cinematic storytelling can be found in games. Oh sure, there might be a "press X to not die" moment sprinkled here or there, but when you strip out the real gameplay you are often left with a long sequence of cut-scenes that rivals the length of major movies.

For instance, The Batman: Arkham City cutscene playlist on YouTube is just north of 2 hours and 30 minutes long, longer than the majority of motion pictures. Gears of War 3 is 1:43 in duration. Xenoblade Chronicles? North of five hours, beating even the extended edition of Return of the King in length. Even completely disregarding player-driven gameplay, there are entire movies contained inside today's games.

Unfortunately, video game developers and the players themselves don't often see this connection. Corners are cut, sacrifices made, flat-out wrong practices are repeated time and again, and the gaming media looks upon it and proclaims it good. Games have made great strides lately with a more cinematic approach to storytelling, but it's sad to see a crucial piece of that puzzle so often neglected. The Final Fantasy series, Dragon Age, and Mass Effect have managed to start to understand lighting, blocking, cinematography, and the like, and utilized them to great effect -- but what about the music?

All visual media is more closely related than some would think. Film, TV, advertising, and games all share many similar traits, and music publishers often treat them in a very similar way. Though each presents unique advantages and challenges, all can be summed up with two simple, tiny words:

"TO PICTURE"

This is the essence of all visually-oriented music. Video games have long been a valid medium for telling an intriguing story, and the "to picture" approach has been proven over the centuries to be the best companion, as such. Our reaction to the music is often more subconscious and deeper than our visual analysis. At best it enhances and deepens our understanding of what our eyes tell us -- sometimes directly adding, sometimes showing another facet or wrinkle that we didn't see.

With all the cinematic focus on visual elements, why wouldn't we take a cinematic approach to game music?

Before we discuss using musically nerdy cerebral philosophies to guide game scoring, perhaps a quick overview of some basic techniques are in order. Frankly, many games fail to get even these right. The essential problem is that you can't just write music and expect it to work.

Understanding Your Place

Our ears are specifically tuned to speech frequencies, and working around that can be difficult. Guess where melodies (and music in general) sound best to our ears? That's right -- the exact same range as speech. Think about the last time you were trying to hold a conversation when you had the radio tuned to a pop station. Did you notice how much you had to turn it down to be able to hear the other person? Now try talking with about the first 90 seconds of this on in the background:

This piece was scored specifically written to accommodate human voice. Can you keep the volume much louder than the pop music example? You should be able to quite well. It's about space.

For the purposes of visual media discussion, diegesis is anything that is directly represented on-screen as "of the world". So if there's a scene in a smoky cabaret and the music of the scene is being played by a jazz band contained therein, that's diegetic. "Bohemian Rhapsody" is diegetic within Wayne's World, as the characters are obviously aware that the music is coming out of their radio and are reacting to it. (This article talks a bit more about that and other important concepts.) Most all film and video game music is mimetic, not diegetic, meaning that it's not music that is in the world with the characters but has instead been added for the sake of the audience. It's important to understand why that matters: because diegesis is king.

If you ever study opera you will quickly see that the entire orchestra basically exists to support the singer(s). The vocalist is diegetic, or in the world, and must be skirted around carefully by the music, which is mimetic, or outside of the world. Therefore, any composer worth their salt must write around the diegetic part of the story because that's the part that's actually telling the story. In more traditional musical settings like opera this is quite easy to accomplish, as the melody lines are clearly written out in musical notation. In other mediums, it may not be as obvious. This doesn't mean that the concept can be ignored, however.

Even simple conversations have pitch. Great stage actors can have up to a three-octave speaking range; it is how emotion is carried through the voice. Try speaking in a monotone and see how much you are able to convey. Erich Korngold, one of the great early composers of film, was famous for writing diegetic film dialogue out in musical notation and then scoring around that, as he did earlier with opera librettos. While this may not be a necessary step, a basic understanding of the frequency ranges used by male and female speakers and how to avoid writing scores in the same range is not out of the realm of any composer's understanding. As an example, take this clip from The Adventures of Robin Hood, one of Korngold's most famous scores.

Notice that the instrumentation takes up the entire spectrum of sound at the beginning, but then right as the vocals enter at 18 seconds in, they part. The strings become higher, the bass gets a bit lower, and everything that was in between the two drops out. It's a virtual parting of the waters to make room for the voices in their proper register. Just to show it's not a fluke, it does the same thing at 0:43.

I can't think of a single game that really nails this concept, which surprises me. It's not necessarily difficult, one just has to be aware of it. It's sad commentary that the first thing I typically do when I load up a video game is turn the "voice" slider to maximum and the SFX and music sliders down considerably, because they have no concept of how to write and mix around the vocals, instead of barreling over them.

In addition to keeping the frequency range in mind, the composer must also consider other ways they can muddy the text, and avoid them. In the gaming world this most often manifests itself as a score that is so busy it's distracting. Too many notes, too fast a tempo, too much of everything. An expert composer can properly choose the time for the score to become prominent and the time for it to fade into the background, back and forth between the natural breaths of the narrative.

An excellent example of this comes from the trailer for Conan -- a film that was rather terrible from a cinematography and plot standpoint, but had an absolutely outstanding score composed by Basil Poledouris. Watch the whole thing:

Notice how Poledouris actually alters the music for the lines of text? And it works even on its own as a song? Pull out the high strings and choir, throw in some low brass to punch up James Earl Jones' dialogue, it works very well. The actual instrumentation lends itself to the interplay of dialogue and action scenes.

Now for the ugly side of that coin. Sometimes people opt for the lazy way out and dump this on the mixing engineer, who accomplishes such by "ducking" the music when a vocal track is present -- ducking being pulling back the music volume to make room for the vocal to be heard. An example of constant ducking is, well, a lot of trailers, as they tend to have busier, more "intense" music. We'll use Uncharted 3 as a recent example:

Starting at about 40 seconds into the launch trailer, notice the up-down-up-down-up-down as the speech pops in and out. It's distracting and annoying.

Creating audio space is like arranging a bunch of 3D bubbles. Whatever is in the center on the X- and Y-axes and at the front of the Z (depth) axis is going to grab the most attention, and that should always be the vocals. Ducking is a cheap way of pushing the music back on the Z-axis, but that constant shifting is noticeable; the much preferable method is to make space on the front plane of X and Y around the dialogue. It is quite possible to write around dialogue, as thousands upon thousands of hours of Opera and film scoring will attest. Why shouldn't video games do this, as well?

Just Beat It

A beat, in scoring terms, refers to a particular visual point of action that should be accented. This can be a hard cut in the footage, a punch, just about anything. This can be accented in the music in many ways, typically depending on the requirements of the visuals. Here's a quick example that runs the gamut:

The low percussion (likely an udu) as the dragonfly lands on her nose is a beat. The sitar note as it flies away is another. The harp gliss for the reveal of Wonderland is another. The cymbal roll for discovering the caterpillar is yet another; the harsh low brass almost immediately after as his expression sours another still. 25 seconds in and we've hit five beats already. This is a fairly common pace for higher-energy sections and trailers. Notice that each of these has a different effect, but all come together to add interest and impact to what's happening on screen.

It is possible to create music for a beat-heavy visual without using beats, but then it's up to the foley and sound designers to pick up your slack. See here for a quintessential example:

The opposite effect, hard transitions and visual beats without any aural punch at all, feels so unnatural that I can't even find good examples of it, because no one does it.

I've used the example of a trailer here because this is something not really seen in games much, despite having many cut-scenes proliferate in the modern game. Occasionally one can find a use of a single beat, say a cue that build to a big crescendo, but considering many scenes can have potential beats that number well into the double-digits it is a woefully under-utilized idea.


Painting the Picture

The most important thing the score is there to provide is an enhancement and complement to the visual cues on-screen. This means, as an example, that bombastic brass in the middle of a tender love scene would not be particularly appropriate.

A good score can also act as another tool for the director (or game designer), adding subtext that may not otherwise be available, or shifting the perspective of a scene. The latter is a very important concept, but not necessarily appropriate for all situations.

Most movies, games, etc. are pretty straightforward, and usually demand a straightforward accompaniment. Nevertheless, it is important enough that we will discuss it in-depth later.

Silky Smooth...

In the last 25 years or so, films and television have discovered that a score doesn't always have to be "music" in the traditional sense that there's a melody, harmony, steady time signature, and so forth.

Sometimes a simple texture can get the point across as well or better than a more formally structured piece. Sometimes the music's lack of structure can support the picture's theme of amorphousness.

Think back on the shows of the '70s and '80s. These were written so that when the show started, people all over the house would hear the first few notes and think, "Bonanza's on! Time to gather 'round the TV!" You're probably humming one of the theme songs right now. Eventually, we transitioned to this:

This is not an inherently bad or good thing, but another tool in the composer's toolbox they can use to properly paint the scene. The "pad", or group of low strings/synths sustaining one or a handful of notes out of time below a scene, is extremely common now in movies to build suspense or otherwise denote a holding pattern, so to speak. At any rate, we now have music, textures, and sounds to play with, which opens things up quite a bit.

How to Represent

As noted in the preceding paragraph, it is not enough to say, "write something punchy here" or "make this sad". You have to be attentive to far more detail than just that. The best way to show this is by example. To show that I don't hate everything that you hold dear, we'll take a cue from Howard Shore's score to The Fellowship of the Ring. Now, concerning hobbits...

What is the music trying to portray here, and how does it accomplish it? Note that for the first minute, we see a hint of things to come. The camera sweeps through Bilbo's house, over things that are to be very important in a very short while (like the fireplace). During this you actually hear a low and incomplete version of the Fellowship Theme, an introduction to the extremely important concept of leitmotifs that we'll get to later. So the music is foreshadowing for us, in support of the video that does the same. Then Bilbo begins to discuss hobbits.

Hobbits are simple. They love peace, food, and good tilled earth. The music has to represent all of these things, as well as their general lazy, jovial nature. Shore chooses to go purely with a string section here to keep things simple, as a complicated instrumentation would belie the point. A fiddle (or a solo violinist playing one) is chosen as the lead instrument. Shore relies on prior knowledge here to stress the theme -- another helpful concept. Most every human who is culturally involved associates a single fiddle with a simple, rural, pastoral setting. Whether it be Ireland or the American South -- or what have you -- the sound instantly conjures up those memories. So half of his work is done for him before he ever puts notes to page.

The music itself is short and punctuated. Somewhat lively, but not too fast. Were it elongated, with sweeping strings, it would feel too sappy and lovey-dovey; were it much livelier (e.g. faster) it would feel manic, and clash with the rather lazy nature of the Shire's inhabitants. Note also that the register of the violin is nearly two octaves above where Sir Ian is speaking, which ties into our earlier discussion about avoiding the diegesis.

There are ideas in this short little piece that I don't have time to discuss, and I'm pretty sure that there are ideas I don't even hear but that we all may absorb sub-consciously. Howard Shore may be the best film composer currently practicing, so I wouldn't put it past him.

Let's listen to another example, but I want you to listen to the song, without visual accompaniment, and think about what it's portraying:

What do you hear? There's tenderness, but there's tension. Even in the first couple of notes, there's what I would call "consonant dissonance", that is, dissonance that we're pretty used to hearing in the form of a lot of upper-tertian harmonies, major 7ths and minor 9ths and so on. Pretty, but full of tension. And then that trumpet comes in. Oh goodness, that trumpet. Uan Rasey, the storied trumpeter who plays that part, was famously told by Jerry Goldsmith (the composer) to "play it sexy. But not like it's good sex!" A fitting portrayal of Chinatown, and the level of granularity that needs to be thought of in games as well.

Okay, one more. If you can, try to avoid looking at the title on this one! Just click the play button, close your eyes and tell me what you picture:

What do you hear? I hear a lot of energy, but of a very intense nature. Likely an action sequence. A lot of industrial sounds, samples of things like banging on metal drums and the like, but mixed with strings that have some Middle Eastern flair (like the little trills they do) and later on, a pan flute of all things. Okay, now look at the title -- this is the music that plays after you've completed a main assassination, and have to high-tail it back to a brotherhood base. Does it make sense to you?

I'm a bit on the fence about this one. In one sense, I get the combination of the old-world themes with the modern sound, since we're talking about taking highly technological equipment being used to send a person back to the 1100s. I like that. It also does good job of sending a sense of urgency to the player to get out of the area before major trouble ensues.

On the other hand, the very industrial, mechanical feel of the modern elements screams steampunk/industry to me, not the cybernetic technology and level of espionage at play in Assassin's Creed. Additionally, the slow, methodical, clunky battles with foot soldiers that you'll likely get into during your flight (though you can avoid them, it's difficult when they congregate directly outside the base) are an odd match. That, however, is more the fault of the game than the score, but designers must constantly think of how the two fit together. The score is not there to distract the ears while the eyes play the game.

As an aside, most of what we think of as instruments weren't invented until the 1800s at the earliest (including the piano). Rolling backward, we had things like crumhorns and sackbuts, and farther back you are restricted to gut-string guitar, lute, harp/lyre, and a basic woodwinds like a pan flute.

So pre-Renaissance settings and Bronze Age fantasy games that use heavy brass or even electric instruments are being very anachronistic. This often feels weird, even if you can't put your finger on precisely why. (300, I'm looking at you.) Assassin's Creed has a get-out-of-jail-free card due to ostensibly being set in the modern day, but it would be nice to see them pay more attention to this fact.


Level Design

Video games typically have long segments of player-controlled action that require atmospheric background music on endless loop. This is certainly something that's unique to video games, and presents its own difficulties. The composer must create audio that (a) tells the story of the area or level, (b) has enough interest to avoid fading out of the player's notice entirely, (c) can be listened to ad nauseum without driving the player insane, and (d) stays cohesive with the rest of the music in the game. Game composers in general seem not to struggle too much with B, but A, C, and D can present serious hurdles.

What does it mean to "tell the story" of a level? Does it mean "this is an ice level, so we should sound aloof and use high, cold instruments like the glockenspiel and high-register piano a lot" and "this is the fire level, so let's use heavy brass and industrial sounds to bring the heat"? (Like Metroid Prime, maybe?) Does it mean "this is where they are in the story and how the characters currently feel, so let's score to that"? Do you foreshadow events to come? Look back on the past?

The answer is all of the above, in varying degrees. Very few games have managed to consistently pull this off. The field is littered with boring, cliché, and just plain bad level music. First let's look at the good, though. To do this, I'll turn to Nier, a woefully underrated adventure that has one of the best scores I've heard in a game.

It's a well written and very cohesive work, and every piece, even the town music, all lean into this general feeling of malaise and pervasive dread that the game possesses. This is not a happy game. It is a game, first and foremost, about loss -- of loved ones, of a life gone by -- and the music is a large reason that this is conveyed so well.

Nier's soundtrack was written to convey this profound sadness in every track; the composer noted in an interview that even the "thrilling" high-tension boss battles were composed with this pervading feeling of sadness in mind. Interestingly, the developer thought so highly of the music that elements of the game design were shuffled around to match the music, rather than the other way around. This interplay where the director occasionally bows to the composer is commonly present in film, notably.

One of the main reasons it feels so cohesive and fitting as level music is the presence of one or two female voices on each track. In the story there are two women who feature prominently, and in fact who (minor spoiler) turn out to be a sort of watchdog/architect duo, looking after the world. One, at least, is always found singing and strumming a guitar. They also have an "on-stage" performance. So in a sense, they sing the soundtrack as they watch over the events of the world.

In other words, the female voices that are heard on nearly every track have actual diegetic in-game significance, which is a very nice touch. The music and the plot are tied together in a compelling and interesting way, and this heightens the narrative considerably. Let's look at a few specific examples from the game:

What do you "see" when you listen to this?

Not just general mood, but more specific. I'll tell you what I see: I see a place of great history and mystery. The minor key and open underlying string pad add to the openness, and the soaring soprano and what I would describe as Vaguely Middle Eastern Percussion place it in a desert region (notice that, as with the Hobbits example, we rely on the listener's outside knowledge to shorten what we have to explain). In truth, this level is an ancient desert temple of unknown origin, which (as is cliché in games) holds a secret of great power. I also hear a great pathos to the music, as if it's crying out for things long gone.

I can attest to the fact that listening to it for over an hour won't wear on you, and as we will soon see it is cohesive with the rest of the game, while at the same time expressing its unique location. Now the next cue:

This cut uses the same voices (though laid a little lower in their register) and the same general "feel" to the song as the first example. It sounds as though someone is looking...out a window, or to sea maybe, and longing to see someone or something. This is only played in a single room in the game, which is not ever necessary to visit for main story reasons, though if you side-quest a lot you will be visiting often. It is the longest and perhaps most bittersweet sidequest in the game.

In brief, this is the interior of a lighthouse, whose sole occupant is an elderly woman whose husband went out to sea long ago. He began to write her letters, and sent her something important, but his death notice reached the post office first. The postman couldn't bear to tell the woman of her lover's demise, so he writes letters to her in her lover's name, with the town's knowledge. For 50 years. In the end, it turns out she knows this and was playing along to keep some façade for everyone else in town so they wouldn't all mourn the loss. You get the idea.

Next cue:

This track begins with the sounds of metal on metal, which instantly take me to "factory" (not "cybernetic espionage," which was a confusing thing about the earlier Assassin's Creed example). There's an indifferent drive to it -- a minimalistic Philip Glass tribute -- as if this factory is completely automated like clockwork, and pumps out the same thing day after day without stopping or changing in any way. Oh, but that voice is back, layering another yearning for bygone days, and the sadness that has been brought upon the land.

These are but a few examples of the many intriguing and cohesive sounds you can hear in the Nier score. I would once again encourage you to pick up this very good game. At least, as long as you don't mind soul-crushing despair as a central theme. But we've seen the good, let's study some of the bad.

I loved Persona 4. It, like its predecessor, has become one of my all-time favorite games for its mature (in a good way) look at human nature, but also because it is masterfully created to have this cohesion of theme that runs through the story, world, and even the game mechanics themselves. However, the music falls flat in support of this otherwise tight package. Contrast the cues below with what you just listened to from Nier. Do they show the same cohesiveness, whether it be in instrumentation, style, or anything else? Do they fit the visuals or theme of the game? Do they evoke any of the feelings the game leads you toward? Emphatically, they do not.

What do you visualize when you hear this?

To me it sounds kind of like we're in a rave, or a late-night meat market bar. We're actually in a sauna. In the story, there are overtones of a gay bar, which leads a little more toward this music, but as the character of Kanji is further revealed to you this doesn't seem to fit either. Now, just imagine listening to it for over an hour on endless loop. In fact, just hit that replay button in the YouTube box and do it yourself two or three times. It wears on you, right?

Staying with the same game... What do you hear?

I don't even know what the composer is trying to imply here because the theme of the music is so muddled. A bit muzak-y, which to me implies just waiting around; a bit playful; a bit "we got some work to do, buddy". In truth, you hear this music in what is supposedly a secret underground base of someone who grew up reading and acting out detective novels. So... right. They're running from a normal life path and are even trying to escape their own gender, but does the music say any of that? I don't hear it.

One more Persona 4 track:

Here we have what is perhaps the most egregious example yet. At this point in the game you are well aware that those you're trying to save in each level are experiencing horrible things and will wind up hanging dead upside down from a light post unless you can save them. At this point, they have gone too far -- the victim is your nine-year-old cousin with whom you are staying, and who has probably built up the most emotional cachet with you throughout the course of the game. At this point, you feel anger, and a sense of dread of what could happen to poor, innocent child, and an incredible urgency to find her before she ends up like the rest of the victims. So to drive home this point, you get a song that would be perfectly at home on J-pop radio. Huh?

Persona 4's tracks clearly do not tie together in any way. There's no common theme, no musical ties between the cues; no real cohesion at all. Granted, each location is very unique, but ultimately the player is there to accomplish the same goal in each one. Even granting the unique locations, the cuts don't seem to be appropriate for their individual placements. That whole "To Picture" thing that I mentioned? Not found here. In a game that is otherwise masterfully created, this is disappointing, to say the least.

Nier, on the other hand, found a way not only to convey the feel of the individual locations, but create an over-arching theme and tie each piece back to it -- bonus points for doing so in a diegetical way. This is a perfect example of writing a score instead of just a soundtrack. Unfortunately, it is the exception rather than the rule.

Subtlety, Subversion, and Secret Meanings

A passable score will accomplish no more than to echo the actions that are occurring on-screen. This isn't a bad thing per se, but there are many opportunities to actually add to and enhance the viewer's understanding of the visual elements through music. The score can quickly convey many thoughts, both consciously and sub-consciously, that diegetic narration can ever hope to. It can represent the more general theme of the work (as we saw above with Nier); it can bring back an old theme to shift the perspective; it can do many things. One easy example of this comes from the late Bernard Herrmann, perhaps the greatest film composer to ever live.

Joseph Mankiewicz's The Ghost and Mrs. Muir is, at first glance, a 1940s romantic comedy that happens to include a ghost. Here is Herrmann's score for the opening credits, setting the tone for the film:

What do you hear? Does this scream romcom? It almost sounds like a mistake, right? It's dark, brooding, and melancholy. Feels out-of-place. But if you watch the entire movie you realize that one of the main themes of this movie -- unspoken by any character yet omnipresent throughout the footage -- is the fleeting character of human earthly life, and the inherent sadness in contemplating the passage of time. Herrmann sensed this and gave the film an incredible score that elevated it to one of the all-time greats.

As a counter-point see the trailer for Wise's I Want to Live!, a 1958 movie about an infamous murder trial. It's a typical jazzy '50s score by Johnny Mandel that at first feels appropriate, because the protagonist is a jazzed-up wild girl from the '50s. But as the movie progresses and she's tried and executed for a murder she swears she didn't commit, the movie turns very dark indeed. I think Mandel saw this and used less and less music as the movie went on, but nevertheless the music has aged very badly and simply doesn't carry the weight it needs to for such a heavy subject. This trailer gives the general impression of the score:

This is an important technique, and one upon which multiple papers could be written. However, its use in games has been basically non-existent to this point (save one exception). Suffice it to say that game composers -- and directors -- would do well to remember they have this tool in their kit.


Less Is More

Movies, television, and games certainly don't require a wall-to-wall score. It's a mistake many scorers make (and inexperienced musicians in general), but the score can often have far more impact if it is present only when it is needed.

In fact, it is not uncommon for a significant portion of a film's music to not ever make it onto a soundtrack; most musical moments come in brief little vignettes known as cues, usually under a minute and often barely conveying an eight-note theme to support dialogue.

A couple decades ago movies realized that, hey, we're spending millions of dollars on foley to get very deep and impactful action sounds like bullets firing, explosions, etc., and that maybe some of those major action sequences have more impact if we don't score them at all. The lack of music in these cases can actually bring you closer to the action. Think about the opening battle scene in Saving Private Ryan, the invasion of Normandy.

Throughout that 20-minute battle sequence (I'm not going to link it, but I bet you've seen it) there is only one tiny section with music: when the camera briefly dips under the water in a bit of a body-count shot. That little shred actually brings attention to the fact that there isn't any sound elsewhere in the scene that isn't the visceral sound of major battle.

This is something that video games, by and large, have yet to realize. Everyone is familiar with the general level music transition to the "hunker down behind this oddly-convenient chest high wall, we have enemies a-comin'!" fast-paced music, right? Maybe that's not necessary.

Lack of music can also build tension, especially when it is released properly. Here's a great example from the first Gears of War (NSFW language):

There is no music at all until a full minute in, and when it comes in it's a barely-perceptible string pad to simply build the tension a little more. And then, 20 seconds later, release. The score continues its minimalism after that, but it's a good example of how very few lines can be used to good effect.

And for an example of something that would have had much more impact without a score, check the first thirty seconds of this:

The music here is so disparate from the visuals it almost brings it into the realm of comedy, which I don't think is what the composers intended.

Instrumentation, chord restriction, and other limitations are also ways to heighten impact. Bernard Herrmann wrote the entire famous Psycho score using only a string section: no winds, no brass, not even a timpani. I think that this small, focused sound makes the terror far more intimate, which is the same thing Hitchcock's cinematography was trying to express. Do you need an example? Of course you do:

Conclusion

There is plenty of evidence of the many ways in which video games can be enhanced by a better take on their scoring, elevating them to much greater heights than they have previously attained. The score truly can be another character in the story, adding interjections and subtext of its own. How can we go about effecting such change?

I think the first thing is mere education. Being able to recognize and discern good uses of game music and how they work, and picking them apart from bad, is something from which all video gamers can benefit. Ideally video game reviewers and designers themselves will begin to pick up on such nuances as well, but only if gamers writ large begin to demand it.

As we dip into the uncanny valley and emerge on the other side at nearly photo-realistic visuals, it's no longer enough just to have "good graphics". Over the next decade it will not be good graphics that win recognition, but good and cohesive art design (in truth, this is already happening). The singular artistic vision, contributed to by many -- just like a movie -- is the future of growth in games. This should -- this must -- include a much deeper understanding of visual-oriented music scoring and how it can enhance the gaming experience. We must create more immersive, more supportive scores, and the time is now for it to happen. How?

The easiest fast-track is budget. A typical film has a music budget of somewhere around 5 percent. Were we to extrapolate that to video games, we could see things like Call of Duty with $5 million budgets or higher, and that's just based on the sheer development budget (exclusive of marketing and distribution). That much money could land you the likes of Howard Shore standing in front of The London Philharmonic, if you so desired. It's no accident that the game rumored to have the highest music budget of any to-date (Final Fantasy XIII) is one of my favorite soundtracks -- and you can read a great deal about why in this blog post.

Of course, the second half goes back to education and thought investment. It's not enough for Skyrim's designers to say "Let's have a Dwaemer male choir sing the Elder Scrolls theme, 'cause that'll be awesome! And huge! Fuck yeah!" There has to be far more deliberate thought into what can be done with the music, and the myriad ways in which it can affect the setting. Thought and care must be put into the music. We've seen several examples: Jerry Goldsmith's admonition to the trumpeter on the Chinatown score, Nier's producer bowing to a superior score and re-arranging the game around it, Herrmann writing a very intimate score to support Hitchcock's very intimate movie, or the linked leitmotif discussion of Final Fantasy XIII... this is where the growth opportunity lies.

There are patches of sunlight: aside from the games discussed here, Dead Space 2, Journey, and a few other games have scores that will surprise many people... but we're still a long way off from full understanding and implementation. Publishers and developers should realize that the music is worth spending money on. Producers should think very clearly about their emotional and intellectual goals with their projects, and how to convey those to composers.

The composers themselves, who have been recalcitrant to study the more cinematic aspects of music, need to pull from the deep well of knowledge that hundreds of years of compositional experience can offer. While we've seen a move toward more well trained composers like Jason Graves (Dead Space series, among others), the large majority of game composers simply don't have the training.

People who have been in the game for 10-20 years, have a great ear for melody, and know how to work within the confines of adaptive audio, for sure, but the long narrative of music seems to have eluded us to this point. Those few occasions where the "big boys" of film have been invited in for games (like Hans Zimmer writing Cues for Modern Warfare 2) it's been to make their PR team happy with another box-art bullet point by writing a couple cues and then let others do all the heavy lifting and actual composing.

Indie games have proven that gamers are willing to actively invest brain space in game music. The BIT.TRIP series, Botanicula, Audiosurf... My Steam list is filled with games that integrate music fully into the game. Though these games are tangential to this discussion, they can serve as proof-of-concept that spending time to concentrate on the music isn't throwing money down a well.

As to the players themselves, well, they would do well to think about what they really enjoy about game music. Are they just looking for some tunes to chill to while they game, as a great many game soundtracks show? Or are they willing to take the step to become a more active audience in a game's audio-visual synergy, to invest themselves more into the experience, and reap the greater rewards?

Return to the full version of this article
Copyright © UBM Tech, All rights reserved