This writeup, and the code it’s meant to accompany, are still a bit of a work in progress. In particular, while working on this writeup, I realized that there’s probably a simpler way to represent a lot of the data, but I haven’t had the time to work it out yet.

There’s currenty two implementations of this: one in Typescript for SMEditor and one in C++ for ITGMania. They’ve diverged in their specifics, but I’ve done my best to keep their overall function in sync.

This work was not done in a vacuum. The groundwork was laid by tillvit’s implementation of this algorithm as an experimental feature of SMEditor. A lot of my efforts were initially focused on improving the performance and readability of this implementation, and that was followed by a long process of iterating on the cost functions used. This writeup focuses mainly on the cost calculations performed, and is an attempt to explain the rationale behind them. I’ll include links to relevant parts of the code for both implementations.

What’s so hard about this?

For simple cases, predicting how a player will hit a note isn’t actually all that difficult. There’s three basic assumptions that you can work with that will cover probably 90% of most step charts:

players generally want to be facing forward
players prefer to alternate feet when pressing notes
players prefer to minimize the amount of movement made when pressing jumps

The remaining 10%, however, consists mostly of what the community refers to as “tech”. This includes things like:

Brackets
(hitting two notes with one foot)

Footswitches
(consecutive notes hit with alternating feet)

Jacks
(consecutive notes hit with the same foot)

Crossovers
(crossing feet over the center to hit notes)

They're basically just fancy ways of tapping the arrows, used to make step charts more complicated. For my use case, though, getting this 10% correct is very important.

Alright, so how does this work

The general idea is that we need to build a graph, with each node representing one possible state (a foot placement for a given point in time), and each edge representing the difficulty of moving from one state to the next. There’s two interesting parts to this: determining the possible foot placements, and calculating the movement difficulties.

Data structures and concepts

Before getting too deep into things, let’s define some concepts.

The dance stage is represented as points on a 3x3 grid (or 6x3 in the case of doubles), where 0,0 is the left bottom (or farthest from the screen). So for singles, this looks like:

left:  0,1
down:  1,0
up:    1,2
right: 2,1

These are essentially unitless, but if it makes it easier to think about it, each panel on a DDR dance stage is roughly 1ft square.

This data is represented as a StageLayout object, which contains an array of StagePoints containing the coordinates of each arrow, along with some other useful information like the number of columns, which columns are considered “up”, “down”, or “side” arrows.

interface StagePoint {
  x: number
  y: number
}

interface StageLayout {
  layout: StagePoint[]
  columnCount: number
  upArrows: number[]
  downArrows: number[]
  sideArrows: number[]
}

The step chart is represented as a series of Rows, where each row contains one or more Notes. The notes in a row are laid out in columns, zero-indexed, ordered as left, down, up, right. Each note is one of several note types:

a Tap is a standard note
a Hold Head is the beginning of a hold or roll (these are currently treated as equivalent for our purposes)
a Hold Tail is the end of a hold or roll note
a Mine is a note that is to be avoided

(There are other note types (lifts, fakes) that we’re not concerned about right now)

Besides the notes in the row, we also want to keep track of the time that a row occurs, as well as things like note count, whether any hold notes are currently being held (remember that there is no “hold body” defined in Stepmania), and whether there any notes were preceded by a mine.


enum NoteType {
  None = "0"
  Tap = "1"
  HoldHead = "2"
  RollHead = "3"
  HoldTail = "4"
  Mine = "M"
}

interface Note {
  column: number
  type: NoteType
  hold?: number 
}

interface Row {
  time: number
  notes: Note[]
  holds: 
  noteCount: number
}

Each Node of the graph contains a given State, which represents a specific row of the step chart, with a specific foot positioning. Foot positioning is tracked by indicating which part of the foot is on which arrow.

This is still a work in progress in my opinion, I’m still trying to come up with a more clear way to represent this information.


enum FootPart {
  None
  LeftHeel
  LeftToe
  RightHeel
  RightToe
}

interface State {
  rowIndex: number
  time: number
  columns: FootPart[]
}

interface Node {
  state: State
  neighbors: Map<number, Node>
}

This is all somewhat simplified from what has actually ended up being implemented. Partly because it’s computationally cheaper to pre-compute some data for things like the States, and partly because, about halfway through writing this, I realized a

Determining possible foot placements

For the vast majority of steps in a step chart, we really only have two possible foot placements: the player presses the note either with their left foot, or with their right. This applies to single notes and non-bracketable jumps. But, as I mentioned earlier, tech make all of this more complicated.

For instance, with a single bracketable jump, we now have 6 possible foot placements.

It could be a regular jump:

Or it could be a bracket of some kind:

Hold notes also complicate matters. Tapping a note while holding another has something like 8 possible foot placements, due to the fact that we have to consider that the player might execute a holdswitch or a bracket tap.

Granted, most of these positions are unlikely, and some aren’t really physically possible. But I’ve found that it’s very difficult to make assumptions about which positions could be considered “valid”, because the context of what notes came before, and what notes come after, is very important. That, and I just know that the moment I try to eliminate some “invalid” moves, someone is going to release a chart that explicitly wants players to perform them.

It’s also worth pointing out that, besides brackets, I don’t try to predict whether the player will tap an arrow specifically with the heel or the toes of their foot. This is done because, frankly, that would just make all of this even harder than it already is, and, for my purposes, that information isn’t that important.

Calculating movement difficulties

After a good deal of experimenting and analyis, I’ve ended up with a set of 14 different cost functions and associated weights, that make the total difficulty. They can be broken down into four categories:

Basic Movement
Brackets
Footswitches vs Jacks
Other Obscure Stuff

And then at the end there’s some costs that still remain in the coddebase, but aren’t actually getting used anymore.

Basic Movement

First, the basic movement costs: Distance, Facing, and Doublestep.

These three costs alone do a pretty good job of predicting a player’s movement, especially on easier, non-technical charts, and especially if you don’t want to bother with predicting brackets.

What’s so hard about this?#

Alright, so how does this work#

Data structures and concepts#

Determining possible foot placements#

Calculating movement difficulties#

Basic Movement#

Distance#

Facing#

Doublestep#

BRACKETS#

Twisted Foot#

Slow Bracket#

FOOTSWITCHES AND JACKS#

Slow Footswitch#

Jack#

Sideswitch#

Other Obscure Stuff#

Bracket Tap#

Bracket Jack#

Holdswitch#

Spin#

Missed Footswitch#

Mine#

Other Costs That Aren’t Getting Used Anymore#

Other#

Crowded Bracket#

Jump#

What’s so hard about this?

Alright, so how does this work

Data structures and concepts

Determining possible foot placements

Calculating movement difficulties

Basic Movement

Distance

Facing

Doublestep

BRACKETS

Twisted Foot

Slow Bracket

FOOTSWITCHES AND JACKS

Slow Footswitch

Jack

Sideswitch

Other Obscure Stuff

Bracket Tap

Bracket Jack

Holdswitch

Spin

Missed Footswitch

Mine

Other Costs That Aren’t Getting Used Anymore

Other

Crowded Bracket

Jump