ruby.tutorials.bdd.UsingBddToDevelopABasicAlgorithm

Overview

This tutorial presents the mechanics of BDD, refactoring and touches on continuous integration by implementing the Shunting Yard Algorithm. The point of this tutorial isnot to write a solution to this algorithm. The algorithm is a vehicle for practicing the mechanics of BDD. Having said that, you will end up solving much of the algorithm.

This tutorial has been written for you to actually read and type. The Kinesthetic Learning experience of actually typing the code, running the tests and checking in the work significantly reinforces learning habits that will serve you well. Much of what you practice in this tutorial can eventually become habit/muscle memory with enough practice. Consider this an opportunity to start. You’ll get frequent feedback throughout. And while this may represent a very different way of working from what you are used to, here are a few comments:

What you are practicing now is just something you’ve learned, so this really is no different.
Try it for the duration of this tutorial. It will only be a few hours of your time, so if you find it useless after that time, stop doing it.

Good Luck!

Introduction

Synopsis: The Shunting Yard Algorithm takes an expression in infix notation and converts it to reverse polish notation.

Here are a few examples:

Infix	RPN
1 + 3	1 3 +
1 + 3 - 4	1 3 + 4 -
1 + 3 * 2	1 3 2 * +
3 + 1 * 4 - 2 / 3	3 1 4 * + 2 3 / -
a + b	a b +
( 4 + 5 ) * 3	4 5 + 3 *
( ( 1 + 3 ) / ( 9 - 5 ) ) * ( 2 + 3 )	1 3 + 9 5 - / 2 3 + *
f ( 3 )	3 f
f ( 4 , 1 , a , d )	4 1 a d f
f ( g ( ( 1 + 3 ) * 4 ) / x ( y ( z ) ) )	1 3 + 4 * g z y x / f
a = b += 5	a b 5 += =

To better understand the algorithm, consider spending some time reading it here. However, after reviewing the algorithm, you might come up with several issues that your code will need to handle. Here’s a list of those issues:

Basics

Read tokens from a string (assume space-separated for now)
Return space-separated tokens in a string

Tokens

Numbers/variables [e.g., 5, x]
function calls [e.g. f(), f(3, 1), f(a, b, 3)]
Operators

Operators

unary [e.g., 3 !], binary [e.g. 4 + 5], ternary [ 5 > 3 ? a : b ]
Might just be one, [e.g., 4 + 5], or many, [e.g. 4 * 6 / 5 + 2 ^ 6]
Precedence, [e.g., * / before + -, before = +=]
Operators are associative, left -> right [e.g., +], or right -> left [e.g., ^ =]

Parenthesis

Used for precedence [e.g., ( 1 + 4 ) * 6]
Used in function calls [e.g., f ( 4 , 5 ) ]

Expressions

Simple expressions, [e.g., (x + y)]
Arbitrarily nestable, [e.g., ( 8 * ( 4 + 1 ) / ( 9 - 12 ) )]
With function calls, [e.g., f ( ( a + b ) ^ q / g ( 5, a, 1 ) )]

Even with the published algorithm, this is a lot of work. You do have the option of just coding up the algorithm as stated but will you know that it works? If you’re not sure, then you’ll probably want to write some tests. This approach can work and having tests is certainly better than not having tests. However, this tutorial will allow you to use Behavior Driven Development to approach this problem using very small steps. There are many reasons to do so:

See results quickly
Verify expected versus actual behavior
Provide a form of regression to know if something you just did broke was was already working
Check in code often so if you completely mess up, you can use the repository how it was meant to be used
You can take breaks often, so interruptions do not impact your work as much

Getting Started

To follow all of the steps in this tutorial you will need four things:

A Ruby interpreter (you can type ruby at the command line and it is found or you know how to use Ruby in your favorite IDE)
The RSpec ruby gem
A text editor (emacs, vi, or full-features IDE’s like Eclipse or IntelliJ
A revision control tool (strictly speaking this is optional, however there will be several places where I recommend you check your work in, even if you don’t normally do so, consider trying working outside of your comfort zone)

For this exercise, you’re going to keep things fairly simple:

Create a directory somewhere that can hold your Ruby source code
In that directory, create a file called shunting_yard_algorithm_spec.rb with the following contents:

    describe "Basic Algorithm Usage" do
    end

This describes a “context” under which examples can execute.

Save the file and verify everything is working:

    Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb
    
    Basic Algorithm Usage
    
    Finished in 0.003543 seconds
    
    0 examples, 0 failures

Congratulations, you have successfully written a context in RSpec. It’s missing any actual examples, and that is what you will add next.

The 0th Example

You will create an example whose primary purpose is to get the production class created with its usage documented in an executable form. This first example creates a skeleton, but it’s an excellent way to start because you get something created and working almost immediately. From there, it’s all about adding features while keeping your code clean. It is much easier to modify something that exists than something that does not exist.

Add an example to get your context (an “it” introduces a so-called example):

    describe "Basic Algorithm Usage" do
      it "should convert '' to ''"
    end

Run your example again:

    Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb
    
    Basic Algorithm Usage
    - should should convert '' to '' (PENDING: Not Yet Implemented)
    
    Pending:
    Basic Algorithm Usage should should convert '' to '' (Not Yet Implemented)
    
    Finished in 0.012712 seconds
    
    1 example, 0 failures, 1 pending

Theit indicates something that the production code should do. Right now this example is a placeholder and, as indicated, is not yet implemented. This is a great way to capture ideas that won’t actually cause things to fail. Just jot down your ideas and them go back and work on one at a time. Warning: don’t get too far ahead. You will probably find out for yourself that the very examples you want to create change as you make progress on the production code. It’s OK to write one or a few, butdo not try to get all the examples added but unimplemented before you start completing examples.

Next, make the example “complete” in the sense that RSpec will no longer indicate it is “(Not Yet Implemented)”.

Update your example:

  it "should should convert '' to ''" do
  end

Run your examples to confirm that everything is passing:

    Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb
    
    Basic Algorithm Usage
    - should should convert '' to ''
    
    Finished in 0.012612 seconds
    
    1 example, 0 failures

At this point, you have a complete context with all of its examples passing. You want to get back to this condition frequently. You need to create an example that somehow drives the development of new production code. Every example you’ll write will have at least three parts:

Phase	Description
Setup	Create everything necessary for an example to execute. Create instances, connect objects, put things into a well-defined, known starting point. For this exercise, you’ll always start with a fresh “converter” before each test.
Execution	Given a known-starting point, exercise the production code in some way with the intent of generating an expected result.
Validation	You knew the starting point (you control that), you know how you exercised the production code, verify that the production code did what you expected it to do.
Teardown	Not always necessary, you should write your examples such that they leave no footprint that could cause other examples to fail. For this tutorial, you will not have any teardown requirements because every example will begin with an in-memory object create beforeeach example executes.

You’re going to take small steps to keep things running and passing often. Sometimes these small steps will seem too small. When you think that, ask yourself “compared to what?”. If something is too small, that’s because you’re expecting to work in larger chunks. Fine, try this and see if it fits. At the end of the tutorial if you haven’t warmed up to the idea, you still learned something useful.

Setup

Update your example to resemble the following:

  it "should should convert '' to ''" do
    algorithm = ShuntingYardAlgorithm.new
  end

Run your example (it will fail)

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to '' (ERROR - 1)

1)
NameError in 'Basic Algorithm Usage should should convert '' to '''
uninitialized constant ShuntingYardAlgorithm
./shunting_yard_algorithm_spec.rb:3:

Finished in 0.01362 seconds

1 example, 1 failure

Your example is back to failing. In this case, the line you just added makes reference to a class that does not exist. This is a normal occurrence in BDD - write example code to exercise production code that does not yet exist, then write just enough production code to get the test to pass.

For this first example, you can work in a single text file.

Setup

At the top of the file add the class (here’s the whole file):

class ShuntingYardAlgorithm
end

describe "Basic Algorithm Usage" do
  it "should should convert '' to ''" do
    algorithm = ShuntingYardAlgorithm.new
  end
end

Run your example:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to ''

Finished in 0.012638 seconds

1 example, 0 failures

This is all that is required for setup for this test. Next, you need to execute some code.

Execute

Update your test as follows:

  it "should should convert '' to ''" do
    algorithm = ShuntingYardAlgorithm.new
    algorithm.convert ''
  end

Run you example:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to '' (ERROR - 1)

1)
NoMethodError in 'Basic Algorithm Usage should should convert '' to '''
undefined method `convert' for #<ShuntingYardAlgorithm:0x58569c>
./shunting_yard_algorithm_spec.rb:7:

Finished in 0.013271 seconds

1 example, 1 failure

Once again, your test is failing. Instead of a missing class it’s due to a missing method.

Add that method to the ShuntingYardAlgorithm:

class ShuntingYardAlgorithm
  def convert(expression)
  end
end

Run your example:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to ''

Finished in 0.01262 seconds

1 example, 0 failures

Great, your example is back to working. You just finished the execution part. Now it is time to have your example verify the results it expected.

Verify

Update your example to perform validation:

  it "should should convert '' to ''" do
    algorithm = ShuntingYardAlgorithm.new
    result = algorithm.convert ''
    result.should == ''
  end

Run your example:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to '' (FAILED - 1)

1)
'Basic Algorithm Usage should should convert '' to ''' FAILED
expected: "",
     got: nil (using ==)
./shunting_yard_algorithm_spec.rb:10:

Finished in 0.013533 seconds

1 example, 1 failure

The example is back to failing. To get your test to pass, you’ll simply change the ShuntingYardAlgorithm.convert method return a value that will cause the test to pass.

Update your production code:

class ShuntingYardAlgorithm
  def convert(expression)
    ''
  end
end

Run your example:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to ''

Finished in 0.012664 seconds

1 example, 0 failures

Congratulations, you’ve made it through a complete TDD cycle. Now is a great time to commit this work in to a repository before moving on.

Check In

A great time to check in is anytime your tests are passing. So after you’ve written a new unit test and got it to pass, check in your work. Why?

If you mess up, you can use the revision tool to compare what you have with what was already checked in to better pinpoint the problem.
If you subsequently mess up, you can revert back using the tool
Other people can see your work sooner (that can be scary, but since you’re writing unit tests now, you’re already a leg up on the average bear)

This tutorial briefly demonstrated git. Istrongly encourage you to use some revision control tool throughout. This goes back to learning by doing. Thinking about doing this will not activate your brain in the same way doing it will. If you want more details on using git, read the git tutorial.

Add this directory into a revision control system and make sure it is checked in:

Macintosh-7% git init
Initialized empty Git repository in /Users/schuchert/src/ruby/bdd_tutorial_1/.git/
Macintosh-7% git add shunting_yard_algorithm_spec.rb 
Macintosh-7% git commit
Created initial commit ba33a1c: Initial checking. First example demonstrating basic algorithm usage passing.
 1 files changed, 13 insertions(+), 0 deletions(-)
 create mode 100644 shunting_yard_algorithm_spec.rb
Macintosh-7%

Summary

Here’s what you have created so far:

class ShuntingYardAlgorithm
  def convert(expression)
    ''
  end
end

describe "Basic Algorithm Usage" do
  it "should should convert '' to ''" do
    algorithm = ShuntingYardAlgorithm.new
    result = algorithm.convert ''
    result.should == ''
  end
end

This probably seems small, however you have described the API of the class used to translate between an infix and RPN notation.

In addition, you practiced Martin’s three laws of TDD (paraphrased) - within the context of BDD:

Write no production code without a failing test
Write only enough of a test so that it fails
Write only enough production code to make the test pass

This is what you did.

You first created a basic context and example that referred to a missing class (laws 1 and 2)
You added a basic definition of that missing class for the example to pass (law 3)
You sent that new class a message that it did not yet implement (laws 1 and 2)
You then added that missing method without a body (law 3)
You added a check to your example to make sure your results were as expected (laws 1 and 2) - they were not
You hard-coded the method to actually return a value to make the test pass (law 3)

Now is a great time to take a break.

Working up to a basic operation

Now you’ll work with both constants and basic operators like + and -. As you work though these next few examples, you’ll end up writing code, changing it and cleaning it up. You will be adding refactoring to your tool set. This particular kind of refactoring might vary from your refactoring experiences. You will be performing simple refactoring that takes seconds and minutes not hours, days or weeks.

Create a new context and example (the location is irrelevant, however for your purposes consider adding new contexts and examples after the last ones added):

class ShuntingYardAlgorithm
	...
end

describe "Basic Algorithm Usage" do
	...
end

describe "Constants" do
  it "should convert a single constant to itself, e.g., 42 ==> 42' do
    algorithm = ShuntingYardAlgorithm.new
    result = algorithm.convert '42'
    result.should == '42'
  end
end

Notice that it was OK to write this entire example before stopping to check. The example does not add any new methods to the existing class. Indeed, going forward with this example you can write complete examples. This is because the thing you are writing has a simple API. Even so, eventually your example writing will get to this point on any unit under test.

Run your examples:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to ''

Constants
- should convert a single constant to itself, e.g., 42 ==> 42 (FAILED - 1)

1)
'Constants should convert a single constant to itself, e.g., 42 ==> 42' FAILED
expected: "42",
     got: "" (using ==)
./shunting_yard_algorithm_spec.rb:19:

Finished in 0.016461 seconds

2 examples, 1 failure

The new example fails but the original example is intact. Now you need find a way to change the production code such that you do not break existing examples and make the new example pass.

There are a few more things to consider:

The order in which examples execute is, by design, unknown. This means you should write examples independently of each other. If there is a need for common setup, there are ways to accomplish that.
Try to avoid backtracking. Break no existing examples. Why? An example describes a contract or a promise of a certain behavior. Breaking an example, while sometimes necessary, should be carefully considered.
In this case, it is a simple change to support the old example, and get the new example passing. Make the following change:

class ShuntingYardAlgorithm
  def convert(expression)
    expression
  end
end

Run your examples to make sure things pass:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Basic Algorithm Usage
- should should convert '' to ''

Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Finished in 0.015081 seconds

2 examples, 0 failures

Congratulations, you’ve just made it through another application of the three laws of TDD. Since all examples are passing, now is a great time to check in your work.

So, check in your work already:

Macintosh-7% git add shunting_yard_algorithm_spec.rb 
Macintosh-7% git commit
Created commit c6e3ce2: Added support for basic constants.
 1 files changed, 10 insertions(+), 2 deletions(-)
Macintosh-7%

Refactor the Examples

Why You Should Care

Automated tests give us amazing leverage. If something is not quite right, hard to understand or inefficient, then you might need to change it. What would happen if you could change the implementation of something and know with near certainty that you did not break anything? What if your requirements change and force a redesign? Maybe you are just adding to an existing class. Did what you just do break it? How can you know? The answer is automated tests (or examples in this case). What if you need to build something from the ground up? If you happen to be rewriting something with existing examples, then you have an executable specification from which to work.

I have had several such experiences. I wrote the obligatory login service for a single application. I wrote several unit tests (not even following TDD, but rather I was test infected) to make sure I got all of the business rules right (there were maybe 70, give or take). After we deployed the first application, we used the first application as a prototype for a suite of applications. A suite of applications, of course, requires single sign-on. This required significant rewriting both because of new requirements as well as a need to extend the base architecture to support multiple applications. I had 70 working unit tests. I added around 40 more tests to accommodate the new requirements. Also, with the new system, the underlying implementation became a mess and strongly suggested refactoring to the state design pattern. While all of this was going on, I keep the unit tests passing. It make a daunting task much easier.

So unit tests are a valuable asset and should be treated with the same (maybe even more) TLC that you’d treat production code. Thus, keep your tests clean and well written as you work.

Update your both your contexts by putting them in a containing context:

describe "Shunting Yard Algorithm" do
  before(:each) do
    @algorithm = ShuntingYardAlgorithm.new
  end

  describe "Basic Algorithm Usage" do
		...
  end

  describe "Constants" do
		...
    end
  end
end

Run your examples to verify you have not broken anything:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Finished in 0.016944 seconds

2 examples, 0 failures

Now update the first example to use the @algorithm instance variable:

  describe "Basic Algorithm Usage" do
    it "should should convert '' to ''" do
      result = @algorithm.convert ''
      result.should == ''
    end
  end

Run your examples to verify you have not broken anything:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Finished in 0.01684 seconds

2 examples, 0 failures

Notice that both examples do the same thing:

Initialize (now shared)
Execute the algorithm and store the result
Validate the result against the expected result

You’ll be doing this several times, so we can generalize from these two examples to simplify the work.

Add the following two methods:

describe "Shunting Yard Algorithm" do
  before(:each) do
    @algorithm = ShuntingYardAlgorithm.new
  end

  def a_conversion_of expression
    @expression = expression
  end

  def should_equal expected
    result = @algorithm.convert @expression
    result.should == expected
  end

	...
end

Run your examples to verify you have not broken anything:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Finished in 0.01707 seconds

2 examples, 0 failures

Use these new support methods in the first example:

  describe "Basic Algorithm Usage" do
    it "should should convert '' to ''" do
      a_conversion_of ''
      should_equal ''
    end
  end

Run your example to verify you have not broken anything:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Finished in 0.017171 seconds

2 examples, 0 failures

Finally update your second example to use this support method:

  describe "Constants" do
    it "should convert a single constant to itself, e.g., 42 ==> 42" do
      a_conversion_of '42'
      should_equal '42'
    end

Run your examples to verify you have not broken anything:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Finished in 0.017112 seconds

2 examples, 0 failures

Congratulations, you’re done refactoring (for now) and the examples are passing. (Are you thinking “time to check in?”)

Here is the whole thing after these steps:

class ShuntingYardAlgorithm
  def convert(expression)
    expression
  end
end

describe "Shunting Yard Algorithm" do
  before(:each) do
    @algorithm = ShuntingYardAlgorithm.new
  end

  def a_conversion_of expression
    @expression = expression
  end

  def should_equal expected
    result = @algorithm.convert @expression
    result.should == expected
  end

  describe "Basic Algorithm Usage" do
    it "should should convert '' to ''" do
      a_conversion_of ''
      should_equal ''
    end
  end

  describe "Constants" do
    it "should convert a single constant to itself, e.g., 42 ==> 42" do
      a_conversion_of '42'
      should_equal '42'
    end
  end
end

Finally, now is a great time to check in your work with a comment like “Refactored both tests to remove duplication.”

Macintosh-7% git add shunting_yard_algorithm_spec.rb 
Macintosh-7% git commit
Created commit 9f68a9b: Refactored both tests to remove duplication.
 1 files changed, 34 insertions(+), 21 deletions(-)
 rewrite shunting_yard_algorithm_spec.rb (80%)

Now is a great time to kick back, listen to some tunes, get a glass of ice tea and add BDD to your resume.

Example: An Actual Operator

Now seems like a good time to create a complete expression. This example represents a “happy path”, or a path through the system that generates a good result. At some point you’ll need to consider negative examples so you can define how an invalid use of the system behaves (gracefully somehow, hopefully).

Since this is an example of processing basic operators, which is different from the other contexts, you should create a new context

Create an example for addition in its own context (add this just before thelast end in the file):

  describe "Binary Operators" do
    it "should convert 5 + 3 ==> 5 3 +" do
      a_conversion_of '5 + 3'
      should_equal '5 3 +'
    end
  end

Run your tests to see how it fails (not showing full output):

'Shunting Yard Algorithm Binary Operators should convert 5 + 3 ==> 5 3 +' FAILED
expected: "5 3 +",
     got: "5 + 3" (using ==)
./shunting_yard_algorithm_spec.rb:18:in `should_equal'
./shunting_yard_algorithm_spec.rb:38:

Finished in 0.021162 seconds

3 examples, 1 failure

This result makes sense, your method returns the value passed, so now you’ll actually have to write some code to do some work. However, you need to keep existing examples working at the same time. For a simple first version, how about splitting the expression into its parts and then you’ll simply write the operator last:

class ShuntingYardAlgorithm
  def convert(expression)
    @result = ''

    expression.split(' ').each { |t|
      if t =~ /^\d+$/
        @result << ' ' << t
      else
        @operator = t
      end
    }

    if @operator != nil
      @result << ' ' << @operator
    end

    @result
  end
end

This is a big jump from nothing. This indicates that maybe the test that does too much more more than what was already there, or maybe you missed a simpler opportunity.

Of course, this doesn’t work.

Running your examples, you’ll notice the following errors:

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42 (FAILED - 1)

Shunting Yard Algorithm Binary Operators
- should convert 5 + 3 ==> 5 3 + (FAILED - 2)

1)
'Shunting Yard Algorithm Constants should convert a single constant to itself, e.g., 42 ==> 42' FAILED
expected: "42",
     got: " 42" (using ==)
./shunting_yard_algorithm_spec.rb:32:in `should_equal'
./shunting_yard_algorithm_spec.rb:45:

2)
'Shunting Yard Algorithm Binary Operators should convert 5 + 3 ==> 5 3 +' FAILED
expected: "5 3 +",
     got: " 5 3 +" (using ==)
./shunting_yard_algorithm_spec.rb:32:in `should_equal'
./shunting_yard_algorithm_spec.rb:52:

Finished in 0.022164 seconds

3 examples, 2 failures

Not only did this not fix the problem, it broke an existing example. You have two choices, try again, or fix it quickly. In this case, you can change one thing to actually fix both tests:

      if t =~ /^\d+$/
        @result << ' ' if @result.length > 0
        @result << t
      else

Run your tests and you’ll notice all tests are passing.

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Shunting Yard Algorithm Binary Operators
- should convert 5 + 3 ==> 5 3 +

Finished in 0.020394 seconds

3 examples, 0 failures

Now is a good time to check in your work because you need to do some serious cleanup on this code. It is very ugly, unruly and generally messy. However, all the tests pass. It’d be a shame if you forgot to check in and then made a mistake.

Check in your code. Really!

Macintosh-7% git add shunting_yard_algorithm_spec.rb 
Macintosh-7% git commit
Created commit 4fe1b71: Added support for first math expression. Preparing for refactoring of solution.
 1 files changed, 25 insertions(+), 3 deletions(-)

Refactoring Your Mess

You have three lines that write to the @result instance variable. You can quickly fix this by adding a method to add to the result:

  def add_to_result(token)
    @result << ' ' if @result.length > 0 
    @result << token
  end

Add this method and make sure your tests still pass before you actually use this method. (When refactoring, generally, add first.)
Once your tests are passing, update the method to the following:

  def convert(expression)
    @result = ''

    expression.split(' ').each { |t|
      if t =~ /^\d+$/
        add_to_result t 
      else
        @operator = t
      end
    }

    if @operator != nil
      add_to_result @operator 
    end

    @result
  end

Verify your tests still pass.

The first “if” is obtuse. You can write this so someone can understand its meaning without having to read regular expressions (personally, I like regular expressions, but I also use a vi plugin in Eclipse, so you can imagine that I’m not quite wired right).

Add the following method (again, adding before changing existing code):

  def is_number(token)
    token =~ /^\d+$/
  end

Run your examples to make sure nothing has broke.
Now, update the original “if” statement to use this method:

      if is_number t 
        add_to_result t 
      else
   ...

The previous two refactorings are examples of “Extract Method” as described in Refatoring by Fowler. This is a bread-n-butter refactoring. You should use this refactoring freely. Do not concern yourself with method invocation overhead. It is not going to be a problem in practice.

In addition, taking even simple, but certainly complex conditions and putting them into a method can make your code much easier to read and maintain. This is one of those refactorings you should consider nearly all the time.

Use a ruby idiom to improve adding the operator at the end:
From:

    if @operator != nil
      add_to_result(@operator)
    end

To:

    add_to_result @operator if @operator != nil

Run your examples to make sure nothing is broken.
Extract the token processing into its own method (extract method again) by first adding a new method:

  def process(token)
    if is_number token 
      add_to_result token 
    else
      @operator = token
    end
  end

Run your tests to make sure you did not break anything
Update the original code to use this new method:

  def convert(expression)
    @result = ''

    expression.split(' ').each { |t| process t  }

    add_to_result @operator if @operator != nil

    @result
  end

Run your tests and make sure things still work.
Since you are back to everything working, check your work in. Go ahead and do it, the tutorial will be here when you get back.

Macintosh-7% git commit -a
Created commit bfd7d90: Cleaned up the code.
 1 files changed, 20 insertions(+), 12 deletions(-)

Summary

Here’s another important observation about refactoring. Consider extract method. In general, you should use the following steps:

Create a new method bycopying the original code
Make sure the code compiles and the examples pass
Use the new method in the original code
Make sure the code compiles and the examples pass

In general, when refactoring use create by “copying, verify, update, verify” instead of move. It’s less error prone. Yes you will appear to move slower, but often the way to speed up is by slowing down. If you have effective refactoring tools, then extract method becomes:

Select a block of code
Refactor it by using a shortcut key - you may have been tempted to use the mouse, stop it. Mouse bad.

Now is a great time to take a break. You’ve made good progress. You’ve practiced another round of BDD, added some solid refactoring skills and refined your continuous practice.

Example: Two Operators, Same Precedence

You handled one operator pretty easily. What happen when you put in two operators of the same precedence? You want to nudge your production code gradually. Going from 1 to more than 1 is often a good nudge (though it can be a big nudge at times). However, does this mean using the same operator two times? If you do that, it might make the results ambiguous. So this example will use two different operators to make the results unambiguous.

Add the following example to the “Binary Operators” context:

    it "should convert 1 + 3 - 4 ==> 1 3 + 4 -" do
      a_conversion_of '1 + 3 - 4'
      should_equal '1 3 + 4 -'
    end

Run your examples and notice that the results are close but not quite correct. The current implementation drops the +. To fix this, the code needs to write the current operator, if there is one, when it encounters the second operator:

  def process(token)
    if is_number token 
      add_to_result token 
    else
      add_to_result @operator if @operator != nil
      @operator = token
    end
  end

By simply adding the current operator if it is non-null, your code should now be able to handle an expression of any length so long as the operators are of the same precedence. Of course, you have some code duplication. Run your tests to make sure they pass.

Now is a great time to check your work it because you are about to refactor. Go ahead, it should only take a second.

Macintosh-7% git commit -a
Created commit 2524ac9: Added support for multiple operators of same precedence.
 1 files changed, 6 insertions(+), 0 deletions(-)

Create a new method that writes the current operator if it is not null:

  def add_operator_if_necessary
    add_to_result @operator  if @operator != nil
  end

Verify that your test still pass (they should, you’ve only added a method you have not changed existing code).
Update the process method to use this new method and run your tests.
Update the convert method to use this new method and run your tests.

Tests are passing after refactoring your code:

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Shunting Yard Algorithm Binary Operators
- should convert 5 + 3 ==> 5 3 +
- should convert 1 + 3 - 4 ==> 1 3 + 4 -

Finished in 0.020943 seconds

4 examples, 0 failures

Congratulations, you’ve successfully removed duplication and kept your tests passing. You’ve now practiced enough refactoring to add it to your resume. After all, you’re practicing RDD - resume-driven-development.

Now is a great time to check in your work.

    Macintosh-7% git commit -a
    Created commit d6bf591: Refactored duplicated code.
     1 files changed, 7 insertions(+), 3 deletions(-)

Example: Two Operators of Different Precedence

The next nudge to your production code is adding the idea of precedence. You won’t implement all of the rules yet, just a start. The first example will simply add a new operator, *, which, because of its higher precedence, results in something close to your previous test.Then the next example will change the order of the operators, with * after +, forcing a more serious change to your implementation.

Rather than giving you the source for the tests, I’ll use a neat feature that allows me to draw some LaTex style formulas.

Create the following example under the “Binary Operators” context: $1\ *\ 3\ +\ 2\ \ \rightarrow \ \ 1\ 3\ *\ 2\ +$
Run your example. What happens?

Notice that this test passed as is. This means one of the following things:

The example does not exercise anything new and it can probably be removed
You wrote too much production code and happened to get it right
You were not sure about an “edge” condition and this example verifies that the edge condition is handled (or simply does not exist)

In this case, I gave you a poor example. Maybe I though we were adding support for a new operator, but the original code defaulted to thinking something was an operator if it wasn’t a number. The result of this particular test does not force any changes since the code interprets * as an operator and the algorithm writes the stored operator all the time. So you need to change the example.

Here’s something a very different test from the first:

\[1\ +\ 3\ *\ 2\ \ \rightarrow \ \ 1\ 3\ 2\ *\ +\]

Notice that if you follow traditional precedence rules, multiplication happens before addition. So by writing an example with addition before multiplication, it will force your production code to hold on to the + operator longer. Did you notice that whereas the previous results contained an operator embedded within the numbers, now both operators are at the end, with the first one ending up last (very different indeed).

Write this as an example (replace the previous example with this one and add it under the “binary Operators” context):

    it "should put higher precedence operators before lower ones" do
      a_conversion_of '1 + 3 * 2'
      should_equal '1 3 2 * +'
    end

Run this test and you’ll notice the following failure:

1)
'Shunting Yard Algorithm Binary Operators should put higher precedence operators before lower ones' FAILED
expected: "1 3 2 * +",
     got: "1 3 + 2 *" (using ==)
./shunting_yard_algorithm_spec.rb:46:in `should_equal'
./shunting_yard_algorithm_spec.rb:76:

Finished in 0.022302 seconds

5 examples, 1 failure

Detour: Refactoring Before Moving Forward

Rather than always writing a non-nil operator, your code needs to check something. However, there are cases where your code can just write and some where it must hold onto two operators before writing any. To support both cases, you’ll want to change from storing a single operator to storing more than one. This requires a refactoring and you should do this while examples are passing.

Often, you’ll look at a solution, realize that its design is at the end of its life-cycle and it needs significant retooling. How do you know you have changed the design without breaking any assumptions? You keep the example passing! So you’ll get the current failing test out of the way for right now:

Change the example you just added into a “pending” state:

    it "should put higher precedence operators before lower ones" do
      pending 'on hold for refactoring'
      a_conversion_of '1 + 3 * 2'
      should_equal '1 3 2 * +'
    end

Next, you need to create an array to hold the operators. This is a refactoring so you are going toadd code first and then update it. My colleague Bob K calls this “Parallel Development”.
Update the convert method to initialize an array @operators:

  def convert(expression)
    @result = ''
    @operators = []
    ...

Run your examples, they should all still pass.
Update the process method to duplicate the work of storing the operator:

  def process(token)
    if is_number token 
      add_to_result token 
    else
      add_operator_if_necessary
      @operator = token
      @operators << token
    end
  end

Run your examples, they should all still pass (other than the pending one, of course).
Next, update the add_operator_if_necessary method to use the new instance variable:

  def add_operator_if_necessary
    add_to_result @operators.pop if @operators.length > 0
  end

Run your examples and notice that nothing is broken. You’ve moved from “add” mode to “update” mode.
Finally, remove all references to @operator (there should be just one in @process):

  def process(token)
    if is_number token 
      add_to_result token 
    else
      add_operator_if_necessary
      @operators << token
    end
  end

Run your examples and verify that nothing is broken.
Now is a great time to check in your work. So go ahead and do it. It’s been too long and you probably had an itchy checkin finger.

With that refactoring, you can now store multiple operators if necessary. So this diversion is over; time get back to the new example.

End Construction: Back to new development

Now that your code has a mechanism in place that allows storage of more than one operator, you need to undo some work that I asked you to do.

Remove the “pending” line from your most recent example:

    it "should put higher precedence operators before lower ones" do
      a_conversion_of '1 + 3 * 2'
      should_equal '1 3 2 * +'
    end

Run your example and you should be back to one broken example.

Your code has a slight problem; it needs to know the current operator in add_operator_if_necessary to know if it should add the token to the array or directly store it into the result. However, if the code requires the current operator, you can no longer use it in the convert algorithm. For now you will change the process method and leave that method alone - this happens often. You combined two lines of code because they were the same. Now they need to diverge, so you’ll allow them to. Along the way, you’ll change some method names as appropriate to better reflect their intent.

To fix this, you’ll write out higher-precedence operators in the array:

Here’s a first attempt:

  def process(token)
    if is_number token 
      add_to_result token 
    else
      if token == '*'
        add_to_result token 
      else
        @operators << token
      end
    end
  end

However, this does not fix anything and in fact breaks another test. If you have been checking in, now would be a great time to revert back to the previous version. If not, why didn’t you? This is a great example of allowing a tool to extend your functionality. You can probably undo your work and you have examples to verify that you did so. However, how many undos are necessary, versus one command using a reasonable revision control tool?

Let’s try that again. This time, you’ll try updating the process method a little bit differently:

  def process(token)
    if is_number token 
      add_to_result token 
    else
      if @operators.length > 0 &&
        precedence_of(token) <= precedence_of(@operators.last)
        add_operator_if_necessary
      end
      @operators << token
    end
  end

  def precedence_of(operator)
    case operator
      when '*': 10
      else 1
    end
  end

  def add_operator_if_necessary
    add_to_result @operators.pop while @operators.length > 0
  end

Run your examples and you notice that everything passes.

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Constants
- should convert a single constant to itself, e.g., 42 ==> 42

Shunting Yard Algorithm Binary Operators
- should convert 5 + 3 ==> 5 3 +
- should convert 1 + 3 - 4 ==> 1 3 + 4 -
- should put higher precedence operators before lower ones

Finished in 0.021718 seconds

5 examples, 0 failures

You have passing tests, but ugly code. Now is a good time to check in because you’re going to do some refactoring to clean up this code.

Checkin your code. Remember, if you don’t like working this way, you can always stop doing it after this tutorial. This is just practice anyway, so it won’t count and the other developers won’t chastise you for moving their cheese.

Refactoring

Begin by extracting a method to clean up process a bit. Add this method and make sure nothing breaks:

  def add_higher_precedence_operators_to_result(token)
    if @operators.length > 0 && 
      precedence_of(token) <= precedence_of(@operators.last)
      add_operator_if_necessary
    end
  end

Next, replace the original code with a call to this method. Run your tests to make sure things are still working:

  def process(token)
    if is_number token 
      add_to_result token 
    else
      add_higher_precedence_operators_to_result token 
      @operators << token
    end
  end

While you are at it, extract two more methods: handle_number, handle_operator to clean this code up:

  def process(token)
    if is_number token 
      handle_number token 
    else
      handle_operator token 
    end
  end

  def handle_number(number)
      add_to_result number 
  end

  def handle_operator(operator)
      add_higher_precedence_operators_to_result operator 
      @operators << operator
  end

Run your examples, they should be passing.

Check: Did you first add the methods, run your examples then change the original code to use the new methods and run your tests again? If not, why not? Sure, you probably got it right. You won’t always. Make it a habit, maybe you’ll be too busy to bite your fingernails.

Did you notice that the method add_operator_if_necessary was using an “if” and now it is using a “while”? You should rename it to: add_remaining_operators (you need to change three places to get this to work).
Run your examples, they should be passing.
While you are at it, the first two lines of the convert method are really performing initialization. Factor those out into an init method:

  def init
    @result = ''
    @operators = []
  end

Run your examples, make sure nothing broke.
Now update the convert method to use your new init method:

  def convert(expression)
    init

    ...
  end

Run your tests to make sure nothing is broken.
Finally, there is another opportunity to extract a method. Add yet another method:

  def convert_expression(expression)
    expression.split(' ').each { |t| process t  }
    add_remaining_operators
  end

Make sure your examples are still passing.
Update the convert method:

  def convert(expression)
    init
    convert_expression expression 
    @result
  end

After your examples are all passing, check your work in before moving on.

Summary

You’ve just done quite a bit of refactoring and basic cleanup. The description of what you did probably too longer to read than the actual work. Constant cleaning up on the new code you write should just be a normal thing you do. Get into the habit. And seriously, add refactoring to your resume.

Here’s an example of what your file might look like:

class ShuntingYardAlgorithm
  def init
    @result = ''
    @operators = []
  end

  def convert(expression)
    init
    convert_expression expression
    @result
  end

  def convert_expression(expression)
    expression.split(' ').each { |t| process t  }
    add_remaining_operators
  end

  def add_to_result(token)
    @result << ' ' if @result.length > 0
    @result << token
  end

  def is_number(token)
    token =~ /^\d+$/
  end

  def process(token)
    if is_number token
      handle_number token
    else
      handle_operator token
    end
  end

  def handle_number(number)
      add_to_result number
  end

  def handle_operator(operator)
      add_higher_precedence_operators_to_result operator
      @operators << operator
  end

  def add_higher_precedence_operators_to_result(token)
    if @operators.length > 0 &&
      precedence_of(token) <= precedence_of(@operators.last)
      add_remaining_operators
    end
  end

  def precedence_of(operator)
    case operator
      when '*': 10
      else 1
    end
  end

  def add_remaining_operators
    add_to_result @operators.pop while @operators.length > 0
  end
end

describe "Shunting Yard Algorithm" do
  before(:each) do
    @algorithm = ShuntingYardAlgorithm.new
  end

  def a_conversion_of expression
    @expression = expression
  end

  def should_equal expected
    result = @algorithm.convert @expression
    result.should == expected
  end

  describe "Basic Algorithm Usage" do
    it "should should convert '' to ''" do
      a_conversion_of ''
      should_equal ''
    end
  end

  describe "Constants" do
    it "should convert a single constant to itself, e.g., 42 ==> 42" do
      a_conversion_of '42'
      should_equal '42'
    end
  end

  describe "Binary Operators" do
    it "should convert 5 + 3 ==> 5 3 +" do
      a_conversion_of '5 + 3'
      should_equal '5 3 +'
    end

    it "should convert 1 + 3 - 4 ==> 1 3 + 4 -" do
      a_conversion_of '1 + 3 - 4'
      should_equal '1 3 + 4 -'
    end

    it "should put higher precedence operators before lower ones" do
      a_conversion_of '1 + 3 * 2'
      should_equal '1 3 2 * +'
    end
  end
end

Example: More than two operators

So far you’ve handled a single operator and up to two operators and some basic precedence rules. Now you need to make sure you can handle an arbitrary length expression and a few new operators. While not always the case, here’s a hierarchy of difficulty:

Not handling something to handling one such something.
Handling, more than one something (maybe just two) - often adds an “if” somewhere
Handling many somethings - often converts an “if” to a “while” - something I heard Uncle Bob say in one of his TDD tutorials
Create an example for the following (add it to the “Binary Operators” context, maybe it “should handle interleaved operators of different precedence”): $3\ +\ 1\ *\ 4\ -\ 2\ /\ 3\ \ \rightarrow \ \ 3\ 1\ 4\ *\ +\ 2\ 3\ /\ -$
Execute the example to see how your algorithm responds. You should see a failure similar to this:

    ... <snip> ...
    - should handle interleaved operators of different precedence (FAILED - 1)
    
    1)
    'Shunting Yard Algorithm Binary Operators should handle interleaved operators of different precedence' FAILED
    expected: "3 1 4 * + 2 3 / -",
         got: "3 1 4 * + 2 - 3 /" (using ==)
    ./shunting_yard_algorithm_spec.rb:74:in `should_equal'
    ./shunting_yard_algorithm_spec.rb:109:
    
    Finished in 0.023779 seconds
    
    6 examples, 1 failure

A quick review of how the code determines operator precedence suggests adding ‘/’ into the mix:

  def precedence_of(operator)
    case operator
      when '*': 10
      when '/': 10
      else 1
    end
  end

Run your examples, and now they pass (maybe a bit too easy).
Check in your code since you are back to passing tests

Did you consider that your code really didn’t do very much to handle this case? It is possible that you wrote too much production code? The add_remaining_operators uses a while instead of an if. If it only had an if, it would only write at most one operator. Because it uses a while, it writes several. So you probably overdid it a bit (or rather you trusted me and I led you astray). How could you check this? Change the while to an if and see if any of the examples fail. You might have just gotten lucky. But the only way you’ll know for sure is if you experiment. Change something, run your tests. If none of them fail, then your examples are not covering all that your code does.

However, it looks like you’ve got a decent general solution. You’ll need to register operators you care about in the precedence_of method, which is a violation of the open/closed principle, which is covered in detail in a later tutorial. Before you can call this algorithm finished, however, you need to address several more things:

Handling variables, e.g., a + 5
Handling ( )’s
Function calls and nested function calls, e.g., f(g(5))

Example: Handling Variables

It is time to revisit an earlier test, only this time you’ll use variables: $a\ +\ b\ \ \rightarrow \ \ a\ b\ +$

Rename the “Constants” context to “Operands”
Create this as an example under the newly renamed “Operands” context and see how it fails.
You should see something similar to:

1)
'Shunting Yard Algorithm Operands should handle variables as well as constants' FAILED
expected: "a b +",
     got: "a + b" (using ==)
./shunting_yard_algorithm_spec.rb:75:in `should_equal'
./shunting_yard_algorithm_spec.rb:93:

Finished in 0.02464 seconds

7 examples, 1 failure

A quick review of the process_expression method shows that the code looks for numbers first. If a token is not a number then it is an operator. By default, unlisted operators have the same precedence, so nothing happens. The code needs to change. You can make a quick change by changing the is_number method.

Rename is_number to is_operand.
Verify that your tests still fail as expected
Change the is_operand method to include a pattern for variables:

  def is_operand(token)
    token =~ /^(\d+|[a-zA-Z0-9$_]+)$/
  end

So something is an operand if it is a sequence of digits of length 1 or more or a sequence of one or more letters, numbers, $ and _.

Run your examples, things should be passing.

Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb

Shunting Yard Algorithm

Shunting Yard Algorithm Basic Algorithm Usage
- should should convert '' to ''

Shunting Yard Algorithm Operands
- should convert a single constant to itself, e.g., 42 ==> 42
- should handle variables as well as constants

Shunting Yard Algorithm Binary Operators
- should convert 5 + 3 ==> 5 3 +
- should convert 1 + 3 - 4 ==> 1 3 + 4 -
- should put higher precedence operators before lower ones
- should handle interleaved operators of different precedence

Finished in 0.02318 seconds

7 examples, 0 failures

Check in your code since you are back to passing examples.

Example: Handling ( )

In this section you’ll start by properly parsing parenthesis. Then you’ll make sure that ( )’s work with simple expressions and finally you’ll make sure that ( )’s cause lower precedence operators to happen before higher-precedence operators. Notice that this sets of examples fall nicely under a new context called something like “Handling ( )’s”.

What is Analysis

Check out the definition of http://dictionary.reference.com/browse/analysis:analysis: synopsis: breaking something into its constituent parts. When you are thinking about adding a new feature, learning how to break it down, while requiring practice for most of us, is a valuable skill to practice. Start with the assumption that it can be broken down, because it’s nearly always possible. Typically you’ll pick a partial solution that, by itself, does not complete anything. In fact, that’s what this entire tutorial has been demonstrating. Think about it in terms of questions. If you cannot figure out the whole thing, are there any questions you can answer that move you in the direction? Can you then state that question as a test, or experiment? Once you’ve written the experiment, unlike testing a hypothesis to check your model of reality, youdefine reality by writing production code to match your hypothesis. If you do not like the reality that you have created, change the definition of reality (your tests), and then alter your universe to match.

Example: Removing ( )

Begin by writing a new context called “Handling ( )’s” and adding an example that verifies the removal of ( )’s from an otherwise empty expression: $(\ )\ \ \rightarrow$

Create this context and example:

  describe "Handling ( )'s" do
    it "should remove ( ) from an otherwise empty expression" do
      a_conversion_of '( )'
      should_equal ''
    end
  end

Run your examples and verify that they fail.

... <snip> ...
Shunting Yard Algorithm Handling ( )'s
- should remove ( ) from an otherwise empty expression (FAILED - 1)

1)
'Shunting Yard Algorithm Handling ( )'s should remove ( ) from an otherwise empty expression' FAILED
expected: "",
     got: "( )" (using ==)
./shunting_yard_algorithm_spec.rb:75:in `should_equal'
./shunting_yard_algorithm_spec.rb:122:

Finished in 0.027261 seconds

8 examples, 1 failure

You can fix this and follow the current approach used in the class by making a change to process and adding a few methods:
process

  def process(token)
    if is_paren token 
      handle_paren token  
    elsif is_operand token 
      handle_number token 
    else
      handle_operator token 
    end
  end

new methods

  def is_paren(token)
    token =~ /[\(\)]/
  end

  def handle_paren(token)
  end

Run your examples and notice that everything is now passing:

    Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb
    
    Shunting Yard Algorithm
    
    Shunting Yard Algorithm Basic Algorithm Usage
    - should should convert '' to ''
    
    Shunting Yard Algorithm Operands
    - should convert a single constant to itself, e.g., 42 ==> 42
    - should handle variables as well as constants
    
    Shunting Yard Algorithm Binary Operators
    - should convert 5 + 3 ==> 5 3 +
    - should convert 1 + 3 - 4 ==> 1 3 + 4 -
    - should put higher precedence operators before lower ones
    - should handle interleaved operators of different precedence
    
    Shunting Yard Algorithm Handling ( )'s
    - should remove ( ) from an otherwise empty expression
    
    Finished in 0.026796 seconds
    
    8 examples, 0 failures

Check your work in.

Example: ( ) around expression works

Next, verify that ( )’s around an expression still works. $(\ 4\ *\ a\ )\ \ \rightarrow \ \ 4\ a\ *$

Create this example (it “should simply remove ( ) around an expression”) and see whether it works or not.

Since this appears to work, you might consider whether keeping this example is useful or not. It does not exercise any new code and it does not document some kind of edge condition (it doesn’t exercise any new equivalence classes). Given that this test does now really add any new knowledge, you’ll remove it and instead write a more complex test. Get used to taking some blind alleys, the blinder the alley, the more illuminating, ultimately, it will be.

Example: ( ) around expression causes lower precedence operator to happen first

Here is a key test that should exercise something new: $(\ 4\ +\ 5\ )\ *\ 3\ \rightarrow \ 4\ 5\ +\ 3\ *$ This demonstrates the whole purpose of ( ), change the natural precedence rules.

Remove the previous example and add this example, verify that it fails.
Here is one way to make it “work”:

handle_paren

      def handle_paren(token)
        if token == '('
          @operators << token
        else
          if @operators.last != '('
            add_to_result @operators.pop  
          end 
          @operators.pop
        end
      end

precedence_of

      def precedence_of(operator)
        case operator
          when '(': -1
          when ')': -1
          when '*': 10
          when '/': 10
          else 1
        end
      end

While this does work, it seems strange to define the precedence of ( )’, which make things happen sooner, as -1 - or the lowest thing so far.

Make another change to improve this somewhat: Update precedence_of to have larger numbers for ( and ):

      def precedence_of(operator)
        case operator
          when '(': 99
          when ')': 99
          when '*': 10
          when '/': 10
          else 1
        end
      end

Update add_remaining_operators to stop at ‘(‘:

      def add_remaining_operators
        add_to_result @operators.pop while 
            @operators.length > 0 && @operators.last != '('
      end

Make these changes, verify your tests pass.
Check your work in.

With all tests passing, you can perform some minor plastic surgery. The @operators.length > 0 and @operators.last represent a “logical top” of the operator stack. So change the code to make it more self-describing.

First, extract a method:

      def under_logical_top
        @operators.length > 0 && @operators.last != '('
      end

Verify that your tests still pass.
Now, make one more update:

  def add_remaining_operators
    add_to_result @operators.pop while under_logical_top
  end

Run your tests and verify everything still passes.
Check in your code.
Next, the method handle_paren does not follow a standard ruby idiom:

  def handle_paren(token)
    ...
    else
      if @operators.last != '('
        add_to_result(@operators.pop)
      end
      @operators.pop
    end
  end

Replace the code in the else with the following:

      add_to_result(@operators.pop) while @operators.last != '('
      @operators.pop

Run your tests, make sure everything is passing.
Check in your code.

Example: Nested ( )’s

Moving along, you’ll now have a look at supporting nested ( ). Here’s a test to give it try: $(\ (\ 1\ +\ 3\ )\ /\ (\ 9\ -\ 5\ )\ )\ *\ (\ 2\ +\ 3\ )\ \ \rightarrow\ \ 1\ 3\ +\ 9\ 5\ -\ /\ 2\ 3\ +\ *$

Create this test and see what happens.

Surprisingly, this example passes. This test does not exercise any new code. However, unlike the last example, this does demonstrate some new functionality that your code happens to handle correctly, nested ( )’s. So I recommend keeping this test around. This may seem like an arbitrary decision. It may be an incorrect decision. Here’s an arbitrary statistic: 80% of all users never remove features from menus or tool bars. Here’s another, 90% of all people don’t think that the previous statistic applies to them. You’re a pack-rat, admit it. You might find yourself wanting to write tests “just in case” and never get rid of them. That’s OK; when it starts to cause a problem - and it will - you’ll suffer and learn for yourself why you want to have as many tests as necessary, but no more.

Summary

You’ve successfully added support for ( )’s.

You started with a simple case, removing ( ). That gave your code basic support for ( ) so you could focus on the next small thing.
You then worked your way up to making sure ( )’s caused changed to precedence, as they are meant to do.
The solution you developed seems to work for nested ( )’s so it did not take much to make it work.

Has it been some time since you took a break? Blinked your eyes? Go outside, play some laser tag. The tutorial will be here when you get back.

Example: Functions

Now you’re going to add support for handling function calls. Functions are difficult because an operand might be a variable or a function, it depends on the next thing (or at least it does without some kind of context).

So your code either needs to look ahead when processing a token that is an operand or look behind. Looking ahead is hard because of how your code processes expressions; iterating over tokens using the each message, which doesn’t have look-ahead support. Looking back is possible, but your code directly builds a string, so it’d have to parse the string it just created.

Neither way is necessarily better. However, looking back can be made easy with a refactoring that might sound much worse than it actually is. Currently the code builds a string as it goes along, mixing content and presentation. Instead, you’ll change the code to store the content in an array. At the end, you’ll then build the presentation and return it. By separating content from presentation, looking back will be easier.

How do you get started? In a situation like this, you’ll probably experience something similar to one of the following:

Just before getting started with a new test, you realize the feature will be hard and requires a refactoring (this is what happened to me when I did this the first time; I had taken a break and the change you’ll be doing is pretty much exactly what I ended up doing)
After writing a test and having a go, you find something just isn’t right. Since you check in often, you blow away your work and get a do-over.

Just to be sure, you’ll start with an example.

Example: Basic Function Call

Here’s a simple test that describes what you want to have happen: $f\ (\ 3\ )\ \ \rightarrow\ \ 3\ f$

Create a new context and example, see that it in fact fails:

    describe "Handling function invocations" do
      it "should put the function name after the ( )'s" do
        a_conversion_of 'f ( 3 )'
        should_equal '3 f'
      end
    end

As expected, this example fails. To support this example you are going to separate the generation of the output from the formatting of the output. Let’s get back to all tests passing first.

Add: pending ‘Requires refactoring of @result’ as the first line of your example.
Verify your examples “pass”:

    <snip>
    Shunting Yard Algorithm Handling function invocations
    - should put the function name after the ( )'s (PENDING: Requires refactoring of @result)
    
    Pending:
    Shunting Yard Algorithm Handling function invocations should put the function name after the ( )'s 
    (Requires refactoring of @result)
    
    Finished in 0.031281 seconds
    
    11 examples, 0 failures, 1 pending

Refactoring

You are going to refactor your solution in support of this new example case. Remember, refactoring means changing the structure without changing the behavior. In your case, you have a pretty clear definition of behavior…your examples. So keep your examples passing. Also, as mentioned earlier in the tutorial, the approach will be to add and then remove rather than replace.

Add a new line to the init method:

  def init
    @result = ''
    @newResult = []
    @operators = []
  end

Next, you need to find all the places where the code adds to @result and duplicate the work (minus the formatting). Luckily, the code does not violate the DRY principle, so there’s only one place. It’s in the add_to_result method:

  def add_to_result(token)
    @result << ' ' if @result.length > 0
    @result << token
  end

The first line is about formatting, the second line is about content. So you simply need to add one more line anywhere in the method (for now, add it to the end):

    @newResult << token

Run your examples to make sure nothing is broken.

Now you are going to do something that happens a bit. You’re going to move to an intermediate result that produces somewhat ugly code. Not to worry, the code won’t stay this way long.

Add the following method:

  def produce_result
    @result = ''
    @newResult.each{ |t| 
      @result << ' ' if @result.length > 0
      @result << t
    }
    @result
  end

Run your examples, everything should still pass.
Next, update the convert method to return that instead:

  def convert(expression)
    init
    convert_expression(expression)
    produce_result
  end

Run your tests, things should still work.

Notice that the code no longer needs to produce the intermediate result, so now it is safe to make a few changes together: Remove initialization of @result in the init method:

  def init
    @newResult = []
    @operators = []
  end

Remove writing to @result in the add_to_result method:

  def add_to_result(token)
    @newResult << token
  end

Run your examples, things should be passing.
Now, make a quick update to produce_result:

  def produce_result
    result = ''
    @newResult.each{ |t|
      result << ' ' if result.length > 0
      result << t
    }
    result
  end

Run your examples, things should still be working.
Next, @newResult is a poor name. This represents output tokens, so rename it from @newResult -> @outputTokens.
Run your example, things should still be working.

Back to Ruby-isms

My colleague, Dean Whampler, reviewed an early version of this tutorial and pointed out that I had created something much more complex than necessary. In retrospect, given that in a previous life I used Smalltalk, I don’t have much of an excuse for missing this. However, it’s so much better, that I felt the need to fess up.

Replace the implementation of produce_result:

  def produce_result
    @outputTokens.join(' ')
  end

Run your examples, verify everything is passing.
Check in your work.

Refactoring Summary

You just changed the implementation without breaking any tests, congratulations. Having done this, you have a collection of tokens to work with rather than looking at a string to figure out if a left parenthesis represents a function call or groups an expression.

Back to supporting Functions

You disabled the method for testing a function call to allow refactoring while keeping all tests passing. Now that the refactoring is complete, it is time to reintroduce that test.

Delete the “pending” line from your most recent example.
Run your examples, make sure you have one failing example.

Throughout this exercise, you’ve been referring to the published algorithm on wikipedia. The algorithm mentions what to do when processing a right-parenthesis. Look at the operand stack and if it’s a function, then put in the output. There’s more than this, but you are not processing multiple parameters just yet.

The algorithm also mentions what to do if the token is a function name. Your code won’t know if a token is a constant or a function until the code hits a left parenthesisand look at the last element added to outputTokens. So that’s where to start.

Update handle_paren (which is getting pretty long and unruly about now):

  def handle_paren(token)
    if token == '('
      if last_result_pushed_is_function_name
        @operators << @outputTokens.pop
      end
      @operators << token
    else
      add_to_result @operators.pop while @operators.last != '('
      @operators.pop
    end
  end

The definition for last_result_pushed_is_function_name is straightforward, but it does introduce duplication:

  def last_result_pushed_is_function_name
    @outputTokens.last =~ /^[a-zA-Z0-9$_]+$/
  end

Make these changes and verify that all of your examples pass.
Check in your results.

Time to refactor

Here are a few observations:

The method handle_paren violates the SRP, it has multiple reasons to change, exists at different levels of abstraction and is generally getting harder to read. A secret to making code readable is to make it unnecessary to read it in the first place.
The regular expression for checking that something matches a name (variable or function) is duplicated. There’s more, but this is what you’ll tackle for now.
“Extract method” handling left parenthesis from handle_paren: Add an extracted method for the if part of the block:

  def process_left_paren
    if last_result_pushed_is_function_name
      @operators << @outputTokens.pop
    end
    @operators << '('
  end

Verify your tests still pass before moving on.
Change the top part of the if in handle_paren to use the new method:

  def handle_paren(token)
    if token == '('
      process_left_paren
    else
      add_to_result(@operators.pop) while @operators.last != '('
      @operators.pop
    end
  end

Verify your examples still pass. When they do, consider checking in your work.
Same thing, bottom part of the if: Create an extracted method for handling the right parenthesis:

  def process_right_paren
    add_to_result @operators.pop  while @operators.last != '('
    @operators.pop
  end

Add this method, make sure your examples still pass.
Use this method in the handle_paren method:

  def handle_paren(token)
    if token == '('
      process_left_paren
    else
      process_right_paren
    end
  end

Make sure your examples pass
Check in your work.

Now remove the duplication of regular expressions used for handling names. There are at least two options: put the part of the expression in a constant or write a function. I don’t have a good reason to select one over the other so I’m going to use a function.

Create a new method:

  def is_name(str)
    str =~ /^[a-zA-Z0-9$_]+$/
  end

Add this method, make sure all of your examples still pass.
Update last_result_pushed_is_function_name

  def last_result_pushed_is_function_name
    is_name @outputTokens.last
  end

Update this method, make sure all your examples still pass.

  def is_operand(token)
    token =~ /^\d+$/ || is_name(token)
  end

There’s an asymmetry in handling numbers versus names, so add the following method:

  def is_number(token)
    token =~ /^\d+$/ 
  end

Add this method, make sure all of your examples still pass.
Finally, update the is_operand method one final time:

  def is_operand(token)
    is_number(token) || is_name(token)
  end

Run your examples, makes sure they all pass.
Check in your code.

Did this last change seem silly or over the top? Consider this, the first part of the logical expression uses a regular expression, the second part did not, it called a function. You wanted to use a function on the second part to have a single definition of what is a name. Doing that forced a change in is_operand that made its implementation exist at different levels of abstraction. This simple change is a classic refactoring and leads to clean code. That is, code that someone else might have a chance to read and understand.

Example: Multiple parameters to a function

Now it’s time to add multiple parameters to a function. Here is one example: $f\ (\ 4\ ,\ 1\ ,\ a\ ,\ d\ )\ \ \rightarrow \ \ 4\ 1\ a\ d\ f$

Add an example under the “Handling function invocations” context and see what happens.
Since the , is interpreted as an operator, the results are not quite what you hoped:

    1)
    'Shunting Yard Algorithm Handling function invocations should put multiple parameters separated by
     ,'s in order first' FAILED
    expected: "4 1 a d f",
         got: "4 1 , a , d , f" (using ==)
    ./shunting_yard_algorithm_spec.rb:122:in `should_equal'
    ./shunting_yard_algorithm_spec.rb:191:
    
    Finished in 0.033487 seconds
    
    12 examples, 1 failure

The Shunting Yard Algorithm has a top-level clause for function parameter separator, so this suggests a change back in the process method: Update process:

  def process(token)
    if is_paren token
      handle_paren token
    elsif is_function_argument_separator token
      handle_argument_separator
    elsif is_operand token
      handle_number token
    else
      handle_operator token
    end
  end

Add method definitions for the new new methods:

  def is_function_argument_separator(token)
    token == ','
  end
  
  def handle_argument_separator
    add_to_result(@operators.pop) while @operators.last != '('
  end

Verify that your examples pass.
Check in your code.

An astute observer will notice that the body of handle_argument_separator does the same thing as the first line of process_right_paren. This both violates DRY and, possibly worse, makes it necessary to read the code to understand what it does! You can fix this by factoring this out into a method like:

  def record_operators_to_matching_paren
    add_to_result @operators.pop  while @operators.last != '('
  end

Update handle_argument_separator and process_right_paren to use this method:

    def handle_argument_separator
      record_operators_to_matching_paren
    end
  
    def process_right_paren
      record_operators_to_matching_paren
      @operators.pop
    end

Make sure your examples are passing.
Check in your work.

Example: What About Something Complex?

Here’s an example to see if something a bit more complex works with what you’ve written so far:

    describe "Big Examples" do
      it "should handle a large example with several levels of nesting" do
        a_conversion_of 'f ( g ( ( 1 + 3 ) * 4 ) / x ( y ( z ) ) )'
       should_equal  '1 3 + 4 * g z y x / f'
      end
    end

Running your examples should resemble:

    1)
    'Shunting Yard Algorithm Big Examples should handle a large example with several levels of nesting' FAILED
    expected: "1 3 + 4 * g z y x / f",
         got: "1 3 + 4 * z y x / g f" (using ==)
    ./shunting_yard_algorithm_spec.rb:136:in `should_equal'
    ./shunting_yard_algorithm_spec.rb:212:
    
    Finished in 0.03769 seconds
    
    13 examples, 1 failure

The problem is that when balancing ) with (, there’s the chance that it’s being done because of an expression or a function call. You can fix this:

    def process_right_paren
      record_operators_to_matching_paren
      @operators.pop
      add_to_result @operators.pop if is_name @operators.last
    end

Make this change and verify your examples all pass.
Check in your work.

A Quick Refactoring

Some time ago, you added a method called under_logical_top. You can use that method in places where the code checks for a ‘(‘. A quick search of the code reveals just one,:

    def record_operators_to_matching_paren
      add_to_result @operators.pop while @operators.last != '('
    end

Change this to use under_logical_top:

    def record_operators_to_matching_paren
      add_to_result @operators.pop while under_logical_top
    end

Make this change, run your examples and check in when your tests pass

This refactoring is something that comes as a result of noticing code duplication. Duplication is going to happen, when you notice it, remove it. This is one of the ways in which pair programming can make a big difference. The co-pilot will generally be in a better position to notice structural things because of their point of observation.

Along those lines, there’s three places where your code calls @operators «. Two times in process_left_paren and handle_operator. Logically these places are pushing something to be handled later. Your code can better document intent by handling that in a function:

    def record_operator(operator)
      @operators << operator
    end

Add this method, and make sure your tests pass.
Make the following updates:

    def process_left_paren
      if last_result_pushed_is_function_name
        record_operator @outputTokens.pop
      end
      record_operator '('
    end
  
    def handle_operator(operator)
      add_higher_precedence_operators_to_result operator
      record_operator operator
    end

Make these changes, verify all of your examples still pass.
Check in your changes.

Another random check of the code reveals that there are four places that call @operator.pop. While this may not seem like duplication it is. It forces knowledge of the implementation of the operator stack into four places in the code. You’ll change this and see another ruby idiom:

Add the following method:

    def last_operator!
      @operators.pop
    end

Add this method and make sure your examples still run.
Find replace all occurrences of @operators.pop with last_operator! (excluding the method you just added).
Make sure your examples still pass.
Take a look, there are two methods that have the same (or a very similar) implementation. Where is it?
Check in your changes.

???? should I also replace @operators.last with something like last_operator ????

There are several other methods than change your underlying object. In that spirit, you should change their names to include ! at the end. Those methods include:

convert
init
process_left_parent
record_operator
add_to_result This may seem like a bit of work, but consider this: The class is not thread safe. If the convert method were instead called convert!, it might better document that fact. Given that this is an exercise, this recommendation is left to your discretion.

Example: Operator Associativity

Next, your code needs to address operator associativity. For example, $4 + 5 - 6$ produces $4 5 + 6 -$ because + and - are left associative but otherwise at the same precedence. However, a = b += 5 produces a b 5 += =. First, b is incremented by 5 and then a equals that result. If these operators were left associative, the result would instead be: a b = 5 += (it’s even worse, because the result of = would the an lvalue of the rvalue - whew!), so a would equal bbefore it was incremented by 5. And, as mentioned, the return value of = would be the thing on the right instead of the left to make the thing work in the first place.

That gives a great test: $a\ =\ b\ += 5\ \ \rightarrow\ \ a\ b\ 5\ +=\$

Create a new Context and example:

    describe "Operator Associativity" do
      it "Should handle right-associative operators" do
        a_conversion_of 'a = b += 5'
        should_equal 'a b 5 += ='
      end
    end

It appears that the calculation is processed incorrectly because these operators are treated as the same precedence and left to right associative:

    Shunting Yard Algorithm Operator Associativity
    - Should handle right-associative operators (FAILED - 1)
    
    1)
    'Shunting Yard Algorithm Operator Associativity Should handle right-associative operators' FAILED
    expected: "a b 5 += =",
         got: "a b = 5 +=" (using ==)
    ./shunting_yard_algorithm_spec.rb:149:in `should_equal'
    ./shunting_yard_algorithm_spec.rb:232:
    
    Finished in 0.05 seconds
    
    14 examples, 1 failure

This is in fact what is happening because the process method has three checks, is_paren, is_function_argument_separator, is_operand, none of which match, so by default, += and = are treated as operators. A review of the shunting algorithm says this about such operators (paraphrased):

While the token is right-associative and its precedence is less than the last operator pushed, add the last operator pushed to the output. - Notice, it’s stuff like this in the algorithm that makes writing tests just about necessary to make sure you wrote it correctly!

Since your code handles this logic in add_higher_precedence_operators_to_result, that’s the place to change. However, its name will be a bit off.

Try the following change to see if it fixes the broken test and does not break other tests:

    def add_higher_precedence_operators_to_result(token)
      if @operators.length > 0
        p1 = precedence_of(token)
        p2 = precedence_of(@operators.last)
  
        if associativity_of(token) == :right_to_left && p1 < p2
          add_remaining_operators
        end
  
        if associativity_of(token) == :left_to_right && p1 <= p2
          add_remaining_operators
        end
      end
    end
  
    def associativity_of(token) 
      case token
        when '=': :right_to_left
        when '+=': :right_to_left
        else :left_to_right
      end
    end

This “works”, all examples pass. But can the code be any better? (Nearly rhetorical question, assume the answer is yes.) And what about the name of the method?

The examples are passing, check in your work before you refactor this unruly method.
Here is a better version of the same thing, breaking out some of the logic into a supporting method. Try this version and see that it works:

    def add_higher_precedence_operators_to_result(token)
      if @operators.length > 0 && should_happen_first(@operators.last, token)
        add_remaining_operators
      end
    end
  
    def should_happen_first(topOp, token)
      if associativity_of(token) == :left_to_right
        precedence_of(token) <= precedence_of(topOp)
      else
        precedence_of(token) < precedence_of(topOp)
      end
    end

While you are in refactoring mode, you have two places where the expression @operators.length > 0 exits; create a method for that:

    def there_are_pending_operators
      @operators.length > 0
    end

Update add_higher_precedence_operators_to_result

    def add_higher_precedence_operators_to_result(token)
      if there_are_pending_operators && should_happen_first(@operators.last, token)
        add_remaining_operators
      end
    end

Update under_logical_top

    def under_logical_top
      there_are_pending_operators && @operators.last != '('
    end

Get your tests passing
Check in your code

There’s one more thing to change before it’s time to call this refactoring side-bar done. The name add_higher_precedence_operators_to_result isn’t quite right. This adds operators that are higher or equal precedence or just higher, depending on the associativity. Rather than try to put the rules in the name, you can change the name to something suggesting its intent.

Rename this method to add_operators_that_should_happen_before(token):

    def add_operators_that_should_happen_before(token)
      ...
    end

Make sure to update handle_operator, which is the one place that calls the method.

Verify that all of your tests are passing.
Check your work in.

Conclusion

Are you done? For the purposes of this exercise yes, but there remains quite a bit of work you could do:

Handle unary operators and binary operators
Tokenize the input expression by properly splitting the line rather than requiring spaces in the input line
Many of the operators are not currently supported. The mechanism is in place to support them but they are not there.
The formatting of the result is built-in, you could instead provide an object that formats a stack and then allow for different transformations.
The list of operators and their precedence and associativity is hard-coded and requires changes to the code when you add new operators. You could provide this information in some kind of data-driven manner or even pass it in during construction.
Function calls must use ( ) and commas, what if you want to allow for flexibility?
There are actually bugs in your system. For example, the code dumps the operand stack in add_operators_that_should_happen_before by calling add_remaining_operators. Can you develop a test to show that this is in fact broken (hint, you need at least three levels of precedence)? Can you then fix it? In fact, that it is broken is an example of violating the third rule of TDD.

However, you’ve made amazing progress on this work. You’ve:

Written several unit tests to organically grow your implementation
Refactored code
Practiced an important aspect of continuous integration, frequent checkins - which requires that you work in small, tangible steps. That is a hard practice to learn and integrate.

Review

The Three Laws

Write no production code without a failing test.
Write only enough of a test such that it fails (and not compiling is failing)
Write just enough production code to get the tests to pass

Refactoring

The three laws are not enough. You refactored code: Remember, refactoring means to change the structure without changing the behavior. In your case, the examples define “the behavior”. So long as those examples remain passing, you’re refactoring.

There are some basics to consider when refactoring:

Prefer adding (even duplicating) first and then updating over directly changing.
Only start when all examples are passing. This might mean you disable an example you just added because you need to change the structure of your solution to support the new example (like what you did to support function calls)
Refactoring is not a separate activity from coding, it should be integrated as a natural part. Continuous refactoring keeps code clean versus building up large piles of design debt.

Methods

Should be short and have a single purpose
Be written at one level of abstraction
Getting the meaning from a method should not require a lot of reading.

See Martin’s Clean Code book for more on, well, writing clean code.

A Final Version

Here is the final report of all of the examples you created:

    Macintosh-7% spec -f s shunting_yard_algorithm_spec.rb
    
    Shunting Yard Algorithm
    
    Shunting Yard Algorithm Basic Algorithm Usage
    - should should convert '' to ''
    
    Shunting Yard Algorithm Operands
    - should convert a single constant to itself, e.g., 42 ==> 42
    - should handle variables as well as constants
    
    Shunting Yard Algorithm Binary Operators
    - should convert 5 + 3 ==> 5 3 +
    - should convert 1 + 3 - 4 ==> 1 3 + 4 -
    - should put higher precedence operators before lower ones
    - should handle interleaved operators of different precedence
    
    Shunting Yard Algorithm Handling ( )'s
    - should remove ( ) from an otherwise empty expression
    - () should cause lower precedence op's to happen before higher op's
    - should handle nested ( )'s
    
    Shunting Yard Algorithm Handling function invocations
    - should put the function name after the ( )'s
    - should put multiple parameters separated by ,'s in order first
    
    Shunting Yard Algorithm Big Examples
    - should handle a large example with several levels of nesting
    
    Shunting Yard Algorithm Operator Associativity
    - Should handle right-associative operators
    
    Finished in 0.040223 seconds
    
    14 examples, 0 failures

Here is the last version I ended up with after the tutorial. You result may vary based on where you added methods. Which one is right? Yours or mine? Both, as long as the examples pass:

    class ShuntingYardAlgorithm
      def init
        @outputTokens = []
        @operators = []
      end
    
      def convert(expression)
        init
        convert_expression expression 
        produce_result
      end
    
      def produce_result
        @outputTokens.join(' ')
      end
    
      def convert_expression(expression)
        expression.split(' ').each { |t| process t  }
        add_remaining_operators
      end
    
      def add_to_result(token)
        @outputTokens << token
      end
    
      def is_operand(token)
        is_number(token) || is_name(token)
      end
    
      def is_number(token)
        token =~ /^\d+$/ 
      end
    
      def is_name(str)
        str =~ /^[a-zA-Z0-9$_]+$/
      end
    
      def last_result_pushed_is_function_name
        is_name @outputTokens.last
      end
    
      def process(token)
        if is_paren token 
          handle_paren token  
        elsif is_function_argument_separator token
          handle_argument_separator
        elsif is_operand token 
          handle_number token 
        else
          handle_operator token 
        end
      end
      
      def is_function_argument_separator(token)
        token == ','
      end
      
      def handle_argument_separator
        record_operators_to_matching_paren
      end
    
     def record_operators_to_matching_paren
        add_to_result last_operator! while under_logical_top
      end
    
      def last_operator!
        @operators.pop
      end
    
      def is_paren(token)
        token =~ /[\(\)]/
      end
    
      def handle_paren(token)
        if token == '('
          process_left_paren
        else
          process_right_paren
        end
      end
    
      def process_left_paren
        if last_result_pushed_is_function_name
          record_operator @outputTokens.pop
        end
        record_operator '('
      end
    
      def process_right_paren
        record_operators_to_matching_paren
        last_operator!
        add_to_result last_operator! if is_name @operators.last
      end
    
      def handle_number(number)
          add_to_result number 
      end
    
      def handle_operator(operator)
        add_operators_that_should_happen_before operator
        record_operator operator
      end
    
      def record_operator(operator)
        @operators << operator
      end
    
      def record_operators_to_matching_paren
        add_to_result last_operator! while @operators.last != '('
      end
    
      def add_operators_that_should_happen_before(token)
        if there_are_pending_operators && should_happen_first(@operators.last, token)
          add_remaining_operators
        end
      end
    
      def should_happen_first(topOp, token)
        if associativity_of(token) == :left_to_right
          precedence_of(token) <= precedence_of(topOp)
        else
          precedence_of(token) < precedence_of(topOp)
        end
      end
    
      def precedence_of(operator)
        case operator
          when '(': 99
          when ')': 99
          when '*': 10
          when '/': 10
          else 1
        end
      end
    
      def associativity_of(token) 
        case token
          when '=': :right_to_left
          when '+=': :right_to_left
          else :left_to_right
        end
      end
    
      def add_remaining_operators
        add_to_result last_operator! while under_logical_top
      end
    
      def under_logical_top
        there_are_pending_operators && @operators.last != '('
      end
    
      def there_are_pending_operators
        @operators.length > 0
      end
    end
    
    describe "Shunting Yard Algorithm" do
      before(:each) do
        @algorithm = ShuntingYardAlgorithm.new
      end
    
      def a_conversion_of expression
        @expression = expression
      end
    
      def should_equal expected
        result = @algorithm.convert @expression
        result.should == expected
      end
    
      describe "Basic Algorithm Usage" do
        it "should should convert '' to ''" do
          a_conversion_of ''
          should_equal ''
        end
      end
    
      describe "Operands" do
        it "should convert a single constant to itself, e.g., 42 ==> 42" do
          a_conversion_of '42'
          should_equal '42'
        end
    
        it "should handle variables as well as constants" do
          a_conversion_of 'a + b'
          should_equal 'a b +'
        end
      end
    
      describe "Binary Operators" do
        it "should convert 5 + 3 ==> 5 3 +" do
          a_conversion_of '5 + 3'
          should_equal '5 3 +'
        end
    
        it "should convert 1 + 3 - 4 ==> 1 3 + 4 -" do
          a_conversion_of '1 + 3 - 4'
          should_equal '1 3 + 4 -'
        end
    
        it "should put higher precedence operators before lower ones" do
          a_conversion_of '1 + 3 * 2'
          should_equal '1 3 2 * +'
        end
    
        it "should handle interleaved operators of different precedence" do
          a_conversion_of '3 + 1 * 4 - 2 / 3'
          should_equal '3 1 4 * + 2 3 / -'
        end
      end
    
      describe "Handling ( )'s" do
        it "should remove ( ) from an otherwise empty expression" do
          a_conversion_of '( )'
          should_equal ''
        end
    
        it "() should cause lower precedence op's to happen before higher op's" do
          a_conversion_of '( 4 + 5 ) * 3'
          should_equal '4 5 + 3 *'
        end
    
        it "should handle nested ( )'s" do
          a_conversion_of '( ( 1 + 3 ) / ( 9 - 5 ) ) * ( 2 + 3 )'
          should_equal '1 3 + 9 5 - / 2 3 + *'
        end
      end
    
      describe "Handling function invocations" do
        it "should put the function name after the ( )'s" do
          a_conversion_of 'f ( 3 )'
          should_equal '3 f'
        end
    
        it "should put multiple parameters separated by ,'s in order first" do
          a_conversion_of 'f ( 4 , 1 , a , d )'
          should_equal '4 1 a d f'
        end
      end
    
      describe "Big Examples" do
        it "should handle a large example with several levels of nesting" do
          a_conversion_of 'f ( g ( ( 1 + 3 ) * 4 ) / x ( y ( z ) ) )'
          should_equal  '1 3 + 4 * g z y x / f'
        end
      end
    
      describe "Operator Associativity" do
        it "Should handle right-associative operators" do
          a_conversion_of 'a = b += 5'
          should_equal 'a b 5 += ='
        end
      end
    end

Overview

Introduction

Basics

Tokens

Operators

Parenthesis

Expressions

Getting Started

The 0th Example

Setup

Setup

Execute

Verify

Check In

Summary

Working up to a basic operation

Refactor the Examples

Example: An Actual Operator

Refactoring Your Mess

From:

To:

Summary

Example: Two Operators, Same Precedence

Example: Two Operators of Different Precedence

Detour: Refactoring Before Moving Forward

End Construction: Back to new development

Refactoring

Summary

Example: More than two operators

Example: Handling Variables

Example: Handling ( )

Example: Removing ( )

process

new methods

Example: ( ) around expression works

Example: ( ) around expression causes lower precedence operator to happen first

handle_paren

precedence_of

Example: Nested ( )’s

Summary

Example: Functions

Example: Basic Function Call

Refactoring

Back to Ruby-isms

Refactoring Summary

Back to supporting Functions

Time to refactor

Example: Multiple parameters to a function

Example: What About Something Complex?

A Quick Refactoring

Example: Operator Associativity

Conclusion

Review

The Three Laws

Refactoring

Methods

A Final Version

Comments