Previously: getting started

The skyline function

Now what I’ve specified in the “skyline function” part of this task is that a “box” is defined by three numerical values: its left edge position on the x axis, its width, and its height.

It’s probably my old Object-Oriented habits, but personally I prefer labeled attributes rather than using a raw tuple for these things. That is to say, I could define a box by convention as some ordered collection of numbers, like (in Clojure) a vector [2.0 3.0 4.0] or list like '(2.0 3.0 4.0).

But when I’m writing code that does things like calculating the position of the right edge of a box (which I am almost certain I’ll want to do, and soon), I don’t really want to calculate right this way:

(+ (first box) (second box))

but would much rather do it this way:

(+ (:left box) (:width box))

So I’ll make a Box record to use from here on.

testing constructors

I start with a check:

(fact "I can make a box"
  (:left (->Box 1 2 3)) => 1
  (:width (->Box 1 2 3)) => 2
  (:height (->Box 1 2 3)) => 3
  )

Notice that there is no Box record defined yet, so I’m writing this test expecting it to fail.

And indeed that’s just what happens.

java.lang.RuntimeException: Unable to resolve symbol: ->Box in this context, compiling:(tablecloth/core_test.clj:6:10)

That’s easy to fix over in tablecloth.core

(ns tablecloth.core)

(defrecord Box [left width height])

Not complicated or surprising.

heights

Now the actual task here is to report a maximum height value at some given x position, in the context of a collection of Box items.

The collection could be empty, but frankly that’s a boring case so I put it off until later.

It seems that what I want to do will involve collecting a bunch of different height “contributions” at some position x, one for all the zero or more boxes that happen to be sitting over position x, and then return the maximum of those contributions. Feels like I will want to be able to return something called box-height, which is the “height contribution” of some single box at some particular x position. Think of it as being the “single-box skyline”, right? It’s 0.0 everywhere around the Box, and that box’s :height only where the box actually sits.

Good enough for a test:

(facts "about box-height"
  (fact "I can detect the box when the x falls within it"
    (box-height (->Box 1 10 2) 2) => 2
    ))

That fails (predictably) because there is no function called box-height yet.

(defn box-height
  [box x]
  (let [l (:left box)
        w (:width box)
        r (+ l w)
        h (:height box)]
    (cond
      (< x l) 0
      (<= x r) h
      :else 0
      )))

So the idea here is simple enough. If x is less than the :left of the box, the result is 0. If that’s not true, but x is less than or equal to the right side of the box, then we return the :height. Otherwise we’re over to the box’s right, and we return 0.

Did I need all that conditional logic? No, not at all. But I’m not using “strict” test-driven design here, I’m just writing tests first. If I were using real TDD, then I would have made this test pass (the one that checks at x=2 that the height is 2) by always returning 2. The next test I’d written, which would have no doubt checked some other x value, that’s when I would have added more logic.

This is a big source of confusion for many developers, some of whom even present (wrong) information at conferences: “TDD” does not simply mean “writing a failing unit test before you write the code that makes it pass”. Test-driven design means you only add code that makes a failing test pass.

Anyway, I don’t usually use strict TDD. I just want tests to tell me I haven’t broken anything, and to slow down my fevered head to the point where I don’t lose track of what’s working and what isn’t.

So I’m going to add a bunch more tests, and I expect all of them to actually pass because I did the “whole function” already:

(facts "about box-height"
  (fact "returns height, given one box and any x value"
    (box-height (->Box 1 10 2) 2) => 2
    (box-height (->Box 1 10 2) 1) => 2
    (box-height (->Box 1 10 2) 0) => 0
    (box-height (->Box 1 10 2) 10) => 2
    (box-height (->Box 1 10 2) 11) => 2
    (box-height (->Box 1 10 2) 11.00001) => 0
    (box-height (->Box 1 10 2) 9000) => 0
    ))

And indeed, this is the case.

overlapping boxes

Now that I can determine the “profile” of one Box accurately, I’d expect I will want to take a collection of Box items and measure their individual box-height values all at once. This is just mapping the box-height function over the collection of boxes, but I just want to get a sense of how this might work, so I write a little “sketch” in my test file.

(fact "I can determine all the heights at one x"
  (let [boxes [(->Box 1 10 2)
               (->Box 2  2 3)
               (->Box 12 3 4)]]
    (map #(box-height % 2.0) boxes) => [2 3 0]
    (map #(box-height % 0.0) boxes) => [0 0 0]
    (map #(box-height % 13.0) boxes) => [0 0 4]
    ))

So just to help get my head around where I think I want to go, I’ve picked three boxes where the first two overlap, and the third is off on its own. Picking an x that falls in the region where the first two overlap should give me two non-zero box-height values, picking one that doesn’t fall under any of them should give me three 0 values, and picking one that’s over where the third box sits should give me its height only. And that is the case.

This test passes immediately, because I’ve just been exercising the code I’ve already written. It’s not so much a unit test as a sort of implicit acceptance test of it. Something like “I can probably write my sklyine function this way, invoking the function and producing this intermediate result.”

skyline

That “sketch” I just made might seem extraneous to all you powerful thinkers and ninja programmers, but I like the way it helps me frame this next one:

(fact "skyline function returns tallest height at x"
  (let [boxes [(->Box 1 10 2)
               (->Box 2  2 3)
               (->Box 12 3 4)]]
    (skyline boxes 2.0) => 3
    (skyline boxes 0.0) => 0
    (skyline boxes 13.0) => 4
    ))

See how this echoes the previous test? All I’ve really done is made the previous explicit calculation of all the box-height values, and manually taken the max of those.

I could have done something else in this test, I suppose, to indicate my intention. For instance, instead of (skyline boxes 13.0) => 4 I might have said (skyline boxes 13.0) => (max [0 0 4]), but to be honest that feels less clear than doing it this way. Your experience may vary.

Anyway, that test fails, because there is no skyline function yet. So:

(defn skyline
  [boxes x]
  (apply max (map #(box-height % x) boxes)))

That’s tidy enough to make me happy. It does only one thing (a plus), and it’s not even very opaque or nested.

refactoring and such

Here’s my complete Clojure source file, so far:

(ns tablecloth.core)

(defrecord Box [left width height])

(defn box-height
  [box x]
  (let [l (:left box)
        w (:width box)
        r (+ l w)
        h (:height box)]
    (cond
      (< x l) 0
      (<= x r) h
      :else 0
      )))

(defn skyline
  [boxes x]
  (apply max (map #(box-height % x) boxes)))

I haven’t got any doc-strings, which I should probably fix. And that box-height function feels a bit spread out. I wonder if I could refactor it a bit?

I spend some time fiddling with the conditional part of box-height. I suppose in some sense an if would be “tidier”, especially since the “left of box” and “right of box” parts both return a value of 0. But when I write that, it involves a freaky clause like (if (or (< x l) (> x r))) and you know what? That’s not communicative to me. This, even though it’s a bit redundant, feels more viscerally like what I’m envisioning: a scan from left to right.

So I leave it. Since I admonished players to try to practice good practices, here’s where my code ends up:

(ns tablecloth.core)

(defrecord Box [left width height])

(defn box-height
  "given a Box record and a numeric x value, returns the height of that Box measured at that x (or 0, if x lies outside the box)"
  [box x]
  (let [l (:left box)
        w (:width box)
        r (+ l w)
        h (:height box)]
    (cond
      (< x l) 0
      (<= x r) h
      :else 0
      )))

(defn skyline
  "given a collection of Box records and a numeric x value, returns the maximum height of any Box in the collection, measured at that x"
  [boxes x]
  (apply max (map #(box-height % x) boxes)))

I don’t see a need for type hints or anything fancy like that, yet. Maybe there will be, somewhere down the road? I somehow doubt it, though.

Next time I’ll work on the skyline-changed? predicate. Thinking a bit about that, I start to suspect there may be a little pressure building to make the Box record a little richer. We’ll see.

hang on one second here

But wait! Since writing the above, I walked away and came back, and while I was away my memory was jogged and I realize I promised to write a test that isn’t here (for an empty collection of Box items). And also as I proofread this page on my phone, a totally useful refactoring of the box-height function struck me. Let me do that.

And no, it was not a “trick” to get you to see these mistakes I’m making, as if I were going into the basement to investigate the strange noise the monster was making. I literally didn’t think of these until just now.

First, the test for an empty Box collection. Since I’m coming into this “afterwards”, I want to treat my existing codebase as if it were “legacy” code, so I write an intentionally failing test first:

(fact "skyline works with an empty collection"
  (skyline [] 5) => 77
  )

This is claiming that skyline applied to an empty collection will produce a result of 77, which if skyline works as intended should just fail because the number is wrong.

But instead:

FAIL "skyline works with an empty collection" at (core_test.clj:45)
    Expected: 77
      Actual: clojure.lang.ArityException: Wrong number of args (0) passed to: core/max
...

(Plus a bunch of stack trace info you don’t need to see.)

Well, there are a couple of ways to do this, but the easiest way I can see from here is to just drop a “default” height of 0 into the collection of numbers being sent to max. After all, my problem definition doesn’t allow negative height values, so until that is changed this seems a reasonable approach.

(defn skyline
  "given a collection of Box records and a numeric x value, returns the maximum height of any Box in the collection, measured at that x"
  [boxes x]
  (apply max
         (map #(box-height % x)
              (conj boxes 0))))

Aside from re-wrapping the function “lispily” to align the arguments, I’ve added that little (conj boxes 0) bit at the end. That jams the number 0 into the received collection boxes, regardless of whether it’s present or not in the original collection.

I suppose I could have done some kind of empty? predicate stuff, but… why? That might communicate my intention now better, but in some other sense this feels like it works, it’s readable, and it has the added benefit of always eliminating negative heights if they crop up.

Not that they will. And even if they did, I would want to have written a test before I asserted they were an issue. But /shrug nobody’s perfeck.

Aaaaannnd nope. Big time weird Clojure error messages, enigmatic as ever.

What have I done?? Here’s where I squint and realize… Wait a minute. I’m adding the value 0 to the collection of Box items. That’s not right! I need to add it to the collection of box-height values, before max is applied.

(defn skyline
  "given a collection of Box records and a numeric x value, returns the maximum height of any Box in the collection, measured at that x"
  [boxes x]
  (apply max
     (conj (map #(box-height % x) boxes)
           0)))

That works! Now I have an error I expect when I run the tests:

FAIL "skyline works with an empty collection" at (core_test.clj:45)
    Expected: 72
      Actual: 0

I change the test to what I expect will pass, and it does:

(fact "skyline works with an empty collection"
  (skyline [] 5) => 0
  )

refactoring more

So what struck me when I was looking over this page on my phone was that this function

(defn box-height
  "given a Box record and a numeric x value, returns the height of that Box measured at that x (or 0, if x lies outside the box)"
  [box x]
  (let [l (:left box)
        w (:width box)
        r (+ l w)
        h (:height box)]
    (cond
      (< x l) 0
      (<= x r) h
      :else 0
      )))

has an awful lot of let statements. And to be honest, they’re not even used more than once. That’s ridiculous, frankly.

So how about this change?

(defn box-height
  "given a Box record and a numeric x value, returns the height of that Box measured at that x (or 0, if x lies outside the box)"
  [box x]
  (let [left (:left box)]
    (cond
      (< x left)
        0
      (<= x (+ left (:width box)))
        (:height box)
      :else
        0
      )))

I’ve kept a more explicit left local variable, but since I’m only looking up the :width and :height once each, I just inlined those. I’m not sure it’s better, but at least I don’t have that top-heavy feeling I dislike in Clojure code. Generally, when I’m refactoring other people’s code, if they’ve got a huge pile of let function calls, and they’re nested, I am moved to pull those out into some intentional single-purpose function of its own.

Anyway, I think this does that a bit better. The fact that I’m using field names of the Box record directly makes it a bit wordier, but also more obvious what the various parts are doing.

Also, you should have yelled at me for using single-letter variables like l and r and h before. Sheesh.

All the tests pass, and I like the code a bit more. So now I should move along.

Next: testing whether a skyline has changed