Stacking Boxes
Stacking Boxes is problem 103 in the UVa Online Judge. Even though I include the problem description in this post, I encourage you to visit the UVa Online Judge because there you will be able to submit your solution to get it judged.
I substituted the verb nest with verb fits in the problem statement.
Problem
Consider an n-dimensional
“box” given by its dimensions. In two dimensions the box (2,3)
might
represent a box with length 2 units and width 3 units. In three
dimensions the box (4,8,9)
can represent a box 4 x 8 x 9
(length, width, and height). In 6 dimensions it is, perhaps, unclear
what the box (4,5,6,7,8,9)
represents; but we can analyze properties
of the box such as the sum of its dimensions.
In this problem you will analyze a property of a group of
n-dimensional boxes. You are to determine the longest stack
of boxes, that is a sequence of boxes b_1, b_2, ..., b_k
such
that each box b_i
fits in box b_i+1 (1 <= i < k)
.
A box D = (d_1, d_2, ..., d_n)
fits in a box E = (e_1, e_2, ...,
e_n)
if there is some rearrangement of the
d_i
such that when rearranged each dimension is
less than the corresponding dimension in box E
. This loosely
corresponds to turning box D
to see if it will fit in box E
. However,
since any rearrangement suffices, box D
can be contorted, not just
turned (see examples below).
For example, the box D = (2,6)
fits in the box E = (7,3)
since
D
can be rearranged as (6,2)
so that each dimension is less than the
corresponding dimension in E
. The box D = (9,5,7,3)
does NOT fit in
the box E = (2,10,6,8)
since no rearrangement of D
results in a box
that satisfies the fitting property, but F = (9,5,7,1)
does fit in
box E
since F
can be rearranged as (1,9,5,7)
which fits in E
.
Formally, we define that a box fits in another as follows: box D = (d_1, d_2, ..., d_n)
fits in box E = (e_1, e_2, ..., e_n)
if there is a permutation p
of 1 ... n
such that (d_p[1], d_p[2],
..., d_p[n])
“fits”
in (e_1, e_2, ..., e_n)
i.e., if d_p[i] < e_i
for all
1 <= i < n
.
Input
The input consists of a series of box sequences. Each box sequence
begins with a line consisting of the the number of boxes k
in the
sequence followed by the dimensionality of the boxes, n
(on the same
line.)
This line is followed by k
lines, one line per box with the n
measurements of each box on one line separated by one or more
spaces. The i-th line in the sequence 1 <= i <= k
gives the
measurements for the i-th box.
There may be several box sequences in the input file. Your program
should process all of them and determine, for each sequence, which of
the k
boxes determine the longest stack and the length of
that stack (the number of boxes in the stack).
In this problem the maximum dimensionality is 10 and the minimum dimensionality is 1. The maximum number of boxes in a sequence is 30.
Output
For each box sequence in the input file, output the length of the longest stack on one line followed on the next line by a list of the boxes that comprise this stack in order. The “smallest” or “innermost” box of the stack should be listed first, the next box (if there is one) should be listed second, etc.
The boxes should be numbered according to the order in which they appeared in the input file (first box is box 1, etc.).
If there is more than one longest stack then any one of them can be output.
Sample Input
Sample Output
Approach
We approach the problem as a search problem. We search for an answer in a set of candidates.
An answer is any longest stack. Consider the following sequence of boxes.
The stacks for the sequence are the following.
Stacks (1, 5) (3, 6)
, (2, 4) (3, 5)
, and (2, 4) (3, 6)
are the longest stacks.
Each of them is an answer for the given sequence of boxes.
The set of candidates consists of one stack for each given box. Each candidate is a longest stack that ends with the corresponding box. For the given sequence, the candidates are the following.
We choose candidates in the following way.
For each given box, we consider the set of stacks that end with the box and choose the first longest stack we find.
For example, for box (3, 6)
we consider the following stacks.
Stacks (1, 5) (3, 6)
and (2, 4) (3, 6)
are longest.
We choose the (1, 5) (3, 6)
as the candidate for (3, 6)
.
The set of candidates contains at least one answer.
The reason is that each answer is also a longest stack that ends with some box.
For example, answers (1, 5) (3, 6)
and (2, 4) (3, 6)
are the longest stacks that end with (3, 6)
.
By picking any of them as the candidate for box (3, 6)
, we include an answer in the set of candidates.
We answer with the first longest candidate we find.
Solution
We solve the problem in three steps. We preprocess the input sequence, construct the set of candidates, and search for a longest candidate.
Preprocessing consists in arranging the sequence of boxes so that for each box, the boxes that fit are located to the left of the box. We preprocess by sorting the dimensions of each box and then sorting the boxes lexicographically. Consider the following sequence of boxes.
After sorting the dimensions, the sequence is the following.
The preprocessed sequence is the following.
The preprocessed sequence is arranged.
For example, only box (2, 4)
fits (3, 5)
and box (2, 4)
is to the left of (3, 5)
.
Even though box (1, 5)
does not fit (3, 5)
, we do not care that box (1, 5)
is to the left of (3, 5)
.
We sort the dimensions of boxes because one sorted box fits into another sorted box when corresponding dimensions fit.
For example, box (2, 4)
fits (3, 6)
because 2 < 3
and 4 < 6
.
Another example, box (3, 6)
does not fit (2, 4)
because 3 does not fit 2.
Given that the dimensions are sorted, we already know that the other dimension of the first box does not fit 2.
In general, when we consider boxes a := (a1, a2, ..., an)
and b := (b1, b2, ..., bn)
, a
fits b
if and only if for each 1 <= i <= n
, ai < bi
.
There are two cases, either the condition is satisfied or not.
In the case where the condition is satisfied, we satisfy the definition of fits.
The following diagram illustrates the other case, the case where for some i
, ai
is not less than bi
.
In this case there is no way another dimension of a
fits bi
while the rest of the dimensions of a
fit in other dimensions of b
.
Consider that the only dimensions a1, ..., ai-1
may fit bi
.
Suppose that aj
fits bi
as follows.
Consider that ai, ..., an
may only fit bi+1, ..., bn
.
It is impossible to fit each dimension ai, ..., an
into bi+1, ..., bn
because number of dimensions is not the same.
Sorting the boxes lexicographically after sorting the dimensions arranges the sequence of boxes.
The reason is that for each box, all boxes to the right do not fit the box.
Consider box (2, 4)
in the following sorted sequence.
Box (2, 4)
comes before (3, 5)
because 2 < 3
, which is the same reason why (3, 5)
does not fit (2, 4)
.
In general, when we consider boxes a := (a1, a2, ..., an)
and b := (b1, b2, ..., bn)
, b
does not fit a
if a
comes before b
.
If a
comes before b
, either all dimensions of a
and b
are the same or there is a dimension ai
that is less or equal to dimension bi
, as illustrated in the following diagram.
In either case, the condition that each dimension of b
fits the corresponding dimension of a
is not satisfied and thus b
does not fit a
.
We construct the set of candidates by iterating the preprocessed sequence applying the following two rules to each box.
If no candidate to the left of the box fits the box, then the candidate for the box is the box itself.
Consider the following table that indicates the candidates for (1, 5)
and (2, 4)
.
For box (2, 4)
, candidate (1, 5)
does not fit, thus stack (2, 4)
is the candidate for box (2, 4)
.
If at least one candidate to the left fits the box, we stack the box on all candidates that fit and we choose the first longest stack we get.
Consider the following table that indicates the candidates for all boxes.
For box (3, 6)
, candidates (1, 5)
and (2, 4)
fit the box.
Each candidate corresponds to a longest stack that ends with (3, 6)
, stacks (1, 5), (3, 6)
and (2, 4) (3, 6)
respectively.
We choose (1, 5) (3, 6)
as the longest candidate for (3, 6)
because we consider candidate (1, 5)
before candidate (2, 4)
.
The construction of the set of candidates belongs to complexity class O(n^2)
.
Our construction corresponds to the O(n^2)
algorithm for the Longest Increasing Subsequence Problem.
We return the first longest candidate we find from left to right. Consider the selected candidate from the set of candidates.
Implementation
We implement our solution in C. We explain our implementation of the preprocessing, construction of the set of candidates, and search for a longest candidate. After that, we show the full code for our implementation.
We apply function merge_sort_ints(int *a, int n)
to sorting the dimensions of a given box.
Function merge_sort_ints
sorts array a
of n
integers by merge sort.
Consider the following application of merge_sort_ints
in line SORT_DIMS
.
That application sorts dimensions for each box after reading all the boxes for a sequence.
We apply function merge_sort_boxes(box *a, int n, int d)
to sorting the sequence of boxes.
Function merge_sort_boxes
sorts array a
of n
boxes that have d
dimensions by merge sort.
Consider the following application of merge_sort_boxes
in line SORT_BOXES
.
That application sorts the sequence of boxes after sorting dimensions.
We compute the candidate set with the following block of code in main
after sorting the sequence.
We search for a longest candidate and we keep the first we find in the following code.
The full code of our implementation is the following.
Our implementation was accepted by the UVa Online Judge.
Summary
We approach Stacking Boxes as a search problem.
We restrict the search space to a set of candidates.
The construction of the set of candidates belongs to complexity class O(n^2)
and is the most expensive part of our solution.
Our solution to the problem consists of three steps.
- Sort dimensions for each box so that comparing any two boxes reduces to comparing their corresponding dimensions.
- Sort the sequence of boxes lexicographically so that for each box, the boxes to the left are the boxes that may fit.
- Construct the set of candidates in a similar way to the
O(n^2)
algorithm for the Longest Increasing Subsequence Problem.
A faster solution
I have explored the possibility of solving the problem in at most O(n log n)
steps.
A quick look at Algorithmist shows that Stacking Boxes is a particular case of the Longest Increasing Subsequence Problem.
There I found an O(n log n)
algorithm that computes the longest increasing subsequence of an array of integers, which I tried to apply to the problem.
The algorithm looked promising because it refines a single solution as it processes the sequence of integers one by one.
The problem is that, unlike integers, boxes are not always comparable by means of the relation ‘fits’.
For example, neither (1, 5)
fits (3, 5)
nor vice versa.
In mathematicalese that means that the relation ‘fits’ is not a total relation.
Given that not all boxes are comparable, instead of refining only one solution, I could only think of an adaptation of the algorithm that refines a set of candidates.
The refinement consists in searching for each box a stack where you can place it.
The procedure for searching a stack for a box is similar to patiente sorting.
The difference is that finding the stack for each box cannot be done by binary search applying relation ‘fits’.
The worst case scenario is that all boxes in the input sequence are incomparable and for that reason my adaptation belongs to class O(n^2)
.
Do you have any idea how to solve Stacking Boxes faster? Let me know in the comments.