Elements of Dynamic Programming

We have done an example of dynamic programming: the matrix chain multiply problem, but what can be said, in general, to guide us to choosing DP?

Optimal Substructure
Overlapping Sub-problems
Variant: Memoization

Optimal Substructure: OS holds if optimal solution contains within it optimal solutions to sub problems. In matrix-chain multiplication optimally doing A₁, A₂, A₃, ...,A_n required A _1...kand A _{k+1 ...n} to be optimal. It is often easy to show the optimal sub problem property as follows:

Split problem into sub-problems
Sub-problems must be optimal, otherwise the optimal splitting would not have been optimal.

There is usually a suitable "space" of sub-problems. Some spaces are more "natural" than others.

For matrix-chain multiply we chose sub-problems as sub chains. We could have chosen all arbitrary products, but that would have been much larger than necessary! DP based on that would have to solve too many sub-problems.

A general way to investigate optimal substructure of a problem in DP is to look at optimal sub-, sub-sub, etc. problems for structure. When we noticed that sub problems of A₁, A₂, A₃, ...,A_nconsisted of sub-chains, it made sense to use sub-chains of the form A_i, ..., A_jas the "natural" space for sub-problems.

Overlapping Sub-problems: Space of sub-problems must be small: recursive solution re-solves the same sub-problem many times. Usually there are polynomially many sub-problems, and we revisit the same ones over and over again: overlapping sub-problems.

Recursive-Matrix-Chain (p, i, j) (inefficient recursive program)

if i = j
then return 0
m[i,j] Ź Ľ
for k Ź i to j - 1
do q Ź Recursive-Matrix-Chain (p, i, k ) + Recursive-Matrix-Chain (p, k + 1, j) + p_i-1p_kp_j
if q < m[i,j]
thenm[i,j] Ź q
return m[i,j]

With divide-and-conquer, each sub-problem is new, in DP, most of the sub-problems are old. Thus storing solutions to sub-problems with DP in a lookup table saves loads of time.

T (n) = running time to compute m[1,n] by the above recursive algorithm:

i = j and q < m[i,j] unit time are running time assumptions

T (1) ł 1

T (n) ł 1 +

n - 1
ĺ
k = 1 ( T (k) + T ( n - k ) + 1 )

ł 2

n - 1
ĺ
i = 1 T (i) + n

Will show T (n) ł 2^n-1 ( = W (2ⁿ))
Induction n = 1, T (1) ł 2^1-1 = 1 Ö

Assume true for up to n:

T (n) ł 2

n - 1
ĺ
i = 1 2^i-1 + n

= 2 ( 1 + 2¹+ 2²+ ... + 2^n-2) + n
= 2 ( (2^n-1-1) / ( 2 - 1 )) + n

= 2 ( 2^n-1 - 1 ) + n

= 2ⁿ- 2 + n ł 2^n-1( true when n ł 2 !!)

Recall: DP Q ( n²) sub-problems solved. Rule of thumb: Whenever a recursive approach solves the same problem repeatedly this signals a good DP candidate.

Memoization: What if we stored sub-problems and used the stored solutions in a recursive algorithm? This is like divide-and-conquer, top down, but should benefit like DP which is bottom-up. Memoized version maintains an entry in a table. One can use a fixed table or a hash table.

Memoized-Matrix-Chain (p)

n Ź length[p] - 1
for i Ź 1 to n
do for j Ź i to n
do m[i,j] Ź Ľ (initialize to "undefined" table entries)
return Lookup-Chain (p, 1, n)

Lookup-Chain (p, i, j)

if m[i,j] < Ľ (see if we know or not)
then return m[i,j]
if i = j
then m[i,j] Ź 0
else for k Ź i to j - 1
do q Ź Lookup-Chain (p, i, k) + Lookup-Chain (p, k + 1, j) + p_i-1p_kp_j
if q < m[i,j]
then m[i,j] Ź q
return m[i,j]

Each call is either:

O (1) if m[i,j] was previously computed

O (n) if not

Each m[i,j] called many times, but initialized only once. O ( n n² ) Memoization turns W (2ⁿ) Ž O ( n³ )!

Dynamic Programming - 2 of 3