Advanced Dynamic Programming
Fall 2015
Instructor: Dr. Rajesh Ganesan
Eng Bldg. Room 2217
Phone: (703) 993-1693
Fax: (703) 993-1521
Email: rganesan at gmu dot edu
DP refresher Notes
ADP need and eqautions
Machine replacement problem:
Limiting Probabilities and MDP, Exhaustive enumeration, LP solution to MDP, MDP- Average Reward/cost- Policy and Value Iteration, Discounted Cost-Policy and Value Iteration
Excel Example value iteration for MDP
Matlab files:
Value iteration Discounted cost criteria
Figure 4.2 page 120 value iteration_ADP
Figure 4.4 page 128 value iteration_ADP2
Figure 4.7 page 141 value iteration_ADP 3
Fig 4.7 page 141 with MSE value iteration_ADP3x
1) Solve the following using Matab or any other language using the algorithms a) value iteration b) 4.2 c) 4.4, and d) 4.7 given above.
Inventory control Example, see hand out, excel for inventory control
2) Water Resource example: see handout value iteration using excel
Solve the water resource problem using a) value iteration b) 4.2 c) 4.4, and d) 4.7 given above with dam capcity 20 units. The possible decisions are to release upto the dam's availability. Use same cost functions as in the handout. The probability of inflow is
Prob of inflow | |
P(0) | 0.02 |
P(1) | 0.05 |
P(2) | 0.01 |
P(3) | 0.06 |
P(4) | 0.02 |
P(5) | 0.02 |
P(6) | 0.08 |
P(7) | 0.09 |
P(8) | 0.01 |
P(9) | 0.04 |
P(10) | 0.1 |
P(11) | 0.04 |
P(12) | 0.02 |
P(13) | 0.16 |
P(14) | 0.04 |
P(15) | 0.05 |
P(16) | 0.02 |
P(17) | 0.08 |
P(18) | 0.06 |
P(19) | 0.02 |
P(20) | 0.01 |
How does the solution change if the probability of inflow is uniform (equal prob of 1/21) or poisson with mean 10.
Fig 4.7 page 141 with MSE value iteration_ADP4 to study different alpha decay rates
VFA matlab code with 4 schemes
VFA with diffusion wavelet
VFA DW Theory
DW - Haar wavelet demo to go with the ppt above
Diffusion wavelets DW code for best basis - multiple levels
excel to show value determination
ADP with only scaling function code
ADP with scaling and wavelet functions code