> Haghani & Dewey 2016 experiment with a double-or-nothing coin-flipping game where the player starts with $25 (ie. [$25]($2016)) and has an edge of 60%, and can play 300 times, choosing how much to bet each time, winning up to a maximum ceiling of $250. Most of their subjects fail to play well, earning an average $91, compared to the Haghani & Dewey 2016 heuristic benchmark of ~$240 in winnings achievable using a modified Kelly Criterion as their strategy. The KC, however, is not optimal for this problem as it ignores the ceiling and limited number of plays. > > We solve the problem of the value of optimal play exactly by using decision trees & dynamic programming for calculating the value function, with implementations in R, Haskell, and C. (See also [Problem #14](/problem-14 "‘Problem 14 Dynamic Programming Solutions’, Branwen et al 2022"){.backlink-not}.) We also provide a closed-form exact value formula in R & Python, several approximations using Monte Carlo/random forests/neural networks, visualizations of the value function, and a Python implementation of the game for the OpenAI Gym collection. > > We find that optimal play yields \$246.61 on average (rather than ~\$240), and so the human players actually earned only 36.8% of what was possible, losing \$155.6 in potential profit. Comparing decision trees and the Kelly criterion for various horizons (bets left), the relative advantage of the decision tree strategy depends on the horizon: it is highest when the player can make few bets (at _b_ = 23, with a difference of ~\$36), and decreases with number of bets as more strategies hit the ceiling. > > In the Kelly game, the maximum winnings, number of rounds, and edge are fixed; we describe a more difficult generalized version in which the 3 parameters are drawn from Pareto, normal, and beta distributions and are unknown to the player (who can use Bayesian inference to try to estimate them during play). Upper and lower bounds are estimated on the value of this game. In the variant of this game where subjects are not told the exact edge of 60%, a Bayesian decision tree approach shows that performance can closely approach that of the decision tree, with a penalty for 1 plausible prior of only \$1. > > Two deep reinforcement learning agents, DQN & DDPG, are implemented but DQN fails to learn and DDPG doesn't show acceptable performance with default settings, indicating better tuning may be required for them to solve the generalized Kelly game.

Total bets Value function Decision tree performance Kelly performance Difference ---------- -------------- ------------------------- ----------------- ---------- 1 30 30.1 26 4.1 2 36 35.5 27 8.5 3 43.2 42.4 28.2 14.2 4 45.36 46 29.3 16.7 5 47.95 49.3 30.5 18.8 6 53.65 52.5 31.5 21 7 54.59 51.4 32.7 18.7 8 57.57 56.1 34.1 22 9 61.83 61.4 35.6 25.8 10 62.6 63 36.8 26.2 11 66.76 68.6 38.3 30.3 12 68.22 66.9 40.2 26.7 13 70.6 70.7 41.8 28.9 14 74.13 71.2 43.2 28 15 75 73.8 44.9 28.9 16 79.13 79.7 46.8 32.9 17 79.75 79.1 47.8 31.3 18 82.5 84.1 50.1 34 19 84.71 83.7 52.7 31 20 86.25 85.6 54.2 31.4 21 89.8 88.6 56.4 32.2 22 90.28 90.2 56.5 33.7 23 93.85 94.8 58.7 36.1 24 94.49 93.7 60.7 33 25 97 95 62.2 32.8 26 98.83 97.1 65.2 31.9 27 100.38 98.7 68.1 30.6 28 103.23 102.1 69.4 32.7 29 103.95 102.5 73.4 29.1 30 107.61 107.7 73.7 34 31 107.64 107.3 74.7 32.6 32 110.4 106.7 76.1 30.6 33 111.41 107.9 79.2 28.7 34 113.38 115.1 80.6 34.5 35 115.24 115.4 82.2 33.2 36 116.48 116.1 84.2 31.9 37 119.09 118.5 87.1 31.4 38 119.69 119.9 86 33.9 39 122.94 125.4 89.6 35.8 40 122.96 119.7 92 27.7 41 125.54 124.2 95.2 29 42 126.28 124.4 96.9 27.5 43 128.21 128.5 97.1 31.4 44 129.63 130.5 100.3 30.2 45 130.96 130.1 100 30.1 46 132.98 131.9 100.9 31 47 133.78 132.4 104.2 28.2 48 136.33 134.2 104.5 29.7 49 136.64 133.6 107.7 25.9 50 139.36 141.2 110.1 31.1 51 139.54 139.4 113.6 25.8 52 141.68 140 113 27 53 142.45 140.7 113.5 27.2 54 144.08 141.8 115.5 26.3 55 145.36 145 116.3 28.7 56 146.52 150.2 119.7 30.5 57 148.26 146.4 119.7 26.7 58 149 143.9 120.6 23.3 59 151.15 151.8 124.2 27.6 60 151.5 148.5 124.4 24.1 61 154.01 151.9 125.5 26.4 62 154.01 150.9 127.1 23.8 63 156.05 157.9 128.4 29.5 64 156.52 154.3 129.9 24.4 65 158.14 155.5 132.3 23.2 66 159.03 156.8 132.1 24.7 67 160.25 157.4 133.2 24.2 68 161.53 159.2 137.1 22.1 69 162.39 160.2 135.9 24.3 70 164 161 137.8 23.2 71 164.54 162.2 137.8 24.4 72 166.46 166.7 138.3 28.4 73 166.7 165.2 142.7 22.5 74 168.81 169 145 24 75 168.85 168.3 143.1 25.2 76 170.59 169.5 144.4 25.1 77 171 165.6 146.4 19.2 78 172.39 171.2 147.5 23.7 79 173.13 171.3 150.6 20.7 80 174.21 170.6 151.8 18.8 81 175.24 174 152.3 21.7 82 176.04 175.5 153.8 21.7 83 177.34 174.9 151.8 23.1 84 177.87 177.6 152.5 25.1 85 179.4 178.5 157.3 21.2 86 179.7 177.1 156 21.1 87 181.44 178.9 158.1 20.8 88 181.52 179.6 160.1 19.5 89 183.15 181.1 159 22.1 90 183.33 182.8 163.3 19.5 91 184.68 184.2 162.3 21.9 92 185.13 183.4 162.5 20.9 93 186.21 187.5 165.1 22.4 94 186.9 185.3 160.5 24.8 95 187.75 188.6 164.8 23.8 96 188.66 186.4 167.1 19.3 97 189.29 187.6 168 19.6 98 190.39 188.9 167.7 21.2 99 190.82 187.8 169.8 18 100 192.1 190.7 168.4 22.3 101 192.34 192.5 171.8 20.7 102 193.78 192.6 170 22.6 103 193.86 193.2 170.7 22.5 104 195.24 194.1 170 24.1 105 195.35 192.9 174.1 18.8 106 196.52 195.2 176.8 18.4 107 196.84 194.5 173.4 21.1 108 197.79 194.4 179.1 15.3 109 198.3 195.5 176 19.5 110 199.07 196.7 179.1 17.6 111 199.74 198.7 181.2 17.5 112 200.34 201.1 178.2 22.9 113 201.17 197.9 180.9 17 114 201.6 200.3 181.2 19.1 115 202.57 202 183.2 18.8 116 202.85 201.6 181 20.6 117 203.94 201.7 181.4 20.3 118 204.09 201.2 183.6 17.6 119 205.29 205.9 185 20.9 120 205.32 201.3 186.8 14.5 121 206.4 204 182.2 21.8 122 206.53 203.7 186.2 17.5 123 207.44 205.7 186.1 19.6 124 207.72 205.2 189.5 15.7 125 208.48 203.9 191.4 12.5 126 208.9 209.3 188 21.3 127 209.52 206.7 187.7 19 128 210.06 209.5 188.5 21 129 210.54 206.5 192.4 14.1 130 211.2 211.1 190.9 20.2 131 211.56 207.1 195.6 11.5 132 212.32 210.3 194 16.3 133 212.57 212.1 191.1 21 134 213.42 211 192.7 18.3 135 213.56 210.2 195.8 14.4 136 214.5 213.3 196.8 16.5 137 214.55 211.3 194.4 16.9 138 215.46 212 196.6 15.4 139 215.52 210.8 197.4 13.4 140 216.3 215.3 197 18.3 141 216.47 217.8 199.3 18.5 142 217.13 215.4 197.3 18.1 143 217.41 214.8 196.2 18.6 144 217.96 213.9 200.1 13.8 145 218.33 215.7 200.4 15.3 146 218.77 217.4 200.1 17.3 147 219.24 217.5 199.7 17.8 148 219.58 218.5 200.3 18.2 149 220.13 218.4 200.3 18.1 150 220.39 220.4 201.9 18.5 151 221 218.1 201.6 16.5 152 221.18 220.5 203.9 16.6 153 221.86 220.6 202.6 18 154 221.96 220.5 205.2 15.3 155 222.69 218.7 203.1 15.6 156 222.72 220.6 204.4 16.2 157 223.43 220.6 203.3 17.3 158 223.48 221.1 202.8 18.3 159 224.09 222.6 207.1 15.5 160 224.22 224.5 207.5 17 161 224.74 220.8 206 14.8 162 224.95 224.2 208.1 16.1 163 225.39 223.8 208 15.8 164 225.67 222.8 209 13.8 165 226.03 223.4 208.6 14.8 166 226.37 224 210 14 167 226.66 225.3 209.2 16.1 168 227.06 224.1 211.6 12.5 169 227.28 224.5 210.5 14 170 227.73 223.8 211 12.8 171 227.89 226.9 209.1 17.8 172 228.39 226 212.2 13.8 173 228.49 226 211.8 14.2 174 229.04 226.6 212.1 14.5 175 229.09 227.9 211.3 16.6 176 229.67 226.4 211.5 14.9 177 229.67 228 214 14 178 230.18 228.4 215.1 13.3 179 230.24 227.5 213.3 14.2 180 230.68 229.2 213.6 15.6 181 230.8 229.5 215 14.5 182 231.18 228.7 213.9 14.8 183 231.36 229.8 216 13.8 184 231.67 230.6 214.4 16.2 185 231.9 231 213.1 17.9 186 232.16 231.2 216.2 15 189 232.94 231.1 217.9 13.2 190 233.1 230.4 217.6 12.8 191 233.45 231.1 218.4 12.7 192 233.56 231.9 219 12.9 193 233.94 232.1 216.6 15.5 194 234.02 232 219.3 12.7 195 234.43 231.8 217.5 14.3 196 234.47 232.4 220.6 11.8 197 234.9 233.5 218.6 14.9 198 234.9 233 219.3 13.7 199 235.3 233.4 220.2 13.2 200 235.33 233.8 221.1 12.7 201 235.67 235.5 218.8 16.7 202 235.75 233 222 11 203 236.05 232.9 220.4 12.5 204 236.17 233.9 220.1 13.8 205 236.42 234.8 221 13.8 206 236.57 234.4 221.4 13 207 236.78 234.8 222.6 12.2 208 236.96 236.4 222.5 13.9 209 237.14 234.6 223.5 11.1 210 237.35 236.6 222.6 14 211 237.49 235.7 221.9 13.8 212 237.73 234.4 222.4 12 213 237.83 234.9 226 8.9 214 238.1 237.3 223.9 13.4 215 238.17 237 223.6 13.4 216 238.46 235.7 225.1 10.6 217 238.5 236.6 223.6 13 218 238.81 237 226.1 10.9 219 238.83 236.4 225 11.4 220 239.15 237.7 225.7 12 221 239.15 236.8 225.9 10.9 222 239.43 237.7 225.9 11.8 223 239.46 238.6 224.8 13.8 224 239.71 237.1 226.3 10.8 225 239.77 238.7 227.4 11.3 226 239.98 238.7 225.9 12.8 227 240.07 238 226.9 11.1 228 240.25 240.5 227.6 12.9 229 240.36 238.8 227.5 11.3 230 240.51 237.9 225.8 12.1 231 240.65 238.5 228.2 10.3 232 240.77 239.3 226.6 12.7 233 240.92 238.8 226.1 12.7 234 241.03 240.2 228.8 11.4 235 241.2 240.4 227.5 12.9 236 241.28 240 227.4 12.6 237 241.46 239.8 228 11.8 238 241.52 240.6 228.8 11.8 239 241.72 240.1 228.7 11.4 240 241.76 240.2 229.2 11 241 241.98 240.3 229.2 11.1 242 242 240.7 229.3 11.4 243 242.22 240.5 229.7 10.8 244 242.23 239.9 229.2 10.7 245 242.44 241.2 230.3 10.9 246 242.45 240.7 230.5 10.2 247 242.64 241.3 231.5 9.8 248 242.68 239.2 230.4 8.79 249 242.84 241.5 230.3 11.2 250 242.89 241.4 230.6 10.8 251 243.03 242.2 230.4 11.8 252 243.1 241.7 232.3 9.39 253 243.22 242.6 232.2 10.4 254 243.31 241 229.7 11.3 255 243.41 240.7 231.1 9.59 256 243.51 242.2 231 11.2 257 243.59 241 232.4 8.59 258 243.7 242.1 230.2 11.9 259 243.77 242 232.1 9.90 260 243.9 243.2 230.4 12.8 261 243.95 242.9 233.6 9.30 262 244.08 242.6 233.6 9 263 244.12 243 231.3 11.7 264 244.26 242.3 233.5 8.80 265 244.29 241.8 233.8 8 266 244.44 242.9 233.1 9.80 267 244.46 242.6 233.8 8.79 268 244.61 242.9 234 8.90 269 244.62 244 234.3 9.69 270 244.76 243.6 234.4 9.19 271 244.77 243.6 235 8.59 272 244.91 243.2 234.7 8.5 273 244.93 243.9 233.2 10.7 274 245.04 243.5 233.5 10 275 245.08 243.7 234.2 9.5 276 245.18 243.4 234.8 8.59 277 245.23 244.2 234.2 10 278 245.31 244.8 234.8 10 279 245.37 244.1 234.7 9.40 280 245.44 243.7 234.1 9.59 281 245.51 244.1 234 10.1 282 245.57 243.6 235.8 7.79 283 245.65 243.8 235.3 8.5 284 245.7 244 235 9 285 245.78 244.3 236.9 7.40 286 245.82 243.7 235 8.69 287 245.91 244.6 236.2 8.40 288 245.94 244.7 235.4 9.29 289 246.04 245.2 237.3 7.89 290 246.06 244.6 234.8 9.79 291 246.16 243.8 235.6 8.20 292 246.17 244.8 236.2 8.60 293 246.28 244.6 236.2 8.40 294 246.29 245.2 236.9 8.29 295 246.39 245.2 237.2 8 296 246.39 245.2 236.5 8.69 297 246.49 245.1 235.7 9.40 298 246.5 244.9 237.4 7.5 299 246.59 245.4 237.5 7.90 300 246.61 246 236.2 9.80