Chapter 7.2 n-step Sarsa

5.6 μs
63.3 s
env
# RandomWalk1D

## Traits

| Trait Type        |                                          Value |
|:----------------- | ----------------------------------------------:|
| NumAgentStyle     |        ReinforcementLearningBase.SingleAgent() |
| DynamicStyle      |         ReinforcementLearningBase.Sequential() |
| InformationStyle  | ReinforcementLearningBase.PerfectInformation() |
| ChanceStyle       |      ReinforcementLearningBase.Deterministic() |
| RewardStyle       |     ReinforcementLearningBase.TerminalReward() |
| UtilityStyle      |         ReinforcementLearningBase.GeneralSum() |
| ActionStyle       |   ReinforcementLearningBase.MinimalActionSet() |
| StateStyle        | ReinforcementLearningBase.Observation{Int64}() |
| DefaultStateStyle | ReinforcementLearningBase.Observation{Int64}() |

## Is Environment Terminated?

No

## State Space

`Base.OneTo(21)`

## Action Space

`Base.OneTo(2)`

## Current State

```
11
```
18.5 ms
395 ns
true_values
-1.0:0.1:1.0
2.8 μs

Again, we first define a hook to calculate RMS

3.7 μs
1.6 ms
151 μs
run_once (generic function with 1 method)
53.9 μs
105 s