REINFORCE Benchmark

Proposed by Williams in 1992, REINFORCE is one of the oldest algorithms, but is still popular. For example, it is used in Google's AlphaGo and its descendants.

TODO clear below; link original paper

Name: Reinforce Benchmark

Date completed: 2018_02_04_012905, ongoing

Description: Standard Benchmark on Reinforce

Hypotheses: N/A

Prerequisites: N/A

Algorithms: Reinforce

Environments: CartPole-v0, Acrobot-v1

Specs: ("reinforce.json", "reinforce_benchmark")

Running instructions:

{
  "reinforce.json": {
    "reinforce_benchmark": "benchmark"
  }
}

Commit: c4538fc9c6e6cd5f1fb91ba742d95225ca4ad4a1

Results summary:

data: Reinforce_CartPole-v0_2018_02_04_012905

results matching ""

    No results matching ""