Engineering Roadmap

See GitHub Projects for more.

Algorithms

Environments

  • build the Reacher and Push Unity envs
  • add OpenAI Gym Retro
  • update the Unity env and release
  • add GVG-AI

Saving

  • save the entire agent; make saving independent of the agent spec
  • enjoy mode: set tau for the Boltzmann policy
  • model saving: keep the best and the latest; checkpoint every 100 episodes, or every 500 for big games
  • turn the clock to the saved stage, only when resuming training
  • for enjoy mode, just load the whole agent at its end stage; the clock is restarted
  • resume training via model reload and clock turning
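
The saving items above could be sketched roughly as follows. This is only an illustration of the planned behavior, not the project's implementation: the function names, the JSON file layout (`latest.json`, `best.json`), and the shape of the `clock` dict are all hypothetical, and a real agent would serialize network weights rather than a plain dict.

```python
import json
import os


def should_checkpoint(episode, interval=100):
    """True on every `interval`-th episode (e.g. 100, or 500 for big games)."""
    return episode > 0 and episode % interval == 0


def save_checkpoint(ckpt_dir, agent_state, clock, reward, best_reward):
    """Always write the 'latest' checkpoint; also write 'best' if reward improved.

    Returns the (possibly updated) best reward seen so far.
    """
    os.makedirs(ckpt_dir, exist_ok=True)
    payload = {"agent_state": agent_state, "clock": clock, "reward": reward}
    names = ["latest.json"]
    if best_reward is None or reward > best_reward:
        best_reward = reward
        names.append("best.json")
    for name in names:
        with open(os.path.join(ckpt_dir, name), "w") as f:
            json.dump(payload, f)
    return best_reward


def load_checkpoint(ckpt_dir, name="latest.json", mode="train"):
    """Load a checkpoint.

    In 'train' mode the clock is turned to the saved stage (resume training).
    In any other mode (e.g. 'enjoy') the whole agent is loaded but the clock
    restarts from zero.
    """
    with open(os.path.join(ckpt_dir, name)) as f:
        payload = json.load(f)
    if mode == "train":
        clock = payload["clock"]
    else:
        clock = {key: 0 for key in payload["clock"]}
    return payload["agent_state"], clock
```

Keeping the clock inside the same payload as the agent state means a resume needs only one file, while enjoy mode can ignore the saved clock without any special-case file format.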

Misc

  • ability to resume from random search
  • use torchvision tools for images, replacing the util helpers
  • early termination on solved condition
  • TensorBoard
  • parameter noise from Baselines
  • long term: let memory keep a soft reference to the data space; make the data space accept arbitrary data
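
For the early-termination item above, one possible shape is a rolling-average check. The roadmap does not define the solved condition, so this sketch assumes it means "mean episode reward over the last N episodes reaches a threshold"; the class name `SolvedChecker` and its parameters are hypothetical.

```python
from collections import deque


class SolvedChecker:
    """Tracks recent episode rewards and reports when the env counts as solved.

    Assumption: 'solved' means the mean reward over a full window of the most
    recent episodes is at least `solved_reward`.
    """

    def __init__(self, solved_reward, window=100):
        self.solved_reward = solved_reward
        self.rewards = deque(maxlen=window)

    def update(self, episode_reward):
        """Record one episode reward; return True if training can stop early."""
        self.rewards.append(episode_reward)
        window_full = len(self.rewards) == self.rewards.maxlen
        mean_reward = sum(self.rewards) / len(self.rewards)
        return window_full and mean_reward >= self.solved_reward
```

A training loop would call `update` once per episode and break out when it returns True. Requiring a full window avoids stopping on a single lucky episode.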
