rcognita.scenarios.EpisodicScenarioREINFORCE
- class rcognita.scenarios.EpisodicScenarioREINFORCE(*args, learning_rate=0.001, is_fixed_actor_weights=False, is_plot_critic=False, **kwargs)
- __init__(*args, learning_rate=0.001, is_fixed_actor_weights=False, is_plot_critic=False, **kwargs)
Initialize self. See help(type(self)) for accurate signature.
Methods
__init__
(*args[, learning_rate, …])Initialize self.
get_mean
(array)get_mean_REINFORCE_gradient
()iteration_update
()memorize
()This is a decorator for a simulator step method.
perform_post_step_operations
()plot_critic
()reload_pipeline
()reset_episode
()reset_iteration
()reset_simulation
()run
()step
()store_REINFORCE_objective_gradient
()update_outcome
(observation, action, delta)Sample-to-sample accumulated (summed up or integrated) stage objective.
update_time_from_cache
()Attributes
cache