ray.rllib.utils.exploration.exploration.Exploration.before_compute_actions

Exploration.before_compute_actions(*, timestep: numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor | int | None = None, explore: numpy.array | jnp.ndarray | tf.Tensor | torch.Tensor | bool | None = None, tf_sess: tf.Session | None = None, **kwargs)

Hook for preparations that must run before policy.compute_actions() is called.

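Parameters:

timestep – An optional timestep tensor.
explore – An optional boolean flag indicating whether to explore.
tf_sess – The tf-session object to use.
**kwargs – Keyword arguments accepted for forward compatibility.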
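
The following is a minimal sketch of how such a hook might be overridden and invoked. It deliberately does not import Ray: `ExplorationStub`, `LoggingExploration`, and `compute_actions_with_hook` are hypothetical stand-ins that only mirror the keyword-only signature documented above, so the actual call order inside RLlib may differ.

```python
from typing import Any, List, Optional, Tuple


class ExplorationStub:
    """Stand-in for RLlib's Exploration base class (an assumption, NOT the real class)."""

    def before_compute_actions(
        self,
        *,
        timestep: Optional[int] = None,
        explore: Optional[bool] = None,
        tf_sess: Any = None,
        **kwargs: Any,
    ) -> None:
        # Default hook: no preparation needed.
        pass


class LoggingExploration(ExplorationStub):
    """Hypothetical subclass that records the arguments passed to the hook."""

    def __init__(self) -> None:
        self.calls: List[Tuple[Optional[int], Optional[bool]]] = []

    def before_compute_actions(
        self,
        *,
        timestep: Optional[int] = None,
        explore: Optional[bool] = None,
        tf_sess: Any = None,
        **kwargs: Any,
    ) -> None:
        # Unknown keyword arguments are accepted and ignored, which is
        # what keeps the override forward compatible.
        self.calls.append((timestep, explore))


def compute_actions_with_hook(
    exploration: ExplorationStub,
    obs_batch: List[Any],
    timestep: int,
    explore: bool,
) -> List[int]:
    """Hypothetical driver: run the hook first, then compute placeholder actions."""
    exploration.before_compute_actions(timestep=timestep, explore=explore)
    return [0 for _ in obs_batch]  # placeholder actions, one per observation
```

Because all parameters are keyword-only, callers must pass `timestep=` and `explore=` by name; an override that keeps `**kwargs` in its signature continues to work if additional arguments are passed later.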