From 9a679538d7bad0e5d30cc185c0844328815b8c8a Mon Sep 17 00:00:00 2001
From: carschandler <92899389+carschandler@users.noreply.github.com>
Date: Mon, 11 Nov 2024 20:07:17 -0600
Subject: [PATCH] Formatting error create_custom_env.md

---
 docs/introduction/create_custom_env.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/docs/introduction/create_custom_env.md b/docs/introduction/create_custom_env.md
index c8c6ff15d..b0fac6725 100644
--- a/docs/introduction/create_custom_env.md
+++ b/docs/introduction/create_custom_env.md
@@ -146,7 +146,7 @@ For our environment, several things need to happen during the step function:
 
  - We use the self._action_to_direction to convert the discrete action (e.g., 2) to a grid direction with our agent location. To prevent the agent from going out of bounds of the grid, we clip the agent's location to stay within bounds.
  - We compute the agent's reward by checking if the agent's current position is equal to the target's location.
- - Since the environment doesn't truncate internally (we can apply a time limit wrapper to the environment during :meth:make), we permanently set truncated to False.
+ - Since the environment doesn't truncate internally (we can apply a time limit wrapper to the environment during :meth:`make`), we permanently set truncated to False.
  - We once again use _get_obs and _get_info to obtain the agent's observation and auxiliary information.
 
 ```
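
Reviewer note: the four bullets in the hunk context describe the step function, so a minimal sketch of that logic may help when checking the wording. This is not the tutorial's code. It assumes a dependency-free stand-in class (the hypothetical `GridWorldSketch`) in place of a `gymnasium.Env` subclass, plain tuples and `min`/`max` in place of NumPy clipping, an assumed action-to-direction mapping, an assumed 1/0 reward, and an assumed `distance` entry in the info dict. Only `_action_to_direction`, `_get_obs`, `_get_info`, and `truncated` are names taken from the documented text.

```python
class GridWorldSketch:
    """Hypothetical stand-in for the documented environment (not a gymnasium.Env)."""

    def __init__(self, size=5, agent=(0, 0), target=(4, 4)):
        self.size = size
        self._agent_location = agent
        self._target_location = target
        # Assumed mapping from a discrete action to an (dx, dy) grid direction.
        self._action_to_direction = {
            0: (1, 0),   # right
            1: (0, 1),   # up
            2: (-1, 0),  # left
            3: (0, -1),  # down
        }

    def _get_obs(self):
        return {"agent": self._agent_location, "target": self._target_location}

    def _get_info(self):
        # Assumed auxiliary information: Manhattan distance to the target.
        ax, ay = self._agent_location
        tx, ty = self._target_location
        return {"distance": abs(ax - tx) + abs(ay - ty)}

    def step(self, action):
        # Convert the discrete action to a direction and move the agent,
        # clipping each coordinate so the agent stays inside the grid.
        dx, dy = self._action_to_direction[action]
        x, y = self._agent_location
        self._agent_location = (
            min(max(x + dx, 0), self.size - 1),
            min(max(y + dy, 0), self.size - 1),
        )

        # Reward depends on whether the agent now sits on the target.
        terminated = self._agent_location == self._target_location
        reward = 1 if terminated else 0

        # The environment never truncates internally; a time limit
        # wrapper applied from outside would handle truncation.
        truncated = False

        return self._get_obs(), reward, terminated, truncated, self._get_info()
```

Calling `step` returns the five-element tuple `(observation, reward, terminated, truncated, info)`, with `truncated` always `False`, which is the behaviour the corrected bullet describes.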