Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model and training upgrades #10

Merged
merged 18 commits into from
Jan 11, 2025
Merged

Model and training upgrades #10

merged 18 commits into from
Jan 11, 2025

Conversation

msosav
Copy link
Owner

@msosav msosav commented Jan 11, 2025

This pull request includes several significant updates to the project, focusing on adding a GitHub Actions workflow for formatting, enhancing the gym.py configuration, updating memory address handling, and improving the preprocessing environment setup.

GitHub Actions Workflow:

Configuration Enhancements:

  • config/gym.py: Introduced new configuration parameters (action_freq, exploration_reward, reward_scale, game_with_sound) and updated the __init__ method to use these parameters. Added new valid and release actions, and modified the observation space to include additional game state information.
  • config/gym.py: Refactored the step method to use a new run_action method, which handles sending input and ticking the emulator.

Memory Address Updates:

  • config/memory_addresses.py: Added detailed comments and new memory addresses for various game states, such as destination data, inventory items, dungeon item flags, and rupees.

Preprocessing Environment:

  • utils.py: Replaced the grayscale observation wrapper with a custom DictGrayScaleObservation class to handle dictionary-based observation spaces.

Model Training Configuration:

  • main.py: Updated the PPO model configuration for training, including changes to the policy type, learning rate, and other hyperparameters.

@msosav msosav merged commit 98cd7dc into main Jan 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant