proximal-policy-optimization

Here is 1 public repository matching this topic...

omerjakoby / MARIO-RL-PPO

This repository implements a Proximal Policy Optimization (PPO) agent that learns to play Super Mario Bros using TensorFlow/Keras and OpenAI Gym. Features CNNs for vision, Actor-Critic architecture, and parallel environments. Train your own Mario master or run a pre-trained one!

machine-learning tensorflow keras openai-gym cnn actor-critic mario-game proximal-policy-optimization ppo reinforcement-learning-agent ppo-algorithm

Updated Jun 1, 2025
PureBasic

Improve this page

Add a description, image, and links to the proximal-policy-optimization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the proximal-policy-optimization topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

proximal-policy-optimization

Here is 1 public repository matching this topic...

omerjakoby / MARIO-RL-PPO

Improve this page

Add this topic to your repo