| Venture | Montezuma's Revenge | 
|---|---|
| ~ | New model for Montezuma | 
- Advantage Actor-Critic [1]
- Parallel Advantage Actor-Critic [2]
- Exploration by Random Network Distillation [3]
- Proximal Policy Optimization Algorithms [4]
 
- Python 3.6
- gym
- OpenCV Python
- PyTorch
- tensorboardX
 
Modify the parameters in `config.conf` as needed, then run `python train.py` to train and `python eval.py` to evaluate.
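For orientation, an INI-style file such as `config.conf` can be read with Python's standard `configparser` module. The sketch below is only an assumption about the file's shape: the section name and the keys `EnvID`, `NumWorkers`, `LearningRate`, and `UseGPU` are hypothetical and may not match the options this repository actually defines.

```python
import configparser

# Hypothetical contents; the real config.conf may use different keys.
SAMPLE = """
[DEFAULT]
EnvID = MontezumaRevengeNoFrameskip-v4
NumWorkers = 8
LearningRate = 0.0001
UseGPU = true
"""

parser = configparser.ConfigParser()
parser.read_string(SAMPLE)  # use parser.read("config.conf") for a file on disk
cfg = parser["DEFAULT"]

# Typed accessors convert the raw strings stored in the file.
env_id = cfg["EnvID"]
num_workers = cfg.getint("NumWorkers")
learning_rate = cfg.getfloat("LearningRate")
use_gpu = cfg.getboolean("UseGPU")

print(env_id, num_workers, learning_rate, use_gpu)
```

Option names are matched case-insensitively by `configparser`, so `EnvID` and `envid` refer to the same entry.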
- [1] Actor-Critic Algorithms
- [2] Efficient Parallel Methods for Deep Reinforcement Learning
- [3] Exploration by Random Network Distillation
- [4] Proximal Policy Optimization Algorithms



