This repository implements an environment for the Sum of Cubes problem and tests it under various conditions. The PPO (Proximal Policy Optimization) implementation is based on the Arena 2.3 PPO lecture's Jupyter notebook (https://arena3-chapter2-rl.streamlit.app/[2.3]_PPO).
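For context, the Sum of Cubes problem asks for integers x, y, z satisfying x³ + y³ + z³ = k for a given target k. A brute-force search over a small range illustrates the problem (this is a sketch for illustration only, not code from the repository):

```python
def find_cubes(k, bound=20):
    """Search for integers x, y, z in [-bound, bound] with x**3 + y**3 + z**3 == k."""
    for x in range(-bound, bound + 1):
        for y in range(-bound, bound + 1):
            for z in range(-bound, bound + 1):
                if x ** 3 + y ** 3 + z ** 3 == k:
                    return (x, y, z)
    return None  # no solution in this search range

# k = 29 has small solutions, e.g. 1**3 + 1**3 + 3**3 = 29
x, y, z = find_cubes(29)
```

Exhaustive search like this blows up quickly, which is what motivates treating the problem as an RL environment instead.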
- Install dependencies: `pip install -r requirements.txt`
- Configure the experiment using `PPOArgs`
- Select your target environment
- Start the training process: `python run.py`

The `PPOArgs` class supports the following parameters:
- `total_timesteps`: Total number of actions the experiment will take
- `num_envs`: Number of parallel environments to run
- `max_k`: Maximum value for k (where k = x³ + y³ + z³)
- `learning_rate`: Learning rate for the optimizer
- `gamma`: Discount factor for future rewards
- `gae_lambda`: Smoothing parameter for Generalized Advantage Estimation
- `clip_coef`: Clipping coefficient for the PPO objective
- `ent_coef`: Weight of the entropy bonus
- `vf_coef`: Weight of the value function loss
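A configuration built from these fields might look like the following sketch (the field names come from the list above; the default values shown here are illustrative, not the repository's actual defaults):

```python
from dataclasses import dataclass

@dataclass
class PPOArgs:
    # Illustrative defaults -- the repository's actual values may differ
    total_timesteps: int = 100_000  # total actions taken across training
    num_envs: int = 4               # parallel environments
    max_k: int = 100                # upper bound for k in k = x^3 + y^3 + z^3
    learning_rate: float = 2.5e-4
    gamma: float = 0.99             # discount factor
    gae_lambda: float = 0.95       # GAE smoothing parameter
    clip_coef: float = 0.2          # PPO clipping epsilon
    ent_coef: float = 0.01          # entropy bonus weight
    vf_coef: float = 0.5            # value function loss weight

# Override only the fields you want to experiment with
args = PPOArgs(total_timesteps=50_000, max_k=50)
```

Using a dataclass keeps every hyperparameter in one place, so an experiment is fully described by a single `PPOArgs` instance.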
Note: Different environments may require different parameter settings. You can adjust these in the `PPOArgs` class within `PPO.py`. You can also develop your own environment.