Neuroevolution in EvoTorch¶
EvoTorch provides extensive native support for neuroevolution through the evotorch.neuroevolution sub-package and the NEProblem class. This page details the various features of this sub-package.
General Usage¶
To use neuroevolution, you can define a function that evaluates a single torch.nn.Module
instance and returns a scalar fitness.
Let's consider a simple fitness function: We wish to train a network that takes a vector of length 3 as input, and produces a scalar output, such that the sign of its output is the same as the sign of the sum of its inputs. To train this network, we choose a reward of +1 when the output has the correct sign, and -1 otherwise. We also choose a batch size of 32. The goal of our neuroevolution task is to maximise the average reward.
import torch
from evotorch.tools import dtype_of, device_of

def sign_prediction_score(network: torch.nn.Module):
    samples = torch.randn((32, 3), dtype=dtype_of(network), device=device_of(network))  # Generate 32 random gaussian vectors
    network_out = network(samples)  # Apply the network to the gaussian vectors
    sign_out = torch.sign(network_out[:, 0])  # Get the sign of the single output
    sign_sum = torch.sign(samples.sum(dim=-1))  # Get the sign of the sum of the inputs
    reward_gained = (sign_sum == sign_out).to(torch.float).sum()  # Number of times the network was correct
    reward_lost = (sign_sum != sign_out).to(torch.float).sum()  # Number of times the network was incorrect
    return (reward_gained - reward_lost) / 32  # Average reward over the batch of 32 samples
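As a quick, illustrative sanity check (a hypothetical usage, not part of the training loop), you can call this fitness function directly on an untrained network; since a random network's output sign is uncorrelated with the sign of the input sum, its score should hover around zero:

untrained_network = torch.nn.Linear(3, 1)
print(sign_prediction_score(untrained_network))  # roughly 0 for an untrained network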
A NEProblem class can then be instantiated according to: an objective_sense ('min' or 'max'), a network definition (a torch.nn.Module instance, a torch.nn.Module class, a function returning a torch.nn.Module instance, or a string representation of a network -- see below) and network_eval_func, our fitness function:
from evotorch.neuroevolution import NEProblem

sign_prediction_problem = NEProblem(
    objective_sense='max',  # The objective sense -- we wish to maximize the sign_prediction_score
    network=torch.nn.Linear(3, 1),  # The network is a Linear layer mapping 3 inputs to 1 output
    network_eval_func=sign_prediction_score,  # Networks will be evaluated according to sign_prediction_score
)
Applying PGPE to this problem with a default configuration
from evotorch.algorithms import PGPE
from evotorch.logging import PandasLogger

searcher = PGPE(
    sign_prediction_problem,
    popsize=50,
    radius_init=2.25,
    center_learning_rate=0.2,
    stdev_learning_rate=0.1,
)
logger = PandasLogger(searcher)
searcher.run(50)
should give some observable learning, as seen in the plot below.
(Plot: the mean evaluation improving over the 50 generations.)
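To reproduce such a plot, PandasLogger can convert its logged records to a pandas DataFrame (a sketch, assuming the mean_eval status column is among the logged items):

logger.to_dataframe()['mean_eval'].plot()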
You can directly access a parameterized version of the network using the parameterize_net function. For example, the below code will generate an instance of the Linear network we defined earlier, parameterized with the \(\mu\) vector learned by the PGPE instance searcher.
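A minimal sketch, assuming the learned \(\mu\) vector is exposed through searcher.status['center']:

trained_network = sign_prediction_problem.parameterize_net(searcher.status['center'])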
You can then use the trained network, for example by passing an input whose elements have a positive sum, or one whose elements have a negative sum (the inputs below are illustrative):
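positive_input = torch.tensor([1.0, 0.5, 2.0])  # sum is +3.5
print(trained_network(positive_input))  # a well-trained network outputs a positive value

negative_input = torch.tensor([-1.0, -0.5, -2.0])  # sum is -3.5
print(trained_network(negative_input))  # a well-trained network outputs a negative value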
Of course, you can also create a custom NEProblem instance by inheriting from that class. In this case, you simply need to override the _evaluate_network
function with your desired fitness function:
class CustomNE(NEProblem):
    ...

    def _evaluate_network(self, network: torch.nn.Module) -> torch.Tensor:
        samples = self.make_gaussian((32, 3))  # Generate random gaussian vectors
        network_out = network(samples)  # Apply the network to the gaussian vectors
        sign_out = torch.sign(network_out[:, 0])  # Get the sign of the single output
        sign_sum = torch.sign(samples.sum(dim=-1))  # Get the sign of the sum of the inputs
        reward_gained = (sign_sum == sign_out).to(torch.float).sum()  # Number of times the network was correct
        reward_lost = (sign_sum != sign_out).to(torch.float).sum()  # Number of times the network was incorrect
        return reward_gained - reward_lost
Neuroevolution at Scale¶
When using NEProblem classes without modifying the num_actors argument (i.e., without enabling Ray parallelization), the network used for evaluation will simply live on the same device as the problem's main device attribute. This means that instantiating the problem on a GPU, as sketched below, will mean that the network of the problem will, by default, live on the device cuda:0.
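A minimal sketch (assuming NEProblem forwards the device keyword argument to the underlying Problem class):

problem = NEProblem(
    objective_sense='max',
    network=torch.nn.Linear(3, 1),
    network_eval_func=sign_prediction_score,
    device='cuda:0',  # the problem's main device; the network follows it
)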
However, if you specify a positive num_actors value, Ray will by default block all of the actors from using any GPU, and therefore every actor's network will live on the device cpu. You can work around this using the num_gpus_per_actor keyword argument,
problem = NEProblem(
    ...
    num_actors=16,
    num_gpus_per_actor=(4 / 16),  # I have 4 GPUs and 16 CPUs, so I assign 0.25 GPUs per actor
)
which will specify a fraction of a device to assign to each ray actor. In this case, the actors' networks will live on the actors' assigned devices. You can manually intervene with this behaviour by overriding the property network_device. For example,
from evotorch.tools.misc import Device

class CustomNE(NEProblem):
    def __init__(self, ...):
        ...
        super().__init__(
            num_gpus_per_actor='all',
        )

    @property
    def network_device(self) -> Device:
        """The device on which the problem should place data e.g. the network"""
        return torch.device('cuda:3')
will cause each of the actors to attempt to move their networks to the device cuda:3. However, you should note that ray, by default, overrides the CUDA_VISIBLE_DEVICES environment variable so that each actor cannot see any GPUs unless the num_gpus flag is passed at instantiation to the ray actor. We provide a workaround to this through the num_gpus_per_actor='all' keyword argument that is passed to super().__init__, which specifies that all actors should be able to see all of the devices; the above overriding of network_device will then function as expected.
Using the num_actors and num_gpus_per_actor arguments and the network_device property, you can straightforwardly scale your neuroevolution experiments to hundreds of CPU cores and dozens of GPUs across ray clusters.
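For example, a hedged sketch of running on an existing ray cluster (the actor and GPU counts below are hypothetical):

import ray
ray.init(address="auto")  # connect to an already-running ray cluster

problem = NEProblem(
    objective_sense='max',
    network='Linear(3, 32) >> Tanh() >> Linear(32, 1)',  # string form, see the next section
    network_eval_func=sign_prediction_score,
    num_actors=200,  # spread evaluation across the cluster's CPU cores
    num_gpus_per_actor=(8 / 200),  # hypothetical: 8 GPUs shared across 200 actors
)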
Instantiating Networks¶
NEProblem supports a number of ways to instantiate networks. The first, as we have seen, is to pass a torch.nn.Module instance. In this case, the module will be copied to all actors when using num_actors > 1. However, we generally discourage this usage, as it means that the network will be moved to the CPU and copied to all actors, which can be a very expensive operation when working at scale. We provide a number of functionalities that allow automatic instantiation of the network within the actor so that no such copying is required.
The second option is to pass a string representation of the form 'Module1(...) >> Module2(...) >> ...', which will create a torch.nn.Sequential instance in which each named torch Module class is instantiated and applied sequentially. For example, calling
network_str = 'Linear(3, 32) >> Tanh() >> Linear(32, 1)'
problem = NEProblem(
    objective_sense='max',
    network=network_str,
    network_eval_func=sign_prediction_score,
)
will cause a parameterized version of the network to be a torch.nn.Sequential module consisting of: a torch.nn.Linear layer with 3 inputs and 32 outputs, a torch.nn.Tanh activation layer, and a torch.nn.Linear layer with 32 inputs and 1 output.
The third option is to pass to the NEProblem class a function which returns a torch.nn.Module instance. For example, we can define a function which returns the above network:
def my_network():
    return torch.nn.Sequential(
        torch.nn.Linear(3, 32),
        torch.nn.Tanh(),
        torch.nn.Linear(32, 1),  # 1 output, matching the network described above
    )

problem = NEProblem(
    objective_sense='max',
    network=my_network,
    network_eval_func=sign_prediction_score,
)
print(problem.parameterize_net(problem.make_zeros(problem.solution_length)))
As any class definition is callable through its __init__ method, this also means that you can directly pass a torch.nn.Module class, which will be instantiated. For example:
class MyNetwork(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.lin1 = torch.nn.Linear(3, 32)
        self.act = torch.nn.Tanh()
        self.lin2 = torch.nn.Linear(32, 1)

    def forward(self, data):
        return self.lin2(self.act(self.lin1(data)))

problem = NEProblem(
    objective_sense='max',
    network=MyNetwork,
    network_eval_func=sign_prediction_score,
)
print(problem.parameterize_net(problem.make_zeros(problem.solution_length)))
Dynamically Parameterizing Networks¶
The problem class itself can directly pass arguments to any of these instantiation methods, although for obvious reasons it cannot pass arguments to a torch.nn.Module instance that has already been constructed. For example, suppose that \(d\), the dimension of the input vector (fixed at 3 in our earlier example), is only known when the problem is created. By defining the _network_constants property, we can pass this information to the instantiation of the network:
class CustomNE(NEProblem):
    def __init__(self, network: torch.nn.Module, k: int, d: int):
        super().__init__(
            objective_sense='max',  # This is a maximization task
            network=network,  # Pass the network to the super constructor
        )
        self._k = k  # Store k
        self._d = d  # Store d

    @property
    def _network_constants(self):
        return {'d': self._d}  # Pass self._d as 'd' to networks at instantiation

    def _evaluate_network(self, network: torch.nn.Module):
        ...
which then allows, for example, the use of the variable d in the string representation of the network:
policy_str = 'Linear(d, 32) >> Tanh() >> Linear(32, 1)'
problem = CustomNE(
    network=policy_str,
    k=32,
    d=15,
)
such that it is directly passed to the instantiation of the network. You can observe this by printing the parameterized network, mirroring the earlier examples:
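# The first Linear layer should report in_features=15, matching d=15
print(problem.parameterize_net(problem.make_zeros(problem.solution_length)))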
Similarly, you can now use d as an argument in your function definition:
def my_network(d: int):
    return torch.nn.Sequential(
        torch.nn.Linear(d, 32),
        torch.nn.Tanh(),
        torch.nn.Linear(32, 1),
    )
or as an argument to your class definition:
class MyNetwork(torch.nn.Module):
    def __init__(self, d: int):
        super().__init__()
        self.lin1 = torch.nn.Linear(d, 32)
        self.act = torch.nn.Tanh()
        self.lin2 = torch.nn.Linear(32, 1)

    def forward(self, data):
        return self.lin2(self.act(self.lin1(data)))
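The class can then be passed to the problem in the same way as the string representation; a sketch reusing the CustomNE class defined above:

problem = CustomNE(
    network=MyNetwork,  # the class itself; d is injected via _network_constants
    k=32,
    d=15,
)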
You can pass any further arguments to the network instantiation using the network_args keyword argument, which expects a dictionary. For example, suppose our network expects an integer argument hidden_dim:
class MyNetwork(torch.nn.Module):
    def __init__(self, hidden_dim: int):
        super().__init__()
        self.lin1 = torch.nn.Linear(3, hidden_dim)
        self.act = torch.nn.Tanh()
        self.lin2 = torch.nn.Linear(hidden_dim, 1)

    def forward(self, data):
        return self.lin2(self.act(self.lin1(data)))
and then create a problem instance, passing a particular hidden_dim through network_args:
problem = NEProblem(
    objective_sense='max',
    network=MyNetwork,
    network_args={'hidden_dim': 17},
    network_eval_func=sign_prediction_score,
)
print(problem.parameterize_net(problem.make_zeros(problem.solution_length)))
will instantiate MyNetwork with the hidden_dim variable set to that value -- in this case, hidden_dim = 17.
Equivalently (as sketches based on the instantiation options described above), you can construct the instance yourself, bearing in mind the copying caveat for instances noted earlier:
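problem = NEProblem(
    objective_sense='max',
    network=MyNetwork(hidden_dim=17),  # an already-constructed instance
    network_eval_func=sign_prediction_score,
)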
or wrap the construction in a function:
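problem = NEProblem(
    objective_sense='max',
    network=lambda: MyNetwork(hidden_dim=17),  # a function returning the module
    network_eval_func=sign_prediction_score,
)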