Darwin Neuroevolution Framework
|
The following example illustrates the key objects and evolution cycle:
Running this piece of code should produce something like this:
The rest of the tutorial will break down this example and explain the key parts.
In the Darwin Framework, all the experiment variations and runs are recorded in a universe database, so one of the first things we need to do is open or create a darwin.Universe instance:
A universe may be explicitly closed using universe.close()
, or using the with
statement:
IMPORTAT: Don't close the universe while there are active experiments using
it. This will likely result in a crash the next time the experiment will try to write to the database.
In order to setup an experiment, we need a Population, which abstracts a particular type of Evolutionary Algoritm, ex. Conventional Neuroevolution (CNE) or NEAT. A population is a fixed length collection of genotypes (solution recipes)
Here we're instantiating a 'neat' population type and set its size to 1000
genotypes:
available_populations()
returns a list with the names of all available population types:
Besides the size, each population type has a number of configuration knobs, exposed through the .config
attribute, which is a dictionary-like object. Each Population is created with a sensible set of defaults, although for real experiments we may want to tweak the values.
NOTE: Each Population type has its specific set of configuration knobs.
Finally, a Population, once initialized (experiment.initialize_population()
), can be indexed and iterated over (the genotypes in a population are always sorted in descending fitness order)
Domains abstract the problem space and provides the details on how to evaluate and assign fitness values to each genotype in a Population.
The set of available domains can be discovered using avaiable_domains()
:
Similar to Populations, each Domain type exposes a specific set of configuration knobs through the config
attribute.
Domains and Populations don't know anything about each other, which is how Darwin Framework can run NxM combinations with only N Domains + M Populations.
The Experiment object encapsulates a particular Domain, Population combination:
Experiments also have a .config
attribute, plus a .core_config
attribute with even more knobs common to most Populations.
experiment.trace
returns the trace of the current run, once the experiment is initialed (see darwin.Trace
)experiment.reset()
will terminate the current evolution run, allowing the configuration values to be changed again.The whole point of setting up the Population and the Domain and putting them together as an Experiment, is to search for solutions (models) using an Evolutionary Algorithm.
The general structure of an evolution loop is simple enough:
evaluate_population()
returns a darwin.GenerationSummary
object.evaluate_population()
must be called before create_next_generation()
)The configuration knobs for Populations, Domains and Experiments are dictionary-like objects of type darwin.PropertySet
:
str(property_set)
returns a string representation of all the values in a property set (useful to discover the set of knobs/values)dir(property_set)
returns a list of all the properties in a property setdarwin.PropertySet.to_json()
returns a JSON string representation of the property setdarwin.PropertySet.from_json(json_string)
will update the property set from the json_string
, which is a JSON string.darwin.GenerationSummary
records key values after a Population evaluation:
darwin.GenerationSummary.generation
darwin.GenerationSummary.best_fitness
darwin.GenerationSummary.median_fitness
darwin.GenerationSummary.worst_fitness
darwin.GenerationSummary.champion
- best genotype in the generationdarwin.GenerationSummary.calibration_fitness
- domain specific scores against reference baselines (ex. a random agent)darwin.Trace
is a recording of all the generations in the current experiment, once started. It can be indexed and iterated over, returning the GenerationSummary
for all completed generations.