Appendixes
Default Values
Parameter | Default value
Population size | 100
Maximum iterations | 1000
Maximum no-change iterations | 100
Max. population homogeneity before stopping | 95
Number of new strings to generate | 10% of population size (10)
Probability of crossover | 0.85
Uniform crossover bias | 0.6
Mutation probability | 1/L
Real mutation constant | 0.1 or 0.01
Integer mutation constant | 1
Mutation range bounded |
Probabilistic binary tournament parameter | 0.6
Use restart operator |
Restart frequency | 50
Restart allele mutation rate | 0.5
Allow no duplicate strings |
Multiplier for minimization problems | 1.01
Parameter MAX in fitness by ranking | 1.2
Frequency of statistics printing | 10
Print strings |
Print offline statistics |
Print online statistics |
Print best string |
Print worst string |
Print genetic distance |
Randomly initialize population |
Probability of initializing a bit to one | 0.5
How to initialize real strings | Range
Real initialization range | \([0,1]\)
How to initialize integer strings | Permutation
Integer initialization range | \([0,L-1]\)
Seed random number with clock |
Default MPI communicator |
DE scale factor \(F\) | 0.9
DE auxiliary factor \(K\) | \(0.5 \cdot (F + 1)\)
DE crossover probability \(Cr\) | 0.9
DE dither | 0.0
DE jitter | 0.0
DE either/or probability | 0.5
DE number of differences | 1
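Most of these defaults can be overridden with the corresponding PGASet calls between PGACreate() and PGASetUp(). A minimal sketch, assuming the setter names PGASetPopSize(), PGASetMaxGAIterValue(), PGASetCrossoverProb(), and PGASetMutationProb() (see the function-group sections for the authoritative list):

#include "pgapack.h"

double evaluate (PGAContext *ctx, int p, int pop, double *aux);

int main (int argc, char **argv)
{
    PGAContext *ctx;

    ctx = PGACreate (&argc, argv, PGA_DATATYPE_BINARY, 30, PGA_MAXIMIZE);
    /* Override a few defaults before PGASetUp fills in the rest */
    PGASetPopSize        (ctx, 200);   /* default: 100  */
    PGASetMaxGAIterValue (ctx, 2000);  /* default: 1000 */
    PGASetCrossoverProb  (ctx, 0.9);   /* default: 0.85 */
    PGASetMutationProb   (ctx, 0.05);  /* default: 1/L  */
    PGASetUp (ctx);
    PGARun (ctx, evaluate);
    PGADestroy (ctx);
    return 0;
}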
Function Bindings
Symbolic Constants
PGAPack defines many symbolic constants that are used as arguments to PGAPack functions. These constants are the same for both Fortran and C and are listed in section Constant Definitions. The constant groups also define the default values.
C Bindings
See Function Groups Standard API for the functions of the top-level API using PGARun(). For explicit usage, consult Function Groups Explicit Usage; the function calls of the internal implementation are documented in Function Groups Internal Implementation.
Fortran 77 Bindings
Use the rules defined in Chapter Fortran Interface (and the machine-specific idiosyncrasies noted in Appendix Machine Idiosyncrasies) to determine the Fortran bindings.
Parallelism Background
Parallel Computer Taxonomy
Traditionally, parallel computers are classified according to Flynn’s taxonomy [Fly72]. Flynn’s classification distinguishes parallel computers according to the number of instruction streams and data operands being computed on simultaneously.
Flynn’s single-instruction single-data (SISD) model is the traditional sequential computer. A single program counter fetches instructions from memory. The instructions are executed on scalar operands. There is no parallelism in this model.
In the single-instruction multiple-data (SIMD) model there is again a single program counter fetching instructions from memory. However, now the operands of the instructions can be one of two types: either scalar or array. If the instruction calls for execution involving only scalar operands, it is executed by the control processor (i.e., the central processing unit fetching instructions from memory). If, on the other hand, the instruction calls for execution using array operands, it is broadcast to the array of processing elements. The processing elements are separate computing devices that rely upon the control processor to determine the instructions they will execute.
In a multiple-instruction multiple-data (MIMD) computer there exist multiple processors each of which has its own program counter. Processors execute independently of each other according to whatever instruction the program counter points to next. MIMD computers are usually further subdivided according to whether the processors share memory or each has its own memory.
In a shared-memory MIMD computer both the program’s instructions and the part of the program’s data to be shared exist within a single shared memory. Additionally, some data may be private to a processor and not be globally accessible by other processors. The processors execute asynchronously of each other. Communication and synchronization between the processors are handled by having them each read or write a shared-memory location.
A distributed-memory MIMD computer consists of multiple “nodes.” A node consists of a processor, its own memory, a network interface, and sometimes a local disk. The program instructions and data reside in the node’s memory. The nodes are connected via some type of network that allows them to communicate with each other. Parallelism is achieved by having each processor compute simultaneously on the data in its own memory. Communication and synchronization are handled by passing of messages (a destination node address and the local data to be sent) over the interconnection network.
Processes vs. Processors
We distinguish the two terms process and processor. A process is a software abstraction with a unique address space that can be scheduled by the operating system. A processor is the physical computer hardware on which computations take place.
On MIMD parallel computers, usually one process executes on each processor (although this is not required). On a uniprocessor, multiple processes timeshare the single processor.
Message-Passing Programming Model
In the message-passing programming model multiple processes communicate by passing messages—transferring data from the address space of one process into the address space of another process. When a process needs data stored in the memory of another process, the data must be sent from the process that “owns” it, to the memory of the process that needs it.
The message-passing programming model is currently one of the most popular. One reason for the popularity is portability. Message passing is the natural programming model on distributed-memory MIMD computers. Each process is naturally mapped to one of the machine’s nodes. A similar implementation is common on a workstation network where one process runs on each workstation. On a shared-memory MIMD computer multiple processes can emulate message passing by communicating only via logical message queues—areas of shared memory partitioned by process. On a uniprocessor the multiple processes that timeshare the physical processor can also emulate the idea of logical message queues for their communication.
One example of the message-passing programming model is the controller/responder model. In this model a controller process distributes work (computation to be performed) to the responder processes. The responders perform the work and return the result to the controller. In many implementations the controller plays a bookkeeping role only and does not perform any computation.
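As an illustration of the controller/responder model (independent of PGAPack), the following minimal MPI sketch lets process 0 hand one work item to every other process and collect the results; the "work" here is just squaring a number:

#include <stdio.h>
#include <mpi.h>

int main (int argc, char **argv)
{
    int rank, size, i;
    double work, result;

    MPI_Init (&argc, &argv);
    MPI_Comm_rank (MPI_COMM_WORLD, &rank);
    MPI_Comm_size (MPI_COMM_WORLD, &size);
    if (rank == 0) {
        /* Controller: distribute work, then collect the results */
        for (i = 1; i < size; i++) {
            work = (double)i;
            MPI_Send (&work, 1, MPI_DOUBLE, i, 0, MPI_COMM_WORLD);
        }
        for (i = 1; i < size; i++) {
            MPI_Recv (&result, 1, MPI_DOUBLE, i, 0, MPI_COMM_WORLD,
                      MPI_STATUS_IGNORE);
            printf ("Result from responder %d: %g\n", i, result);
        }
    } else {
        /* Responder: receive work, compute, return the result */
        MPI_Recv (&work, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD,
                  MPI_STATUS_IGNORE);
        result = work * work;
        MPI_Send (&result, 1, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD);
    }
    MPI_Finalize ();
    return 0;
}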
Parallel Genetic Algorithms
When using the term “parallel genetic algorithm” it is important to distinguish between parallel models, their (parallel or sequential) implementation, and the computer hardware.
Models
A sequential GA model (more accurately called a global model) has a single population and no restrictions (partitioning) upon which strings recombine with which. The sequential GA is the traditional GA model given in the literature. In a parallel GA model there are either multiple populations (an island model) or a partitioning of a single population (often called a fine-grained model).
Implementations
Both parallel and sequential GA models can have parallel or sequential implementations. A sequential implementation of the global model is the traditional approach discussed in the GA literature. One process, running on a uniprocessor (e.g., a PC or workstation), performs all the calculations. In a parallel implementation of the global model the steps of the GA (some or all of selection, crossover, mutation, and fitness calculation) are executed simultaneously by multiple processes running on a parallel computer or workstation network.
In a sequential implementation of a parallel GA model, multiple processes, each responsible for a subpopulation or partition of the full population, time share the processor of the uniprocessor they are running on. In a parallel implementation of a parallel GA model, the multiple processes each run on a unique processor of a parallel computer or workstation network.
MPI
MPI (Message Passing Interface) is a specification of a message-passing library for parallel computers and workstation networks—it defines a set of functions and their behavior. The actual implementation of this interface is left up to vendors and researchers to develop. At the time of this writing several implementations of MPI, both proprietary and freely available, exist. MPI was designed by a large group of parallel computer vendors, computer researchers, and application developers as a standard for message passing.
Communicators
Almost all MPI functions require a communicator. If MPI routines are called directly, the user must supply a communicator. Also, if any of PGAPack’s parallel routines other than PGARun() are used, the user must supply a communicator as well.
A communicator combines the notions of context and group. A context is an extension of the notion of a “tag” used in many other message-passing systems to identify a message. Contexts differ from tags in that they are allocated by the system, not the user, and that no wild-card matching among contexts is allowed. A group contains \(n\) processes whose ranks are the integers \(0,\ldots,n-1\). Processes may belong to more than one group and have a unique rank within each.
Any MPI implementation will always supply the default communicator MPI_COMM_WORLD. This communicator contains all processes that were created when MPI was initialized. For most users this is all they have to know about communicators: simply supply MPI_COMM_WORLD whenever a communicator is required as an argument. For more sophisticated use, users are referred to [MPI94], [GLS94], [MPI21].
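For a program that is content with the default, it is enough to pass MPI_COMM_WORLD wherever a communicator argument is required. A minimal sketch follows; PGASetCommunicator() is assumed here as the call that stores the communicator in the context variable (consult the function-group sections for the exact name):

#include "pgapack.h"

double evaluate (PGAContext *ctx, int p, int pop, double *aux);

int main (int argc, char **argv)
{
    PGAContext *ctx;

    MPI_Init (&argc, &argv);
    ctx = PGACreate (&argc, argv, PGA_DATATYPE_BINARY, 30, PGA_MAXIMIZE);
    /* Assumption: PGASetCommunicator selects the communicator used by
     * PGAPack's parallel routines; MPI_COMM_WORLD contains all processes. */
    PGASetCommunicator (ctx, MPI_COMM_WORLD);
    PGASetUp (ctx);
    PGARun (ctx, evaluate);
    PGADestroy (ctx);
    MPI_Finalize ();
    return 0;
}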
Parallel I/O
The issue of parallel I/O is independent of PGAPack. However, since we assume many PGAPack users will wish to do I/O, we address this topic. A primary consideration is whether one process or all processes do I/O. Consider the following two code fragments, keeping in mind that they are being executed simultaneously by multiple processes:
ctx = PGACreate (&argc, argv, PGA_DATATYPE_BINARY, 30, PGA_MINIMIZE);
and
int len;
scanf ("%d",&len);
ctx = PGACreate (&argc, argv, PGA_DATATYPE_BINARY, len, PGA_MINIMIZE);
In the first case, all processes will receive the value of 30 for the
string length since it is a constant. In the second case, however, the
value of the string length is determined at run time. Whether one or all
processes execute the scanf
function is undefined in MPI and depends
on the particular parallel computing environment. In PGAPack we require
that all processes have the same values for all fields in the context
variable. Since some of these fields may be set by using values
specified at run time, we suggest that your I/O that reads in
PGAPack parameters be done as follows:
#include "pgapack.h"
double evaluate (PGAContext *ctx, int p, int pop, double *aux);
int main( int argc, char **argv )
{
PGAContext *ctx;
int myid, len;
MPI_Init(&argc, &argv);
MPI_Comm_rank(MPI_COMM_WORLD, &myid);
if (myid == 0) { /* Process 0 has a dialog */
printf("String length? "); /* with the user and */
scanf("%d", &len); /* broadcasts the user's */
}
MPI_Bcast(&len, 1, MPI_INT, 0, MPI_COMM_WORLD);
ctx = PGACreate(&argc, argv, PGA_DATATYPE_BINARY, len, PGA_MAXIMIZE);
PGASetUp(ctx);
PGARun(ctx, evaluate);
PGADestroy(ctx);
MPI_Finalize();
return(0);
}
The key point is that only process 0 (as determined by MPI_Comm_rank) performs I/O; the value of len is then broadcast (using MPI_Bcast) to the other processes.
Machine Idiosyncrasies
Data Type Sizes
PGAPack is written entirely in ANSI C. However, because it is callable from Fortran, and no standards exist for interlanguage communication, problems may arise. These have to do with a lack of consistency in the size of data types between the two languages.
On all machines we have tested, an integer declaration in Fortran is the same size as an int declaration in C, and everything works properly. For floating-point numbers, however, we have found at least one inconsistency. The requirement is for the Fortran floating-point number to be the same size as a C double. On most machines a Fortran double precision declaration is the equivalent size.
Since Fortran does not support pointers, an integer variable is used to hold the address of the context variable (and possibly MPI communicator addresses as well). Therefore, a Fortran integer must be “large enough” to hold an address on the machine. For all 32-bit address space machines we have tested this is the case. On machines with a 64-bit address space, however, this may not be true. Therefore we use constructs in Fortran to select an integer data type that is large enough; see chapter Fortran Interface for details.
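A quick sanity check on a new machine is to print the relevant C sizes: the Fortran floating-point type must match a C double, and the Fortran integer used for context addresses must be at least as wide as a C pointer. A minimal check program:

#include <stdio.h>

int main (void)
{
    /* Compare these against the sizes of the Fortran types actually used:
     * integer vs. int, double precision vs. double, and the integer kind
     * holding the context address vs. void *                              */
    printf ("sizeof(int)    = %lu\n", (unsigned long) sizeof (int));
    printf ("sizeof(double) = %lu\n", (unsigned long) sizeof (double));
    printf ("sizeof(void *) = %lu\n", (unsigned long) sizeof (void *));
    return 0;
}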
Startup
The MPI standard provides for source code portability. However, the MPI standard does not specify how an MPI program shall be started or how the number of processes in the computation is specified. These will vary according to the computer being used and the choice of MPI implementation. This section used to have documentation about a lot of machines that no longer exist today. We refer you to the documentation of OpenMPI [OMPI23] or MPICH [MPIC23] or the documentation of whatever MPI implementation you are using.
Common Problems
This section collects some problems seen over the years. Some may be specific to MPI versions or variants that are no longer in use; since it is hard to know what is still relevant, all information has been left in.
- When reading input values to be used as parameters in PGASet calls, the PGASet calls themselves may not be executed until after PGACreate() has been called.
- In C, when reading input parameters of type double, the scanf conversion specification should be of the form %lf, not %f, which is appropriate for floats. (A sketch illustrating these first two points follows this list.)
- An infinite loop can occur if the number of permutations of the bit string is less than the population size. For example, for a binary-valued string of length four, there are \(2^4 = 16\) possibilities. If the population size is greater than 16, and duplicate strings are not allowed in the population, an infinite loop will occur.
- Erroneous results can occur if the name of a user’s function conflicts with a library function used by PGAPack. For example, if a program defined its own ceil function, this would conflict with the C math library function of the same name.
- All floating-point constants and variables used in PGAPack are of type double. Particularly from Fortran, the user should be careful to pass a double precision constant or variable.
- PGACreate() removes command-line arguments. One consequence is that if PGACreate() is called twice in the same program (unusual, but legal), the second PGACreate() call will not receive the command-line arguments.
- If mpi.h (or mpif.h) is included when it should not be, errors will result, as well as warnings about redefining macros and typedefs. This usually happens when a sequential version of PGAPack is used (with “fake” MPI stub routines and definitions) and the user’s program explicitly includes the “real” mpi.h or mpif.h header files.
- If mpi.h (or mpif.h) is not included when it should be (such as when calling MPI functions directly), errors may result. Since pgapack.h includes mpi.h, this should not happen in C. The Fortran include file, pgapackf.h, however, does not include mpif.h. The user must explicitly include it in every subroutine and function that makes MPI calls. Not including mpif.h could result in any of several different errors, including:
  - syntax errors when compiling (for example, MPI_COMM_WORLD being undefined)
  - general errors in the computed results
  - the program crashing when it calls the undefined subroutine MPI_Init
  - general MPI errors such as:
    0 - Error in MPI_COMM_RANK : Invalid communicator [0] Aborting program!
  We have also seen the following error from not including mpif.h in the main program:
    PGACreate: Invalid value of datatype: 0 PGAError: Fatal
- If the ch_p4 device in MPICH is used to run on workstations, one must have a correct processor group file (procgroup). The error message
    (ptera-36%)a.out p0_18429: p4_error: open error on procgroup file (procgroup): 0 (ptera-37%)
  may occur if the processor group file is not specified correctly. See the MPICH users guide for more details.
- A common error with the procgroup file when using the ch_p4 device in MPICH is to have an incorrect path to the executable.
- When compiling the examples directory we have seen “multiply defined” error messages. For example:
    Making C examples
    Compiling classic
    ld: /usr/local/mpi/lib/sun4/ch_p4/libmpi.a(initialize.o): _MPI_Initialized: multiply defined
    collect2: ld returned 2 exit status
  We have seen this error occur when a sequential version of PGAPack was built and the library (./lib/arch/libpgag.a or ./lib/arch/libpgaO.a) was not deleted before attempting to build a new, parallel version of PGAPack. The “fake” MPI stub routines are in the sequential library and have name conflicts when a “real” MPI library is referenced. The solution is to delete the old .a file and rerun make install. The Makefile target clobber takes care of deleting all existing libraries: make clobber
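As an illustration of the first two points in the list above, the following sketch reads a double with %lf and performs the PGASet call only after PGACreate(); PGASetCrossoverProb() is used here merely as an example setter:

#include <stdio.h>
#include "pgapack.h"

double evaluate (PGAContext *ctx, int p, int pop, double *aux);

int main (int argc, char **argv)
{
    PGAContext *ctx;
    double pc;

    /* %lf, not %f, because pc is a double.  In a parallel run, read on
     * process 0 only and broadcast the value as shown in Parallel I/O. */
    printf ("Crossover probability? ");
    scanf ("%lf", &pc);

    ctx = PGACreate (&argc, argv, PGA_DATATYPE_BINARY, 30, PGA_MAXIMIZE);
    /* PGASet calls must come after PGACreate */
    PGASetCrossoverProb (ctx, pc);
    PGASetUp (ctx);
    PGARun (ctx, evaluate);
    PGADestroy (ctx);
    return 0;
}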