Hyperparameter tuning for GraphSAGE

Aggregation functions comparison

Below is a comparison of runs with different aggregation functions. The functions compared were:

  • SumAggregation

  • MeanAggregation

  • SquareRootAggregation

  • LSTMAggregation

Please see the GraphSAGE notebook for more details on aggregation functions.
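As a rough illustration of how the comparison was set up, the sketch below swaps different aggregators into an otherwise identical two-layer GraphSAGE model. It assumes PyTorch Geometric, whose built-in SumAggregation, MeanAggregation and LSTMAggregation modules match the names listed above (the square-root aggregator is not a PyG built-in and is omitted here); the layer sizes and aggregator strings are illustrative, not the exact configuration used in these runs.

```python
import torch
from torch_geometric.nn import SAGEConv


class SAGE(torch.nn.Module):
    """Two-layer GraphSAGE in which only the aggregation function changes."""

    def __init__(self, in_channels, hidden_channels, out_channels, aggr="mean"):
        super().__init__()
        self.conv1 = SAGEConv(in_channels, hidden_channels, aggr=aggr)
        self.conv2 = SAGEConv(hidden_channels, out_channels, aggr=aggr)

    def forward(self, x, edge_index):
        x = self.conv1(x, edge_index).relu()
        return self.conv2(x, edge_index)


# One model per aggregator; "sum", "mean" and "lstm" resolve to PyG's
# SumAggregation, MeanAggregation and LSTMAggregation respectively.
# Note: the LSTM aggregator expects neighbours to be grouped contiguously,
# i.e. a sorted edge_index (see torch_geometric.utils.sort_edge_index).
models = {aggr: SAGE(128, 64, 7, aggr=aggr) for aggr in ("sum", "mean", "lstm")}
```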

The LSTM aggregation function was found to perform best; further tuning was therefore carried out on this aggregation function, as described below.

LSTM Aggregation fine-tuning

Below is a comparison of runs with different LSTM aggregation hyperparameters.

Dropout, learning rate and batch size were the main hyperparameters tuned. Increasing the batch size and the number of epochs improved performance on the test set, while varying the layer configuration or increasing dropout did not. Although the model quickly fit the full training set, training for additional epochs still improved test-set performance, indicating meaningful learning rather than overfitting.
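As an illustration of how such a sweep can be organised, the sketch below enumerates a small grid over the tuned hyperparameters. The value ranges and the commented-out train_and_evaluate helper are placeholders, not the actual search space or training code used in these runs.

```python
from itertools import product

# Illustrative search space; the real runs used their own value ranges.
search_space = {
    "dropout": [0.0, 0.2, 0.5],
    "learning_rate": [1e-3, 5e-3],
    "batch_size": [64, 256, 1024],
    "epochs": [50, 200],
}

results = []
for values in product(*search_space.values()):
    config = dict(zip(search_space, values))
    # train_and_evaluate is a placeholder for building the LSTM-aggregation
    # model, training it with this config, and returning the test-set score:
    # score = train_and_evaluate(**config)
    # results.append((config, score))
    print(config)
```

Each configuration then corresponds to one tracked run, so the comparison above amounts to logging the test-set metric per configuration and ranking the results.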

Achieving the highest performance with this model does, however, require a comparatively large number of epochs; we will see whether integrating edge features can speed up learning.