Predicting high order Reduced Order Model coefficients for proton therapy dose approximations using deep learning

More Info
expand_more

Abstract

The advantage of using protons to irradiate a tumour in cancer treatment, is that the energy can be delivered very precisely to the tumour without irradiating much of the surrounding tissue. The disadvantage of this is that small displacements of a patient can result in large deviations in the planned dose delivery according to the treatment plan. Therefore, many different dose distributions corresponding to different patient displacements have to be calculated to set up a robust treatment plan, which is computationally expensive. The solution to this problem is to use a combination of reduced order modelling and deep learning to perform computationally cheaper approximations of dose distributions corresponding to patient displacements. In previous research it has been shown that shallow and narrow neural networks were unable to predict the higher order modes of the reduced order model, which are required for the desired accuracy. Therefore, in this research it was studied how a feedforward neural network’s performance in predicting the higher order expansion coefficients of a reduced order model and the dose distributions depend on the neural network architecture and training approaches. \par First, a dose distribution was obtained by setting up a liver tumour patient treatment plan using treatment planning software matRad. Afterwards, dose distributions corresponding to simulated 1D, 2D and 3D displacements were generated using the optimized treatment plan. The amount of training samples was set to 1000, 2000 and 3000 and the amount of test samples was set to 200, 200 and 300 for the 1D, 2D and 3D case respectively. By using the singular value decomposition on the generated dose distributions of the training set, a reduced order model could be extracted and provided the expansion coefficients that represent the behaviour of the modes within the reduced order model. Afterwards, the expansion coefficients were inspected as a function of displacement to determine which mode showed strong non-linear behaviour. This mode was called the target mode or high order mode and all modes up until the target mode were included in the reduced basis that was used to approximate the dose distributions. Mode 18, 24 and 51 were chosen as the target modes for the 1D, 2D and 3D case respectively. Subsequently, the target accuracies of the dose predictions within the test set in terms of the average voxel fraction acceptances were calculated using a threshold of 1% difference between the true voxel dose and approximated dose, relative to the maximum dose of the true dose distribution. The target accuracies of the dose predictions within the test set were 99.86%, 99.64% and 99.46% for the 1D, 2D and 3D case respectively. In experiment 1, neural networks with varying amounts of hidden layers, neurons per hidden layer and loss functions were trained on the full set of training samples and, once trained, were used to predict the modes and dose distributions using the displacements from the test set. The amount of hidden layers and neurons per hidden layer were set to: 2, 3, 4 and 5 hidden layers with 40, 80, 120 and 160 neurons per hidden layer for the 1D-case; 2, 4 and 6 hidden layers with 100, 200, 300 and 400 neurons per hidden layer for the 2D-case; 2, 4 and 6 hidden layers with 150, 300, 450 and 600 neurons per hidden layer for the 3D-case. The chosen loss functions were the mean squared error and mean absolute error functions. Each hidden layer used the Rectified Linear Unit activation function and the Adamax optimizer was used. The highest achieved accuracies were 99.79%, 99.50% and 98.84% for the 1D, 2D and 3D case respectively. Experiment 2 was similar to experiment 1, but the networks for the 1D and 2D case were trained with half the amount of training samples. The neural networks of experiment 1 and 2 were trained with the displacements as input variables and all the corresponding expansion coefficients of the modes up until the target mode as output variables. In experiment 3, the neural networks of the 3D case were trained only with the target mode expansion coefficient instead of all expansion coefficients simultaneously. Finally, in experiment 4, a very wide but shallow neural network and a very deep but narrow neural network were compared with and without applying L2 regularization. The regularization parameter was set to 10^-3, 10^-4 and 10^-5. \par It was concluded that increasing the amount of hidden layers and neurons, up to a certain amount can improve the prediction of both the high order modes and dose distributions, after which the performance starts to deteriorate. The amount of hidden layers and neurons per layer at which this occurs is dependent on the loss function and the amount of training data. More precisely, the performance of neural networks that utilize the mean absolute error function deteriorate faster than networks utilizing the mean squared error loss function as the amount of hidden neurons and hidden layers increases. On the other hand, the amount of training data is also important to fully represent the features of the highly non-linear modes that deeper and wider neural networks are able to learn. Deeper networks trained longer than wider networks and regularization did not improve the performance of the neural networks within the trained amount of epochs and chosen amount of hidden layers and neurons per layer.