Published online: 2024-01-03

research paper

Reports of Practical Oncology and Radiotherapy

2023, Volume 28, Number 6, pages: 737–745

DOI: 10.5603/rpor.98729

Submitted: 13.08.2023

Accepted: 04.12.2023

© 2023 Greater Poland Cancer Centre.

Published by Via Medica.

All rights reserved.

e-ISSN 2083–4640

ISSN 1507–1367

Comparison of deep learning models for building two-dimensional non-transit EPID Dosimetry on Varian Halcyon

Muhammad Mahdi Ramadhan1, Wahyu Edy Wibowo2, Prawito Prajitno1, Supriyanto Ardjo Pawiro1
1Department of Physics, Faculty of Mathematics and Natural Sciences, Universitas Indonesia, Depok, Indonesia
2Department of Radiation Oncology, Dr. Cipto Mangunkusumo General Hospital, Faculty of Medicine, Universitas Indonesia, Jakarta, Indonesia

Address for correspondence: Supriyanto Ardjo Pawiro, Department of Physics, Faculty of Mathematics and Natural Sciences, Universitas Indonesia, Depok, 16424, Indonesia; e-mail: supriyanto.p@sci.ui.ac.id

This article is available in open access under the Creative Commons Attribution-Non-Commercial-No Derivatives 4.0 International (CC BY-NC-ND 4.0) license, which allows articles to be downloaded and shared with others as long as the authors and the publisher are credited, but does not permit changing them in any way or using them commercially.

Abstract
Background: Deep learning models are increasingly used to support prediction and decision-making in several fields, including oncology and radiotherapy. This study compared the effectiveness of five deep learning models in constructing non-transit dosimetry with an a-Si electronic portal imaging device (EPID) on the Varian Halcyon.
Materials and methods: Forty-seven unique treatment plans from breast cancer patients were calculated using the Eclipse treatment planning system (TPS) and exported in DICOM format as the ground truth. The Varian Halcyon was then used to irradiate the a-Si 1200 EPID detector without an attenuator. The EPID and TPS images were augmented and divided randomly into two groups of equal size to separate the validation data from the training–test data. Five different deep learning models were then created and validated using a gamma criterion of 3%/3 mm.
Results: Four models successfully improved the similarity of the EPID images and the TPS-generated planned dose images. Meanwhile, a mismatch between a model's constituent components and its number of parameters could cause it to produce incorrect results. The average gamma pass rates were 90.07 ± 4.96% for the A-model, 77.42 ± 7.18% for the B-model, 79.60 ± 6.56% for the C-model, 80.21 ± 5.88% for the D-model, and 80.47 ± 5.98% for the E-model.
Conclusion: The deep learning models run quickly and can increase the similarity of EPID images to TPS images for building non-transit dosimetry. However, more cases are needed to validate these models before they are used in clinical activities.
Key words: radiotherapy; dosimetry; EPID; Cine images; deep learning
Rep Pract Oncol Radiother 2023;28(6):737–745

Introduction

In radiotherapy, radiation doses are delivered to a target (cancer cells) while causing minimal damage to the surrounding healthy organs [1]. Patient-specific quality assurance (PSQA) ensures the accuracy and safety of the treatment process [2]. PSQA is divided into two categories: pretreatment verification and during-treatment verification. Current recommendations on PSQA focus on pretreatment verification in which the patient’s plan is delivered to either a phantom or air, and the absolute dose is measured using ion chambers, films, or an electronic portal imaging device (EPID).

Varian (Varian Medical Systems, Inc., Palo Alto, CA, United States) released the Halcyon, which has a two-layer multi-leaf collimator (MLC) system that ensures rapid beam modulation and substantially reduces leakage between the MLC leaves, thereby increasing workflow efficiency in radiotherapy. Meanwhile, as the machine does not have collimator jaws, the MLC is the only beam-forming component in the machine. Therefore, MLC positioning and optimization are critical to ensuring accurate dose delivery [2–4].

To ensure the consistency of the MLC and beam angle, an EPID detector is permanently attached inside the Halcyon. EPID images can be extracted to provide dose distribution information and serve as a dosimetric verification method before and during therapy [5–14]. One type of EPID dosimetry is non-transit dosimetry, which can predict the dose inside the patient for pre-treatment verification by measuring doses at a certain height without an attenuator. While many algorithms have been developed using mathematical formulas [6], this paper presents a novel method for non-transit dosimetry using deep learning models. Five models were developed, and the results were compared and validated using the gamma index with a criterion of 3%/3 mm.

Deep learning works by combining large amounts of data with fast, iterative intelligent processing algorithms. This allows the system to learn automatically from patterns or features in the data [15]. Researchers have used deep learning as a correction tool in EPID dosimetry to increase the similarity of dose distributions [16–21]. However, no previous study has used deep learning to reconstruct the dose distributions inside patients with an EPID for non-transit dosimetry.

Materials and methods

The experiment was performed at Cipto Mangunkusumo General Hospital using a Varian Halcyon (Varian Medical Systems, Palo Alto, CA, United States) equipped with an a-Si 1200 EPID. After irradiation, a single image was extracted, and the pixel values were automatically converted into calibrated units. The source-to-detector distance (SDD) was fixed at 154 cm, as shown in Figure 1. The a-Si 1200 EPID has a maximum irradiation area of 43 × 43 cm² and a matrix of 1,280 × 1,280 pixels, giving a pixel size of approximately 0.34 mm/pixel. The term "maximum irradiation area" denotes the largest allowable irradiation field.

Figure 1. Arrangement of non-transit dosimetry with an electronic portal imaging device (EPID). SAD: source-axis distance; SDD: source-to-detector distance; MLC: multileaf collimator

Furthermore, the EPID images were compared to those of DICOM RT, which were generated from the Eclipse treatment planning system (TPS) using an anisotropic analytical algorithm (AAA). The flowchart of this study is shown in Figure 2 and is explained below.

Figure 2. Workflow of artificial intelligence (AI) model development to build non-transit dosimetry by electronic portal imaging device (EPID)

Breast cancer was considered in this study because it has the highest age-standardized mortality rate of around 15.3 per 100,000 population [22].

The maximum field size that can be captured by the detector was calculated with the similar-triangle theorem: the source-axis distance (SAD) of 100 cm divided by the source-to-detector distance (SDD) of 154 cm, multiplied by the maximum irradiated area of 43 × 43 cm². The maximum field size at the isocenter was therefore 27.92 × 27.92 cm².
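
As a quick check, this similar-triangle projection can be reproduced in a few lines of Python (a sketch using the geometry quoted above; the variable names are illustrative):

```python
# Back-project the EPID panel size from the detector plane (SDD) to the
# isocenter plane (SAD) using similar triangles.
SAD_CM = 100.0        # source-axis distance
SDD_CM = 154.0        # source-to-detector distance
EPID_SIDE_CM = 43.0   # physical side length of the a-Si 1200 panel

max_field_at_iso = EPID_SIDE_CM * SAD_CM / SDD_CM
print(f"Maximum field size at the isocenter: "
      f"{max_field_at_iso:.2f} x {max_field_at_iso:.2f} cm^2")  # ~27.92 x 27.92 cm^2
```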

There were 47 unique fields from breast cancer cases with a field size below the maximum allowed. All fields were additionally recalculated at a gantry angle of 0°, bringing the total dataset to 94 fields. The planned dose distributions were exported from the TPS in DICOM format and used as the ground truth.

Augmentation

The original dataset was divided randomly into two equal parts: a training–test dataset and a validation dataset. This split was performed before augmentation to prevent mixing between the training–test and validation datasets.

Deep neural networks require large amounts of training data to obtain good results and prevent overfitting. Augmentation was therefore performed by rotating and flipping the dataset, a common image augmentation technique in deep neural network research [23, 24]. This augmentation added more than 1,000 image pairs, which were divided into 70% training data and 30% testing data, as sketched below.
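
A minimal sketch of this kind of paired augmentation is shown below (assuming NumPy arrays and using 90° rotations plus horizontal flips; the exact set of transforms applied in the study is not detailed here):

```python
import numpy as np

def augment_pair(epid_img: np.ndarray, tps_img: np.ndarray):
    """Generate rotated and flipped versions of one (EPID, planned-dose) pair.

    The same transform is applied to the input image and to its ground-truth
    target so that the pixel-wise correspondence is preserved.
    """
    pairs = []
    for k in range(4):  # rotations by 0, 90, 180 and 270 degrees
        rot_epid = np.rot90(epid_img, k)
        rot_tps = np.rot90(tps_img, k)
        pairs.append((rot_epid, rot_tps))
        pairs.append((np.fliplr(rot_epid), np.fliplr(rot_tps)))  # horizontal flip
    return pairs
```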

Deep learning models

Five deep learning models were developed using the Python programming language and run on an NVIDIA K80 GPU with 12 GB of memory. The inputs to the deep learning models were the EPID images, whereas the ground truths (targets) were the planned dose images. Each model was optimized using the Adam optimization algorithm. Although the number of epochs was set to 300, training could be stopped earlier if there was no improvement in the results. The learning rate was 0.000001, and the loss was calculated using the mean squared error (MSE). The MSE is the average of the squared differences between the predicted and actual values and is a common measure of pixel-wise similarity between two images. The details of each model are explained below.
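
A training setup along these lines could look as follows (a sketch assuming the Keras API; the batch size, early-stopping patience, and variable names are illustrative assumptions rather than the authors' exact configuration):

```python
import tensorflow as tf

def compile_and_train(model: tf.keras.Model, x_train, y_train, x_test, y_test):
    """Train an EPID-to-dose model with the settings described above:
    Adam optimizer, learning rate 1e-6, MSE loss, and up to 300 epochs
    with early stopping when the validation loss stops improving."""
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-6),
                  loss="mse")
    early_stop = tf.keras.callbacks.EarlyStopping(monitor="val_loss",
                                                  patience=10,
                                                  restore_best_weights=True)
    return model.fit(x_train, y_train,
                     validation_data=(x_test, y_test),
                     epochs=300,
                     batch_size=2,  # large 1,280 x 1,280 images favor small batches
                     callbacks=[early_stop])
```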

A-Model

This model is a convolutional neural network (CNN) with five layers; the architecture is presented in Supplementary File Table S1. There were 64 hidden neurons in each layer, and the number of trainable parameters optimized during the training sessions was 113,155.
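
For illustration, a five-layer CNN of this kind could be sketched as follows (kernel sizes and activations are assumptions, so the parameter count will differ slightly from the reported 113,155; the exact architecture is given in Supplementary Table S1):

```python
import tensorflow as tf

def build_a_model(input_shape=(1280, 1280, 1)) -> tf.keras.Model:
    """Five-layer CNN sketch: 64 filters in each hidden layer and a
    single-channel dose image as output."""
    inputs = tf.keras.Input(shape=input_shape)
    x = inputs
    for _ in range(4):  # four hidden convolutional layers with 64 filters each
        x = tf.keras.layers.Conv2D(64, 3, padding="same", activation="relu")(x)
    outputs = tf.keras.layers.Conv2D(1, 3, padding="same", activation="linear")(x)
    return tf.keras.Model(inputs, outputs)
```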

B-Model

This model is a CNN with six layers; the architecture is presented in Supplementary File Table S2. The number of hidden neurons doubled with each additional layer (16, 32, 64, 128, 256), and the last (sixth) layer was the output layer. Since this model had more layers, the total number of parameters increased to 1,097,285.

C-Model

This model is a convolutional autoencoder; the architecture is presented in Supplementary File Table S3. This architecture has been shown to eliminate noise in images while retaining spatial and temporal information [25]. The model comprised two major parts: an encoder section and a decoder section. In the encoder section, each added layer increased the number of hidden neurons while decreasing the image dimensions; in the decoder section, each added layer decreased the number of hidden neurons while increasing the image dimensions. The input had the same dimensions as the output, and the total number of parameters was 1,018,817.
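
A compact sketch of such an encoder and decoder is given below (filter counts, kernel sizes, and the pooling scheme are assumptions; Supplementary Table S3 gives the exact architecture):

```python
import tensorflow as tf

def build_autoencoder(input_shape=(1280, 1280, 1)) -> tf.keras.Model:
    """Convolutional autoencoder sketch: the encoder increases the number of
    filters while shrinking the image, and the decoder mirrors it back to the
    original resolution, so the output has the same dimensions as the input."""
    inputs = tf.keras.Input(shape=input_shape)

    # Encoder: more filters, smaller spatial dimensions.
    x = inputs
    for filters in (32, 64, 128, 256):
        x = tf.keras.layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
        x = tf.keras.layers.MaxPooling2D(2)(x)

    # Decoder: fewer filters, larger spatial dimensions.
    for filters in (128, 64, 32, 16):
        x = tf.keras.layers.Conv2DTranspose(filters, 3, strides=2,
                                            padding="same", activation="relu")(x)

    outputs = tf.keras.layers.Conv2D(1, 3, padding="same", activation="linear")(x)
    return tf.keras.Model(inputs, outputs)
```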

D-Model

This model is also a convolutional autoencoder, and the architecture is presented in Supplementary File Table S4. The difference between the D- and C-models is that the number of hidden neurons in the encoder section of the D-model was reduced from 256 to 128. Reducing the number of hidden neurons decreases the number of parameters for training. Based on these two models, the effects of the total number of parameters can be analyzed. The total number of parameters for this model was 427,713, which was 58.02% less than that for the C-model.

E-Model

This model is a U-Net; the architecture is presented in Supplementary File Table S5. U-Net was originally developed for biomedical image segmentation. Its architecture can be broadly considered an encoder network followed by a decoder network (i.e., an autoencoder). Previous studies have shown that this type of model can be used to correct the reconstructed absolute dose distributions of EPID dosimetry [16]. In this study, we adopted it for non-transit EPID dosimetry. It had a depth of four, and each down-sampling block comprised two sub-blocks, each containing a convolution layer followed by batch normalization and a rectified linear unit (ReLU) activation function. Additionally, it had the highest number of parameters used for training (7,759,521).
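
A depth-4 U-Net of this kind could be sketched as follows (filter counts are assumptions; Supplementary Table S5 gives the exact architecture):

```python
import tensorflow as tf

def conv_block(x, filters):
    """Two sub-blocks, each a convolution followed by batch normalization and ReLU."""
    for _ in range(2):
        x = tf.keras.layers.Conv2D(filters, 3, padding="same", use_bias=False)(x)
        x = tf.keras.layers.BatchNormalization()(x)
        x = tf.keras.layers.ReLU()(x)
    return x

def build_unet(input_shape=(1280, 1280, 1), base_filters=16, depth=4) -> tf.keras.Model:
    """Depth-4 U-Net sketch: an encoder with skip connections to a mirrored decoder."""
    inputs = tf.keras.Input(shape=input_shape)
    x, skips = inputs, []

    # Encoder: convolve, store the skip connection, then down-sample.
    for level in range(depth):
        x = conv_block(x, base_filters * 2 ** level)
        skips.append(x)
        x = tf.keras.layers.MaxPooling2D(2)(x)

    x = conv_block(x, base_filters * 2 ** depth)  # bottleneck

    # Decoder: up-sample, concatenate the matching skip, convolve.
    for level in reversed(range(depth)):
        x = tf.keras.layers.Conv2DTranspose(base_filters * 2 ** level, 2,
                                            strides=2, padding="same")(x)
        x = tf.keras.layers.Concatenate()([x, skips[level]])
        x = conv_block(x, base_filters * 2 ** level)

    outputs = tf.keras.layers.Conv2D(1, 1, activation="linear")(x)
    return tf.keras.Model(inputs, outputs)
```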

Validation

Validation was performed using the gamma index (γ) method, which is frequently used to verify complex modulated radiotherapy [26–29]. Several researchers have used it to measure the similarity between two dose distributions in 2D and 3D [30]. In this study, all validation images from the EPID were first compared with the planned dose (this comparison is referred to as the origin); those EPID images were then enhanced with each deep learning model and compared with the planned dose again. All comparisons used gamma criteria of 3%/3 mm.
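
For reference, a 2D gamma pass rate with 3%/3 mm criteria can be computed with openly available tools such as the pymedphys package (a sketch; the pixel spacing, dose cutoff, and function name are illustrative, and the study does not state which gamma implementation was used):

```python
import numpy as np
import pymedphys

def gamma_pass_rate(planned: np.ndarray, evaluated: np.ndarray,
                    pixel_mm: float = 0.34) -> float:
    """Return the 3%/3 mm gamma pass rate (%) between a planned dose image
    and an evaluated (EPID or model-corrected) image on the same grid."""
    axes = (np.arange(planned.shape[0]) * pixel_mm,
            np.arange(planned.shape[1]) * pixel_mm)
    gamma = pymedphys.gamma(axes, planned, axes, evaluated,
                            dose_percent_threshold=3,
                            distance_mm_threshold=3,
                            lower_percent_dose_cutoff=10)  # cutoff is an assumption
    valid = ~np.isnan(gamma)
    return 100.0 * np.count_nonzero(gamma[valid] <= 1) / np.count_nonzero(valid)
```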

Results

The validation dataset comprised 47 cases, each of shape 1,280 × 1,280 × 1 pixels. Every model was trained, tested, and validated with the same data. Table 1 reveals that deep learning can be used to build non-transit dosimetry on the Halcyon. Moreover, the EPID images improved by the A-, C-, D-, and E-models had higher average gamma pass rates than the origin. Surprisingly, the B-model had a lower average gamma pass rate than the origin, meaning this model is not suitable, or needs improvement, for building non-transit dosimetry. Furthermore, although the A-model had the fewest parameters, it had the highest average gamma pass rate. Therefore, in addition to the number of parameters, the accuracy of a model is determined by its constituent components. If the model is not suitable for the problem, the output will be inaccurate, even if it has a large number of parameters.

Table 1. Gamma index 3%/3 mm results of the comparison between origin electronic portal imaging device (EPID) image without and with artificial intelligence (AI) model to planned dose image by treatment planning system (TPS) on 47 validation cases

                          Origin         A Model        B Model        C Model        D Model        E Model
Average γ-pass rate [%]   79.29 ± 6.27   90.07 ± 4.96   77.42 ± 7.18   79.60 ± 6.56   80.21 ± 5.88   80.47 ± 5.98
Max γ-pass rate [%]       87.74          94.70          88.22          89.66          89.68          89.69
Min γ-pass rate [%]       61.96          75.51          51.77          62.00          62.05          62.07

Table 2 compares the processing time and the average gamma pass rate improvement over the origin for each model. The processing time did not have a linear relationship with the number of parameters; thus, more parameters do not always lead to longer processing times. The A-model had the shortest processing time because it had the fewest parameters, whereas the B-model had the longest processing time despite not having the highest number of parameters. The average processing time was less than 2 s, indicating that deep learning is fast enough for clinical activities. From fastest to slowest, the models rank A, D, C, E, and B. Moreover, the average gamma pass rate improvement over the origin revealed that the A-model improved the most, with an average gain of 14.10 ± 8.79% over the origin.

Table 2. Average processing time and gamma pass rate improvement from origin on each model

           Average processing time [s]   Average gamma pass rate improvement from origin [%]
Model A    0.414 ± 0.113                 14.10 ± 8.79
Model B    1.512 ± 0.199                 –0.38 ± 9.99
Model C    0.788 ± 0.022                 0.68 ± 8.22
Model D    0.651 ± 0.052                 1.42 ± 6.94
Model E    1.187 ± 0.017                 1.72 ± 6.88

The distribution of gamma pass rates obtained from the validation of the deep learning models and the original data is presented in Figure 3. This categorization was performed to identify successful image patterns in each model. The A-model exhibited the most favorable results, with 30 out of 47 images falling in the 91–95% gamma pass rate group; none of the other models reached this group.

Figure 3. The distribution of the number of cases from each artificial intelligence (AI) model with respect to the gamma pass rate interval

Figure 4 compares the gamma pass rate maps for A-model cases from the 91–95% and 76–80% groups. Other research on developing deep learning models for EPID-based dosimetry has highlighted the penumbra area as a region of particular concern for accurate correction [16]. The penumbra area in an EPID image refers to the region at the boundary of the irradiated object that experiences a shadowing or attenuation effect; this occurs because of the physical properties of radiation, whereby scattered radiation creates shadows on the surface of the EPID detector. In this case, however, no pattern in the penumbra area was identified that affects the similarity to the target image; as shown in Figure 4B, the brighter (less similar) regions are not restricted to the boundary of the irradiated object.

Figure 4. Comparison of gamma index maps for the A-model between (A) a case with a pass rate of 94.70% and (B) a case with a pass rate of 75.51%. A darker color indicates a higher level of image similarity

Discussion

The development of deep learning models depends significantly on the problem to be solved. The absence of a gold standard for determining the layout of each hidden layer, the number of parameters, and the minimum amount of data required for accurate model results has become a major topic in deep learning research [31–33].

In this study, it was shown that the number of parameters is not directly proportional to the quality of the output image. Table 2 reveals that the A-model, which had the fewest parameters, achieved the largest gamma pass rate improvement of all the models. We assume that adding more parameters to a model requires more training data. The results of this study also strengthen other work comparing performance across different numbers of hidden layers or parameters, which shows that when more data are added for training and validation, models with more hidden layers achieve better accuracy than models with fewer hidden layers [34]; however, further research should be conducted to validate this assumption.

The best gamma index results of the B-, C-, D-, and E-models remained below 90%. Unexpectedly, some of the gamma index results of the B-model were lower than those of the original image without artificial intelligence (AI). The larger parameter count of the B-model may require a more meticulous tuning process and a more extensive training dataset. Furthermore, these findings strongly suggest that model inaccuracies can have a detrimental impact on the results, potentially yielding performance inferior to the original image without AI. Therefore, many deep learning studies incorporate a confusion matrix to analyze the likelihood of dangerous false positives and false negatives, as these could have significant implications in the healthcare domain. False positives refer to instances where the AI system incorrectly identifies a condition or event that is not present, whereas false negatives occur when the system fails to detect a condition or event that is actually present.

In Figure 4, an extensive analysis was undertaken with the aim of identifying discernible patterns within the gamma maps generated by model A under varying levels of accuracy. Specifically, we examined the gamma maps corresponding to the model’s highest accuracy (Fig. 4A) and those associated with the lowest accuracy (Fig. 4B). Our analysis was focused on two key aspects: field size and the intensity of coloration within the penumbra region.

Despite meticulous examination and a comprehensive comparative approach, it is noteworthy that our findings did not reveal any prominent or readily identifiable patterns in either of these aspects. This outcome suggests that, within the scope of our study, there were no stark differences in field size or significant alterations in the brightness levels within the penumbra region that could be unequivocally attributed to variations in model accuracy. This lack of discernible patterns underscores the complexity of the relationship between model accuracy and the resulting gamma maps.

This study involved the construction of models A, B, C, D, and E, which are derived from three basic architectures: CNN, autoencoder, and U-Net. With an equal amount of training data, the CNN demonstrated superiority based on the gamma pass rate data. However, it is essential to recognize that deep learning encompasses more than just datasets; improper preprocessing techniques and misconfigured hyperparameters can lead to ineffective performance and reduced accuracy. Further research is necessary to ensure that each model is appropriately tuned to achieve optimal results.

In clinical implementation, the use of deep learning will greatly accelerate the pre-treatment verification of patient-specific quality assurance: the image obtained from the EPID is directly converted into a dose and projected onto the patient's target by the deep learning model, so that it can be compared directly with the dose calculated by the TPS. This study produced five deep learning models that each run in less than 2 seconds. Furthermore, it adds to the evidence of the success of deep learning in building accurate dosimetry based on EPID images [16, 19, 34]. Moreover, the results should be validated on more cases and could be improved by replacing the MSE loss with the gamma index directly; however, faster processors and more RAM would then be required to train the models.

Creating a deep learning (DL) model requires comprehensive data in large numbers. For example, to make a DL model capable of auto-contouring organs at risk on CT images, a large number of CT images before and after contouring is needed as targets, but such data are very hard to find as open-source datasets. The lack of open datasets in radiotherapy means that newly created deep learning models cannot be compared directly with other research. An open collection of EPID and TPS data covering many cases and centers is needed so that researchers can focus on developing their deep learning models and tuning their hyperparameters without worrying about data availability. In future research, there is potential for the integration of multiple distinct models with the goal of enhancing the precision of AI dosimetry.

Conclusion

In this study, a deep learning-based method for building non-transit dosimetry was proposed to improve the similarity between EPID images and planned dose images from the TPS at the isocenter plane. Five different deep learning models were compared, and we found that the A-model had the best performance, with a gamma pass rate higher by 14.10 ± 8.79% than that of the EPID images not subjected to the deep learning models, followed by the E-, D-, C-, and B-models. In addition, the A-model had the fastest processing time of 0.414 ± 0.113 seconds, followed by the D-, C-, E-, and B-models.

Conflict of interest

There is no conflict of interest in this study.

Funding

This study was supported by Universitas Indonesia PUTI research grant (contract number: NKB-1665/UN2.RST/HKP.05/00/2020).

Acknowledgement

The authors thank the management and all staff members in the Department of Radiotherapy, Cipto Mangunkusumo Hospitals and Department of Physics, Universitas Indonesia.

References

  1. Podgorsak EB. Radiation Oncology Physics: A Handbook for Teachers and Students. , Vienna 2005.
  2. Laugeman E, Heermann A, Hilliard J, et al. Comprehensive validation of halcyon 2.0 plans and the implementation of patient specific QA with multiple detector platforms. J Appl Clin Med Phys. 2020; 21(7): 39–48, doi: 10.1002/acm2.12881, indexed in Pubmed: 32368862.
  3. Ray X, Bojechko C, Moore KL. Evaluating the sensitivity of Halcyon’s automatic transit image acquisition for treatment error detection: A phantom study using static IMRT. J Appl Clin Med Phys. 2019; 20(11): 131–143, doi: 10.1002/acm2.12749, indexed in Pubmed: 31587477.
  4. Jin P, Xie YH, Huang M, et al. Quantitative evaluation of transmission EPID daily imaging on a Halcyon Linac. J Phys: Conference Series. 2019; 1305(1): 012064, doi: 10.1088/1742-6596/1305/1/012064.
  5. Tan YI. 2D transit dosimetry using electronic portal imaging device. PhD thesis, University of Glasgow, 2016 (glathesis: 2016-7043).
  6. van Elmpt W, McDermott L, Nijsten S, et al. A literature review of electronic portal imaging for radiotherapy dosimetry. Radiother Oncol. 2008; 88(3): 289–309, doi: 10.1016/j.radonc.2008.07.008, indexed in Pubmed: 18706727.
  7. Fuangrod T, Connor DJ, Mccurdy BM. Development of EPID-based Real time Dose Verification for Dynamic IMRT. Int J Med Health Biomed Bioengin Pharm Engineering. 2011; 5(8): 349–352.
  8. Thongsawad S, Chanton T, Saiyo N, et al. Development of EPID-based dosimetry for FFF-beam verification in radiation therapy. J Phys: Conference Series. 2019; 1285(1): 012031, doi: 10.1088/1742-6596/1285/1/012031.
  9. Ding A, Xing L, Han B. Development of an accurate EPID-based output measurement and dosimetric verification tool for electron beam therapy. Med Phys. 2015; 42(7): 4190–4198, doi: 10.1118/1.4922400, indexed in Pubmed: 26133618.
  10. McCurdy B. Dosimetry in radiotherapy using a-Si EPIDs: Systems, methods, and applications focusing on 3D patient dose estimation. J Phys: Conference Series. 2013; 444: 012002, doi: 10.1088/1742-6596/444/1/012002.
  11. Mans A, Wendling M, McDermott LN, et al. Catching errors with in vivo EPID dosimetry. Med Phys. 2010; 37(6): 2638–2644, doi: 10.1118/1.3397807, indexed in Pubmed: 20632575.
  12. Rowshanfarzad P, Sabet M, O’Connor DJ, et al. Detection and correction for EPID and gantry sag during arc delivery using cine EPID imaging. Med Phys. 2012; 39(2): 623–635, doi: 10.1118/1.3673958, indexed in Pubmed: 22320771.
  13. Warkentin B, Steciw S, Rathee S, et al. Dosimetric IMRT verification with a flat-panel EPID. Med Phys. 2003; 30(12): 3143–3155, doi: 10.1118/1.1625440, indexed in Pubmed: 14713081.
  14. Piermattei A, Fidanzio A, Stimato G, et al. In vivo dosimetry by an aSi-based EPID. Med Phys. 2006; 33(11): 4414–4422, doi: 10.1118/1.2360014, indexed in Pubmed: 17153420.
  15. Cireşan DV, Meier U, Masci LM, et al. Flexible, high performance convolutional neural networks for image classification. IJCAI International Joint Conference on Artificial Intelligence. 2011, doi: 10.5591/978-1-57735-516-8/IJCAI11-210.
  16. Olaciregui-Ruiz I, Torres-Xirau I, Teuwen J, et al. A Deep Learning-based correction to EPID dosimetry for attenuation and scatter in the Unity MR-Linac system. Phys Med. 2020; 71: 124–131, doi: 10.1016/j.ejmp.2020.02.020, indexed in Pubmed: 32135486.
  17. Siddique S, Chow JCL. Artificial intelligence in radiotherapy. Rep Pract Oncol Radiother. 2020; 25(4): 656–666, doi: 10.1016/j.rpor.2020.03.015, indexed in Pubmed: 32617080.
  18. Bridge P, Bridge R. Artificial Intelligence in Radiotherapy: A Philosophical Perspective. J Med Imaging Radiat Sci. 2019; 50(4 Suppl 2): S27–S31, doi: 10.1016/j.jmir.2019.09.003, indexed in Pubmed: 31591033.
  19. Ramadhan MM, Wibowo WE, Pawiro SA. Development of Artificial Intelligence as a Conversion Tool for Cine Electronic Portal Imaging Device Images to Radiotherapy Dosimetry: Preliminary Study. Iran J Med Phys. 2022, doi: 10.22038/IJMP.2022.60896.2024.
  20. Thompson RF, Valdes G, Fuller CD, et al. Artificial intelligence in radiation oncology: A specialty-wide disruptive transformation? Radiother Oncol. 2018; 129(3): 421–426, doi: 10.1016/j.radonc.2018.05.030, indexed in Pubmed: 29907338.
  21. Chan MF, Witztum A, Valdes G. Integration of AI and Machine Learning in Radiotherapy QA. Front Artif Intell. 2020; 3: 577620, doi: 10.3389/frai.2020.577620, indexed in Pubmed: 33733216.
  22. The Global Cancer Observatory. Indonesia Fact Sheet 2018. 2019 [Online]. Available: https://gco.iarc.fr/today/data/factsheets/populations/360-indonesia-fact-sheets.pdf.
  23. He K, Zhang X, Ren S, et al. Deep Residual Learning for Image Recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016, doi: 10.1109/cvpr.2016.90.
  24. Szegedy C, Ioffe S, Vanhoucke V, et al. Inception-v4, Inception-ResNet and the impact of residual connections on learning. In: 31st AAAI Conference on Artificial Intelligence (AAAI 2017), 2017.
  25. Venkataraman P. Image Denoising Using Convolutional Autoencoder. arXiv. 2022; 2207: 11771v1.
  26. Hwang S, Ye S, Park J, et al. Study on the Sensitivity of Gamma-Index Evaluation Methods to Positioning Errors of High-Definition MLC of True Beam STx in VMAT Quality Assurance for SBRT. Int J Radiat Oncol Biol Phys. 2014; 90(1): S858–S859, doi: 10.1016/j.ijrobp.2014.05.2456.
  27. Hussein M, Rowshanfarzad P, Ebert MA, et al. A comparison of the gamma index analysis in various commercial IMRT/VMAT QA systems. Radiother Oncol. 2013; 109(3): 370–376, doi: 10.1016/j.radonc.2013.08.048, indexed in Pubmed: 24100148.
  28. Vieillevigne L, Molinier J, Brun T, et al. Gamma index comparison of three VMAT QA systems and evaluation of their sensitivity to delivery errors. Phys Med. 2015; 31(7): 720–725, doi: 10.1016/j.ejmp.2015.05.016, indexed in Pubmed: 26095758.
  29. Stasi M, Bresciani S, Miranti A, et al. Pretreatment patient-specific IMRT quality assurance: a correlation study between gamma index and patient clinical dose volume histogram. Med Phys. 2012; 39(12): 7626–7634, doi: 10.1118/1.4767763, indexed in Pubmed: 23231310.
  30. Low DA, Harms WB, Mutic S, et al. A technique for the quantitative evaluation of dose distributions. Med Phys. 1998; 25(5): 656–661, doi: 10.1118/1.598248, indexed in Pubmed: 9608475.
  31. Hüllermeier E, Waegeman W. Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods. Machine Learning. 2021; 110(3): 457–506, doi: 10.1007/s10994-021-05946-3.
  32. Kompa B, Snoek J, Beam AL. Second opinion needed: communicating uncertainty in medical machine learning. NPJ Digit Med. 2021; 4(1): 4, doi: 10.1038/s41746-020-00367-3, indexed in Pubmed: 33402680.
  33. Brownlee J. A Gentle Introduction to Uncertainty in Machine Learning. Machine Learning Mastery, pp. 1–10, 2020. https://machinelearningmastery.com/gentle-introduction-concept-drift-machine-learning/.
  34. Jia M, Wu Y, Yang Y, et al. Deep learning-enabled EPID-based 3D dosimetry for dose verification of step-and-shoot radiotherapy. Med Phys. 2021; 48(11): 6810–6819, doi: 10.1002/mp.15218, indexed in Pubmed: 34519365.