Abstract
Extravasation or partial extravasation of the radiopharmaceutical dose in PET can undermine SUV and image quality. A topical sensor has been validated using several metrics to characterize injection quality after manual injection. The performance of these metrics for autoinjector administration has been assessed. Methods: A single PET/CT scanner at a single site was used to characterize injections using an autoinjector with standardized apparatus, flush volume, and infusion rate (1-min infusion followed by 2 syringe flushes) for 18F-FDG, 68Ga-prostate-specific membrane antigen, and 68Ga-DOTATATE. In total, 296 patients with topical application of sensors were retrospectively analyzed using conventional statistical analysis and an artificial neural network. Results: Partial extravasation was noted in 1.3% of studies, with 9.1% (inclusive of partial extravasation) identified to have an injection anomaly (e.g., venous retention). Extravasation was independently predicted by the time that elapsed as the counts recorded by the injection sensor fell from the maximum value to within 200% of the reference sensor counts greater than 1,200 s; as the difference in counts for injection and reference sensors, normalized by dose, from 4 min after injection greater than 25; and as the ratio of the average counts per second recorded by the injection sensor at the end of a monitoring period to those of the reference sensor greater than 2. Conclusion: Extravasation and partial extravasation of PET doses are readily detected and differentiated using time–activity curve metrics. The metrics can provide the insight that could inform image quality or SUV accuracy issues. Further validation of key metrics is recommended in a larger and more diverse cohort.
PET and the SUV, which have been well documented to change management in 38% of oncology patients (1), rely on pharmacokinetic assumptions that assume an extravasation-free injection. Extravasation occurs when part or all of the administered dose is injected outside the venous system or leaks into surrounding tissue (2–4) because of errors in placement of the needle or dislodgement. Dislodgement might occur in response to injection volume, injection pressure, movement, or several patient factors. Partial extravasation of the intravenous dose administration undermines the predictability of dose delivery, image quality, and the accuracy of the SUV (2).
An examination of partial-extravasation rates in 400 patients (5) found a rate of 10.5%, with 31% of those being undetected using standard imaging protocols (arms outside the field of view). A second study provided an analysis of 1,367 patient studies and found an 18% extravasation rate involving between 1% and 22% of the injected dose (6). A smaller single-center analysis found 38% of patients to have extravasated doses (7). Identifying and characterizing PET dose partial extravasation present several challenges in the clinical setting. First, the injection site is generally outside the field of view (arms up) and undetected. Second, dose administration techniques (infusion, autoinjectors) may prevent visual identification of signs of extravasation (pain and swelling) at the time of injection. Third, when partial extravasation is noted, quantifying the extent of extravasation and correcting the SUV are not readily achievable. Consequently, the studies outlined above are likely to underestimate extravasation rates and impact. A broader literature review suggests that the detectable extravasation rate (likely to be an underestimate) ranges from 9% to 23% for manual administrations (4).
A simple method for detecting and potentially characterizing PET dose extravasation is the use of topically applied scintillation sensors to monitor activity migration from the injection site during the uptake phase. The only commercially available device designed for this purpose is the Lara (Lucerno Dynamics, LLC) device. The performance of the Lara device, to detect partial-dose extravasation and provide dynamic data that assist in characterizing the extravasation, has been widely reported for manual administration of 18F-FDG (2,4,7). 18F-FDG injections in the hand, wrist, or forearm have higher extravasation rates than injections in the antecubital fossa; higher doses increase extravasation rates, lower body weights are associated with increased extravasation, and lower flush volumes are associated with increased extravasation (4).
Generally, the administration of PET doses has not been monitored in the clinical scenario, perhaps because—compared with CT and MRI contrast administration—the relative volume of the PET dose is very small, the local consequences of extravasation are benign, and direct (manual) injection is relied on. It is important in principle, and in using detection systems, to differentiate dose extravasation, extravasation with complete resolution during the uptake period, and venous retention with rapid and complete resolution. The Lara device is coupled to Lucerno Dynamics proprietary software, whose algorithm generates several metrics that guide decision making regarding dose-injection quality and differentiation between extravasation and venous retention or other anomalies.
The aim of this investigation was to evaluate the Lara quality assurance tool and associated metrics—specifically, the role of metrics for characterizing extravasation using the autoinjector method. Autoinjectors reduce contamination risk and occupational exposure of the staff to radiation; however, the injection is not supervised to detect extravasation, and the bolus profile may vary with the infusion process.
MATERIALS AND METHODS
The project was approved by the institutional ethics committee for retrospective analysis of the data from a quality improvement initiative forming standard care in a single PET/CT scanner department at a single center. Patients referred for 18F-FDG, 68Ga-DOTATATE, and 68Ga-prostate-specific membrane antigen (PSMA) PET/CT studies underwent Lara monitoring as part of their standard care. Patients were cannulated in any vein, generally the antecubital fossa. A 20-gauge needle was used unless circumstances demanded a different gauge. In each of 3 injection uptake rooms, Lara sensors were set up and applied to the subject using adhesive pads 7 cm proximal to the injection site and the same site on the contralateral arm (reference arm) using a previously described method (7). A KARL100 autoinjector (Tema Sinergie) was connected through the wall to the cannula, with the injection delivered to the patient via the Rad-Inject pump (Tema Sinergie) over 1 min. The Rad-Inject pump then performed a double syringe flush at the same rate as the primary infusion (that is, the single-dose syringe was flushed twice). The total planned volume of the flush was standardized to 80 mL. After dose administration, sensors collected data at 1-s intervals throughout the uptake phase (generally 50 min before the patient was escorted to the scanning room). Once removed from the patient, the recording device was connected to a computer and data uploaded to the Lucerno Dynamics platform, where it was interpreted by the software.
Analysis
The Lara data were extracted and presented as a time–activity curve, with the typical display showing the injection side in black and the reference side in red (Fig. 1). Although the Lara metrics have been previously validated for manual injections of 18F-FDG using doses on the order of 370–740 MBq (10–20 mCi), several limitations associated with validity for this patient cohort demanded manual interpretation. These included the lower doses used for patient administration (185–370 MBq [5–10 mCi]), the shape and width of the curve generated by the KARL100 autoinjector administration, and the use of 68Ga radiopharmaceuticals.
The manual interpretation of time–activity curves was based on understanding ideal injection time–activity curves: reference counts remaining low while the injection counts rapidly peak before rapidly declining to meet the reference levels (Fig. 1). Examination of the slope of the bolus injection on the time–activity curve as it approaches the reference sensor time–activity curve needs consideration. The time–activity curve count for the injection sensor relative to the reference sensor was also evaluated at various points during the uptake period. The time for the injection sensor to reduce to double the reference sensor count was calculated, and the area-under-the-curve (AUC) ratios between injection and reference sensor on the time–activity curves was determined at multiple time points.
Specifically, the following metrics were calculated and analyzed (Fig. 2): aUCR10, which is the AUC ratio between the injection and reference curves at 1–10 min after injection; aUCR1, which is the AUC ratio between the injection and reference curves at 60–90 s after injection; c1, which is the average count per second recorded by the injection sensor at 60–90 s after injection; CEndINJ, which is the average count per second recorded by the injection sensor at the end of a monitoring period; CEndREF, which is the average count per second recorded by the reference sensor at the end of a monitoring period; tHalf, which is the average time (in seconds) required for the count recorded by the injection sensor to fall to half the previous value; tc50, which is the time (in seconds) that elapses as the count recorded by the injection sensor falls from the maximum value to within 200% of the reference sensor count (i.e., reference is 50% of value of injection curve); ndAvg1, which is the difference in counts for injection and reference sensors, normalized by dose, at 60–90 s after injection; ndAvgN, which is the difference in counts for injection and reference sensors, normalized by dose, from 4 min after injection; and time–activity curve score, which is a linear, weighted combination of these metrics, with the weights being determined from a logistic regression. Good injections are generally associated with a negative score, whereas scores over 200 typically indicate that part of the dose remains at the injection site.
These metrics are calculated automatically by the Lucerno Dynamics algorithm without operator input.
The statistical significance was calculated using χ2 analysis for nominal data and Student t testing for continuous data. F test ANOVA was used to determine statistically significant differences within grouped data. A P value of less than 0.05 was considered significant.
RESULTS
There were 296 valid cases, with 1.3% (4) demonstrating evidence of extravasation on the time–activity curves (Fig. 3) whereas 9.1% (27), inclusive of extravasation cases, demonstrated some abnormality associated with dose administration (largely slow venous clearance in the time–activity curve). Other key demographic data and differences among the different radiopharmaceuticals are summarized in Table 1.
No statistically significant differences were noted between normal, abnormal (venous retention), and extravasated subgroups with respect to dose (P = 0.060), despite normal having a mean lower dose; with respect to radiopharmaceutical (P = 0.315), despite a higher proportion of abnormal and extravasated studies being 18F-FDG; with respect to the person performing the injection (P = 0.414), patient weight (P = 0.529), age (P = 0.365), and sex (0.245), despite men showing a higher proportion of vascular retention and women a higher proportion of extravasation; with respect to the location of the injection (P = 0.561), despite there being a 2.1 times higher chance of extravasation for hand, wrist, or forearm injections over antecubital fossa; and with respect to orientation (P = 0.09), despite there being a higher proportion of abnormal and extravasated doses on the right side. Several statistically significant differences were noted among the calculated metrics for the different injection outcomes (Table 2). One study in which the dose leaked from the injection apparatus was classified for these purposes as extravasation, as the resulting time–activity curve paralleled an extravasation curve. Several time–activity curves, particularly for 68Ga-PSMA, were truncated early, with production of an end-of-curve count anomaly that could create an aberrant CEnd ratio (Fig. 4) requiring correction of the CEndINJ and CEndREF.
Although the mean values showed discriminatory power between normal, abnormal, and extravasated doses, at an individual dose level (indicated by the range and interquartile range), few metrics provided a defined cutoff between classifications. Specifically, there was overlap of not just the range for the injection score across the injection outcomes (Table 2) but also of the interquartile range. Conversely, tc50, ndAvgN, and the newly created CEnd ratio demonstrated distinct cutoffs between extravasated injections and other injections (Table 2). The tc50 and CEnd ratio might also be used as a cutoff to indicate whether injection characteristics should be considered to influence SUV calculation. Furthermore, at the interquartile range level, ndAvgN also provided a cutoff to differentiate a normal injection from an abnormal (nonextravasated) injection with venous retention.
The previous statistical evaluation reported limitations associated with Lara metrics for autoinjector time–activity curves, including the time–activity curve score. Extravasation was independently predicted by a tc50 greater than 1,200, ndAvgN greater than 25, and CEnd ratio greater than 2. In this evaluation, the original data were reevaluated using an artificial neural network (Neural Analyser, version 2.9.5). The purpose was to demonstrate the usefulness of machine learning as an analysis approach parallel to conventional statistical analysis (8–10) and to confirm or refute observations using conventional statistical analysis.
There were 36 input variables in 296 patients, including extracted metrics, demographic data, injection parameters, and outcomes. A single binary output was an extravasated injection evidenced by the manual examination and characterization of the time–activity curve as previously described, or no extravasation (normal and other anomalies such as venous retention). A correlation matrix (heat map) was calculated to identify redundancy among variables and exclusion when appropriate. Logistic correlations revealed dependency of the outcome (extravasation) on several variables, with those of significance including tc50 (1.0), CEnd ratio (0.974), and ndAvgN (0.786), supporting the previous statistical analysis. The initial network architecture after omission of redundancy included 12 scaling layer inputs, 1 hidden layer of 3 nodes using a logistic activation function (defines the output of each node based on its input), and a single probabilistic layer (binary). The weighted squared error method was used to determine the loss index because there was an imbalance between positive and negative outputs (grounded truth). The neural parameters’ normal weight was used as the regularization method to control neural network complexity. A quasi-Newtonian training method was applied, using gradient information to estimate the inverse Hessian for each iteration of the algorithm (no second derivatives). The loss function associated with the training phase estimated the error associated with the data observed by the neural network and decreased from 1.405 to 0.010 after 21 iterations. The selection loss is a measure of the neural network’s agility (generalizability to new data), and in this case the error decreased from 1.452 to 0.0002 after 21 iterations. This finding indicates the need to optimize the final architecture.
A growing-input method was used to calculate the correlation for every input against each output in the dataset. Beginning with the most highly correlated inputs, progressively decreasing correlated inputs were added to the network until the selection loss increased, allowing identification of the optimal number of inputs. Similarly, an incremental-order method was used for input order selection, starting with the minimum order and adding perceptrons (and complexity) until the loss increased. The final architecture of the neural network reflects the optimized subset of inputs and order with the lowest selection loss. In this case, the selection loss rose significantly for the second input, with the training loss near optimal for 1 input. Thus, the optimal number of inputs was determined to be 1, with a training loss less than 0.001 and selection loss of 0.011. The optimal order was also 1 (single perceptron), with a training loss of 0.013 and selection loss of less than 0.001. The final architecture had 1 scaling-input layer (tc50), 1 hidden layer of a single node (perceptron), and a binary probabilistic output.
The final architecture was evaluated using several tests, which indicated robust validation using a subset of the original patient data. Receiver-operator-characteristic analysis demonstrated an AUC of 1.0. This correlates with a sensitivity of 100% and specificity of 100% and is reflected in the confusion matrix (100% true-positives, 100% true-negatives, 0% false-negative, and 0% false-positive). The cumulative gain analysis demonstrates excellent positive performance against random classification, with a maximum gain score of 0.983.
DISCUSSION
Partial extravasation of the patient dose, even with complete resolution during the uptake period, significantly changes the radiopharmaceutical kinetics and impacts the accuracy of the SUV. This effect arises because of the decreased activity imaged and the trickle effect into the vascular space associated with extravasated injections (3). The precise impact of partial extravasation on SUV will depend on the pharmacokinetics of the infiltration, the proportion of the dose extravasated, and the proportion and rate of any dose that reenters circulation (4). Although the precise impact on image quality and quantitation is difficult to determine (4), a tc50 calculation above 1,200 s will almost certainly have a deleterious impact on SUV calculation, and—subject to the proportion of the dose extravasated being greater than 5%—a tc50 between 600 and 1,200 s may also result in SUV calculation errors but should prompt interpretative caution. A tc50 below 600 s indicates normality or venous retention, with rapid clearance and negligible impact on SUV. Further investigation and validation of this metric is warranted for this purpose.
For 18F-FDG studies, a time–activity curve score above 200 was achieved in only 33.3% of cases, as likely reflects the lack of validation against parameters used at this site: autoinjector slow bolus with double-flush triple peaks and significantly lower patient doses (∼185 MBq [5 mCi] compared with the 370 MBq [10 mCi] used for validation of the score metric). For classification of an extravasated dose, a score cutoff of 200 produced a 33% sensitivity and 96.8% specificity for the 18F-FDG subgroup. No alternative cutoff improved this performance. Clearly, the utility may be improved with metrics that accommodate the broader bolus, slower clearance times, and variations in patient dose. Indeed, the sensitivity and specificity in this study are well short of those previously reported (82% and 100%, respectively) in studies using 18F-FDG and manual injection only (4).
The automatically calculated metrics outlined above are particularly vulnerable to a broader bolus from slower administration, as is reflected in the less-than-optimal predictive performance of metrics dependent on curve behavior in the first 5 min, including the time–activity curve score. Metrics independent of the behavior of the time–activity curve over the first 5 min (e.g., tc50) were demonstrated to be more robust predictors of extravasation—and, indeed, of the ability to differentiate between extravasation and venous retention—and might be readily considered a marker for determining a deleterious impact of injection kinetics on SUV calculation (e.g., the tc50).
aUCR10
The aUCR10 assumes a tight bolus peaking at 1 min after injection with rapid clearance, which would produce a nearly 1:1 ratio for a normal injection. Abnormal injections depend on retention of injection sensor activity beyond 1 min to produce a ratio beyond 2:1. Unfortunately, this metric is skewed by any delay in clearance (e.g., venous retention), by a broader bolus, and by primary or secondary peaks associated with the autoinjector. Although aUCR10 may be less effective for differentiating normal from abnormal vascular retention, it should be sufficiently sensitive to predict extravasation. Because this metric is dose-dependent, variability will occur with different dose activities administered.
aUCR1
aUCR1 is limited to the period from 60 to 90 s after injection and, as such, is more susceptible to variations in the time–activity curve associated with the autoinjector and, thus, lower discriminatory power, including for extravasation. Because this metric is dose-dependent, variability will occur with different dose activities administered.
c1
c1 is the average counts per second recorded by the injection sensor during the interval between 60 and 90 s after injection. c1 will fluctuate with the dose administered and the radiopharmaceutical. The value of c1 is reliant on a tight bolus, and thus, the broader bolus of the autoinjector infusion combined with the multiple slow flushes (secondary and tertiary peaks) will undermine the rigor of this metric in predicting variation from normal, as well as the efficacy of differentiating extravasation from venous retention. Because this metric is dose-dependent, variability will occur with different dose activities administered.
CEndINJ and CEndREF
CEndINJ and CEndREF are likely susceptible to variations in monitoring time. Indeed, there was a wide variety of monitoring periods ranging from 20 through 60 min. Nonetheless, identifying an endpoint beyond 20 min should exhibit, in the absence of extravasation, convergence of CEndINJ and CEndREF. Individually, the metrics are not particularly useful as discriminators despite the fact that CEndINJ is higher for extravasation. Because these metrics are dose-dependent, variability will occur with different dose activities administered.
tHalf
tHalf is a mean half-clearance rate, which should be higher for extravasated injections. Clearance is also delayed for venous retention, and the clearance rate is confounded by the secondary and tertiary peaks associated with the autoinjector. It would be possible to use mathematic curve stripping to extract a single curve representative of clearance, but given that the sensors are not positioned over the injection site itself, it may be more useful to express tHalf as the half-clearance time for a specific period after the normal bolus period. For example, a first tHalf from 5 to 10 min and a second from 10 to 20 min would provide more useful metrics to differentiate among normal, venous retention, and extravasated injections.
tc50
tc50 appears to be the most useful metric for differentiating extravasation from other injection outcomes and perhaps makes the previously suggested 5- to 10-min and 10- to 20-min half-clearance calculations redundant. A normal time–activity curve, including in patients with venous retention, should see the injection counts approach the reference counts (to the point of injection counts reducing to within double the reference counts) by 10 min. A tc50 greater than 600 (seconds) suggests abnormality, but a tc50 greater than 1,200 (20 min) may offer a useful marker for certainty of extravasation.
ndAvg1
ndAvg1 is confounded by the broader peak and secondary and tertiary peaks associated with the autoinjector and in patients exhibiting venous retention. This issue decreases the discriminatory value of this metric.
ndAvgN
ndAvgN should have good discriminatory power between normal injections (including with autoinjectors) and abnormal injections. Among abnormal injections, there may be difficulty differentiating venous retention from extravasation at 4 min after injection.
Time–Activity Curve Score
The time–activity curve score is a linear, weighted combination of these metrics. Although the weights are determined from a logistic regression, in the case of the unique characteristics of time–activity curves for the KARL100 autoinjector and Rad-Inject pump, time–activity curve score may be simply multiplying redundancy—that is, multiplying metrics that offer the same insight and using multipliers that lack correlation.
CEnd ratio is a new metric introduced in this project that expresses the relationship of CEndINJ and CEndREF as a ratio. A ratio approximating 1:1 indicates normality at that time (normal injection or rapid resolution of venous retention) and is thought to offer a useful tool for differentiation of extravasation from other anomalies and normal injections.
Several other automated measures that contribute to determination of the time–activity curve score may be better as independent identifiers of extravasation, including tc50 and ndAvgN. Using a tc50 cutoff of 600 s, 100% sensitivity and 100% specificity for identifying extravasation was noted using both conventional statistical analysis and artificial neural network analysis. That is, if at 20 min (1,200 s), the reference curve is less than 50% of the value of the injection curve, careful consideration should be given to an extravasated injection. As previously discussed, a cutoff of 600 s provides 100% sensitivity but only 95% specificity, with venous retention among the false-positive cases. A cutoff of 25 for ndAvgN demonstrated 100% sensitivity and 99.7% specificity for classifying extravasation, although 100% sensitivity and specificity were noted for 18F-FDG studies only. The cutoff of 0 was used to distinguish abnormal time–activity curves (venous retention but excluding extravasation) from normal time–activity curves with a 95.5% sensitivity and 41.4% specificity. This project proposed the new metric CEnd ratio, which, using a cutoff of 2, provides 100% sensitivity and specificity in classifying extravasation.
The use of topical sensors and automated scoring would add incremental clinical value, with further validation of all metrics against low-dose injections via autoinjectors for a range of radiopharmaceuticals. The predictive power of existing Lucerno Dynamics metrics and those introduced in this project with respective cutoffs also needs to be further validated but could provide deeper insight into injection behavior. The time–activity curves and metrics are sensitive to identification of extravasated injections, thus helping to characterize dose behavior and obviate imaging of the injection site. Specifically, the following are recommended markers but warrant further validation: tc50 greater than 1,200 indicates extravasation and a negative impact on SUV; tc50 less than 600 indicates a normal injection or venous retention that resolves, and no impact on SUV should be expected; tc50 between 600 and 1,200 is not extravasated but needs careful assessment of poor bolus kinetics and the impact on SUV; ndAvgN less than 0 indicates a normal injection; ndAvgN greater than 25 indicates extravasation; ndAvgN between 0 and 25 suggests vascular retention but may be a normal injection; and CEnd ratio greater than 2 indicates extravasation.
Artificial neural network analysis suggested scores generated by multiple metrics compound error and include redundancy, are unnecessarily complex, and increase potential error. Although CEnd ratio and ndAvgN were identified with tc50 as the key variables, with very high correlation, use of these metrics in combination provides no greater predictive capability than tc50 alone—a case of Occam’s razor. While the neural analysis clarified findings of conventional analysis, the reliability of the neural network is limited by the small positive cohort in the binary analysis.
Identification of the presence of extravasation does not indicate an actual impact on image quality or SUV accuracy. Further research is required to differentiate clinically important extravasation from evidence of clinically insignificant extravasation. The U.S. Nuclear Regulatory Commission has determined, with respect to radiopharmaceutical administration, that extravasation frequently occurs and is virtually impossible to avoid, and the commission does not consider extravasation to be misadministration. As a result, extravasation avoids being subject to the reporting and accountability requirements of misadministration. This study contradicts this position, with use of an autoinjector technique not only reducing staff doses but demonstrating that very low levels of extravasation are possible. Potentially eliminating extravasation for PET studies is possible, and every effort to achieve this reduction is essential to enhance image quality, improve SUV accuracy and reliability, and minimize unintentional exposure of patient tissue to radiation. The recognition of extravasated doses as misadministration, with the associated investigation and reporting requirements of regulators in Australia, may also be a factor in reducing rates.
CONCLUSION
Topical monitoring and characterization of PET dose administration are possible and practical with the Lara device, with careful consideration and validation of individual metrics. Extravasation and partial extravasation of PET doses are readily detected and differentiated using time–activity curve metrics. The metrics can provide the insight that could inform image quality or SUV accuracy issues. Importantly, extravasation or partial extravasation of PET doses can be minimized with an autoinjector. Further validation of key metrics is recommended in a larger and more diverse (radiopharmaceuticals, injection methods) cohort with discrimination between clinically significant and clinically insignificant extravasation cases.
DISCLOSURE
This project was funded by an ANSTO/ANZSNM research grant. No other potential conflict of interest relevant to this article was reported.
Acknowledgments
This project was undertaken as part of the honors work of Stephanie Sanchez and supervised by Geoffrey Currie. We thank the Lucerno Dynamics team in the United States and the Australian distributors (Bayer) of the Lara device for their patience, encouragement, support, and guidance. We also thank the dedicated staff in the clinical department for supporting and assisting with the project.
Footnotes
Published online Jun. 9, 2020.
REFERENCES
- Received for publication March 11, 2020.
- Accepted for publication April 16, 2020.