Abstract
In PET, partial-volume effects cause errors in estimation of size and activity for small objects with radiopharmaceutical uptake. Recent methods for image reconstruction, compared with traditional reconstruction techniques, include algorithms for resolution recovery that result in images with higher resolution and enable quantification of size and activity of smaller objects. The purpose of this study was to evaluate a combination of 2 algorithms for volume delineation and partial-volume correction on uptake volumes smaller than 0.7 mL using image reconstruction algorithms with and without resolution recovery. Methods: Volumes of interests (VOIs) were delineated using a threshold intensity calculated as a weighted sum of tumor and background intensities. These VOIs were used for calculating correction factors by convolving a tumor mask with the system point-spread function. The methods algorithms were evaluated using a phantom constructed from 5 small different-sized balloons filled with 18F-FDG in background activity. Six different backgrounds were used. Data were acquired using a PET/CT scanner, and the images were reconstructed using 2 iterative algorithms, one of which used a resolution recovery algorithm. Results: For the images reconstructed using the resolution recovery algorithm, the method for volume delineation resulted in VOI sizes that were correct within 1 SD for all balloons of a volume of 0.35 mL (equivalent diameter, 8.8 mm) and larger, in all backgrounds. For the images reconstructed without resolution recovery, the VOI sizes were background-dependent and generally less accurate. Correct volume delineations generally led to accurate activity estimates. Conclusion: The algorithms tested on the phantom developed for this study could, for this PET camera and these reconstruction algorithms, be used for accurate volume delineation and activity quantification of lesions 0.35 mL and larger.
PET with 18F-FDG is frequently used for diagnosis and staging of tumors and for therapy evaluation (1). Many methods for analyzing 18F-FDG uptake in tumors are quantitative or semiquantitative (2), and a frequently used measure is the standardized uptake value (SUV) (3). SUV is subject to several factors affecting its reliability, such as motion artifacts, dependence on time between injection and image acquisition, and the method used for volume delineation (4). In addition to these, the relatively poor spatial resolution introduces biases, especially for small objects (5), such as small tumors and lymph nodes. This is called the partial-volume effect (PVE) (6). There is a demand for more complex algorithms for correct delineation also for larger tumors (7), and because SUV plays an important prognostic role in many cancer studies, such as non–small cell lung cancer imaging, there is a need for correct delineation and SUV calculations (8). Many studies have been performed on this subject (9–11) for varying objects and resolution. There are also extensive reviews on the subject (5,12).
The spatial resolution of PET images is normally between 3.5 and 7 mm, measured as the full width at half maximum (FWHM) of the point-spread function (PSF). The relatively poor spatial resolution causes spillover between regions with different activity concentrations. PVE is often quantified with the recovery coefficient (RC)—that is, the ratio between the activity concentration measured in the image and the true activity concentration (13). The RCs can also be calculated theoretically and used as factors for correcting the measured activity in a volume of interest (VOI).
The SUV is often used as a measured quantity for diagnosis and staging of tumors and lymph nodes (14,15). Commonly, a threshold value is used as a measure of malignancy (16). SUV can, for example, be measured as the mean value within a VOI or a region of interest (ROI) or as the maximum voxel value within the VOI or ROI. Using the mean results in a measure that is more affected by PVE, whereas the maximum value is more sensitive to noise and typically represents an overestimation of the actual value (17). Both the mean and maximum values are dependent on the method used for image reconstruction, because the reconstruction method affects image noise and resolution. If proper partial-volume corrections can be performed, the threshold value for determining malignancy will be independent of the reconstruction method chosen.
Recent methods for image reconstruction, compared with traditional reconstruction techniques, include algorithms for resolution recovery that result in images with higher resolution (18) and enable quantification of the activity and size of smaller objects than before. Many studies of PVE have been performed using acrylic or glass phantoms with fillable spheres of different sizes (9,10,19). However, these spheres have walls of approximately 1-mm thickness, which creates activity-free zones and will result in an underestimation of spill-in from the background. This is problematic for the estimation of both volume and activity as has been shown in other studies. A study by Bazañez-Borgert et al. (20) focuses on measured RCs that affect the estimated activity and shows that RCs can be up to 21% higher in phantoms without plastic walls. A study by Hofheinz et al. (21) addresses the effect on the volume-reproducing threshold and shows, for example, that application of a threshold optimized using spheres with walls can lead to an overestimation of volumes by up to 43%. The impact on RCs and estimated volumes can be expected to be even higher for smaller objects imaged with higher resolution.
The aims of this study were to investigate a slightly modified version of a method for the delineation of tumor volume (7) and to correct the measured mean activity within the volume for PVE using correction factors (13). The aim was to study the effect on small volumes such as lymph nodes and small tumors, and it was therefore of particular interest to evaluate possible improvements when including a modern resolution recovery algorithm in the reconstruction. The corrected results were compared with the mean uptake without correction and with the maximum uptake values. For the purpose of the study, we developed a phantom with active volumes ranging from approximately 0.1 to 0.7 mL (6–11 mm in equivalent diameter). The active volumes were made of balloons having walls that are considerably thinner than those in commercially available phantoms, which is a clear advantage when evaluating PVE. We used standard scanning protocols for head and neck tumors, which is one type of study that can use SUVs in the diagnosis of small tumors and lymph nodes (22). However, the methods for tumor delineation and PVE correction are general and could be implemented in the same manner for other studies.
MATERIALS AND METHODS
Phantom
We constructed a phantom using 5 small balloons (Goldbal 3; Balt Extrusion) initially intended for neurointervention. The balloons, which can be seen in Figure 1, were filled to different volumes with an 18F-FDG solution. The volumes were determined using a scale, and the small volumes, which were of interest for PVE correction, ranged from 0.113 to 0.698 mL. The wall thickness of the smallest balloon was estimated to 0.03 mm using the measured weight of the expandable part (3 mg), the estimated equivalent diameter of 6.0 mm, and the density of Latex (Dow Chemical Co.) (0.92 g/cm3 (23)). Two larger balloons of 1.468 and 9.172 mL were used as references for activity concentration and for optimizing parameters in the volume delineation method. These balloons were made using 2 fingers of a rubber glove, which were filled with an 18F-FDG solution and tied with a knot. The 1.468-mL balloon showed signs of leakage and was hence not included in further analysis. The 5 balloons intended for neurointervention will be referred to as the balloons, and the larger, homemade balloon will be referred to as the reference balloon.
The smallest of the 5 balloons was basically spheric, whereas the larger one was ellipsoidal, with a height and width as defined in Figure 2. An equivalent diameter was calculated as the diameter of a sphere with the measured balloon volume, determined from the scale. Table 1 lists the volume (as measured from the scale), diameter, height, and equivalent diameter of the balloons.
The balloons and the reference balloons, attached to syringes, were mounted to hang from the lid of a plastic box filled with 10 L of a solution of water mixed with 18F-FDG (Fig. 2). The 18F-FDG and water solution, which served as the background, was mixed to a homogeneous activity concentration using a propeller connected to a drilling machine.
Clinical 18F-FDG PET/CT examinations showed that a typical background in the head and neck region of a patient, 1 h after injection of 4 MBq/kg, had an SUV of about 0.7, corresponding to an activity concentration of approximately 2 kBq/mL. By adding activity to the background, we could modify the simulated lesion-to-background activity ratio without refilling the balloons. During the course of the phantom experiment, we kept the same clinically realistic noise level in the background by varying the scan time, taking background activity and radioactive decay into account. The phantom was first imaged with no background activity. The background activity was then gradually increased in 6 steps, giving—in addition to the zero-background measurement—a series with balloon-to-background ratios of 14.9, 12.5, 9.1, 6.0, 4.1, and 2.0. The images from the 2.0 ratio measurement, however, were discarded because the balloons were not visible with such a low ratio. The ratios that were used correspond to simulated SUVs in a lesion, ranging from approximately 3 to 10 with a background of approximately 0.7.
Data Acquisition
The PET/CT Discovery 690 (GE Healthcare) scanner at the Nuclear Medicine Department, Umeå University Hospital, was used for image acquisition and reconstruction. The VUE-point HD (VP HD) and SharpIR iterative reconstructions were used as supplied with the scanner. A CT scan was obtained for the purpose of attenuation correction, followed by a dynamic PET scan of 10 equal-length time frames to get 10 measurements of the same data for use with statistical analysis.
The images were reconstructed with 2 different 3-dimensional ordered-subset expectation maximization scatter- and attenuation- correction reconstruction methods, as supplied with the PET/CT scanner. Both methods are routinely used for head and neck 18F-FDG PET/CT patients at Umeå University Hospital. The VUE-point HD (VP HD) reconstruction was applied using 2 iterations, 24 subsets, and a gaussian postprocessing filter of 6.4 mm in FWHM. The SharpIR reconstruction method (18), which includes a resolution recovery, was applied using 3 iterations, 24 subsets, and a gaussian postprocessing filter of 3 mm in FWHM. Both reconstructions generated a 50-cm field of view (FOV), 256 × 256 image matrix (1.95 × 1.95 mm pixel size), and slice thickness of 3.27 mm.
To allow for a long experiment time, we started with a relatively high activity concentration of 62 kBq/mL in the balloons. We could therefore reduce the scan time to 2 min per FOV instead of the clinically used 4-min patient scan time per FOV for a head and neck 18F-FDG PET/CT study.
PSF
The method for PVE correction requires knowledge about the PSF, which was measured according to the following procedure. The end of a thin capillary tube was filled with 18F-FDG and served as an approximate point source. Because the PSF is dependent on the position in the PET scanner, the capillary tube was placed at different distances from the origin in the transversal plane (at radii 0, 1, 2, 5, 10, and 15 cm).
The PSF images were reconstructed with the 2 algorithms described in the “Data Acquisition” section, and a 3-dimensional gaussian function was fitted to the result, using the least-squares method. The maximum value in the PSF image was selected, and the vectors in x, y, and z directions that included the maximum value were used for analysis. For all 3 vectors, the sum of the squared differences between the measured values in each voxel and the integral of the gaussian function over the length of 1 voxel was minimized for varying mean values and FWHMs. The software imlook4d (www.diccom-port.com) and Matlab (The MathWorks, Inc.) was used for the analyses.
All balloons were located within 10 cm of the origin of the FOV, which is also the region in which tumors or lymph nodes typically are found in an 18F-FDG head and neck study. Within this area, we could not find any position dependency of the FWHM; therefore, a mean of the 0- to 10-cm measurements was used for the PVE corrections.
Image Processing
The image processing can be divided into 2 parts: the determination of the volume of each balloon and the PVE correction of the mean activity within the volume.
Several methods for volume delineation have been proposed (7,24–27), and we chose a threshold-based method (7) that has been shown to yield accurate results (28). The threshold, Ithreshold, is calculated as a weighted sum of the balloon and background intensities (Imean and Ibackground), as in Equation 1. ϵ is a weighting factor that will be described below.
Imean has been proposed to be calculated as in Equations 2 and 3 (7,28), where Pi is the value of voxel i, Imax is the maximum voxel value in the balloon, and fi is the elements in a matrix of the same size as the PET image.
In Equation 3, Imean is, however, dependent on the background level. We propose a small change, where the threshold for Imean is proportional to the difference between maximum activity and background activity as in Equations 4 and 5:
This gives a threshold that is less dependent on background activity. For Ibackground, we used the mean intensity within a manually defined VOI of 4,725 voxels in the constant background volume. ϵ is a parameter depending on image resolution that needs to be optimized for the scanner and for different image reconstructions (28), which are performed by varying ϵ in steps of 0.025 and selecting the value that yields the VOI size closest to the true volume of the 9.172-mL reference balloon. For each background, the ϵ corresponding to the most accurate result was selected. The mean value of these ϵ values, rounded to closest 0.025 step, was then used for all backgrounds.
PVE correction was performed using correction factors, determined with the calculated VOI as a mask, which is convolved with the system PSF (5,13). The idea is that the VOI has the shape of the true uptake, which in PET images is blurred by the PSF. The technique is based on the assumption that the image has 2 homogeneous regions: tumor–balloon and background. Calculating the biases introduced by blurring the VOI will yield the correction factors α and β in Equations 7 and 8 (5). α is proportional to the activity left in the VOI after spill-out, and β is proportional to spill-in from the surrounding background. VOI in the equations is a 3-dimensional matrix of the same size as the PET images of the balloons, where all voxels within the active lesion are set to 1 and all other voxels have value of 0. PSF is a matrix of the PSF of the same size as the VOI, and i and j are indices for the voxels in the matrices. ONE is a matrix with every voxel value equal to 1, and ⊗ is the convolution symbol.
If Itotal is the mean activity concentration measured in the balloon in the PET image, Icorrected is the real mean activity concentration in the balloon, and Ibackground is the mean activity concentration in the background, the following equation gives the balloon activity (13):
Image Analysis
For all 10 frames, for each balloon in each background, we collected the values for uncorrected activity, corrected activity, and the maximum voxel value for the uncorrected volume. The maximum voxel value was included, because this is a common method for measuring SUVs of small objects. All values were divided by the true uptake to enable comparison. The reference balloon was used for determining true uptake, using the mean activity concentration of the 27 innermost voxels in the PET image volume. The 10 frames of the same data were used to calculate means and corresponding SDs. The values were plotted for each background as a function of the volume of each balloon. In addition, the volumes are presented for each background as a plot of the estimated balloon volume as a function of the true volume, determined from weighing the balloons.
RESULTS
Volume Estimation
The weighting factor ϵ in Equation 1 was determined for the reference balloon in all background levels, and the result can be seen in Table 2. The mean for ϵ was calculated as 0.35 for VP HD images and 0.275 for SharpIR images. These values were used to determine the volumes of the 5 small balloons. The highest volume error for the largest balloon in the VP HD images using an ϵ of 0.35 was for the 0 background measurement, where the error is 14% of the true volume. For images reconstructed with SharpIR, using an ϵ of 0.275 results in a maximum error of 7%, for the background ratio 4.1.
Figure 3 shows a comparison of the methods for defining Imean and I′mean in Equations 3 and 5, respectively. The figure shows that the bias in the estimated volume is smaller for the small balloons when using I′mean instead of Imean in Equations 3 and 5, respectively. The difference between the methods can be seen only for high backgrounds (ratios, 6.0 and 4.1, respectively).
The measured volumes from 3 backgrounds can be seen in Figure 4. According to the figure, volumes as small as 0.35 mL (8.8 mm in equivalent diameter) can be correctly estimated in images reconstructed with SharpIR for a large spread of lesion-to-background ratios. These volumes had corrected means within 1 SD from the true value. The SD increased for lower lesion-to-background ratios. For lesions with lower uptake, the uncertainties are higher and a small object can be mistaken for a larger one. For example, the SDs are always higher for the lesion-to-background ratio of 4.1 than for the ratio of 14.9.
In Figure 4B it can be seen that the volumes determined from the VP HD reconstructions are inferior to those reconstructed using SharpIR. The estimated volumes are far from the true volumes, with an error as high as 100% for the 0.35-mL balloon with a ratio of 14.9. The estimates also depend on the background level, with lower estimated volumes for higher backgrounds.
PVE Correction
The PSF was measured to have a FWHM of 3.2 mm in the radial direction and 4.7 mm in the axial direction for the SharpIR reconstruction. The corresponding values for VP HD are 7.3 mm in the radial direction and 6.1 mm in the axial direction. The differences between the values at different radii from the origin were never larger than 0.3 mm and showed no correlation to positions within a radius of 10 cm. The mean values were used for calculating the correction factors according to Equations 7 and 8.
The results of uncorrected and PVE-corrected activity concentrations within the estimated VOIs are shown in Figure 5, together with the maximum voxel values within the uncorrected VOI. Figure 5 shows that the activity estimates have means within 1 SD from the true values for all balloons larger than 0.35 mL in images reconstructed with SharpIR. For VP HD, the measured relative uptake is far from 1, both before and after partial-volume corrections, as shown in Figure 5. The activity estimates are background-dependent, with lower activity estimates in the balloons with lower backgrounds.
Figure 5 also shows that using the maximum uptake as an activity measure gives an overestimate in images reconstructed with SharpIR. Only a few of the mean maximum values are within 1 SD of the true value. The overestimates vary with balloon size and are as large as 46% for the second largest balloon in Figure 5A. Figure 5 also shows that the SD of the maximum uptake is comparable to, or higher than, the PVE-corrected uptake.
DISCUSSION
We have evaluated methods for delineation and quantification of active volumes smaller than 0.7 mL on images reconstructed with 2 different algorithms. Together, these methods compose a technique for volume delineation and quantification of small active volumes, such as lymph nodes and small tumors. No prior information on lesion size or shape is required.
The phantom developed for this study is useful for studies of PVEs. Most important, it eliminates the effect of plastic walls that most other phantoms have, thus making it more similar to the patient geometry. Compared with this phantom, patient images can be expected to give slightly degraded results due to motion, such as from breathing, although many studies are performed using motion correction (29–31). Another difference is that the phantom has a homogeneous activity concentration within both the tumor and the background, which usually is not the case in patient images. For the small lesions considered in this study, the heterogeneities within the lesion in patients should not be too comprehensive. PVE correction of larger heterogeneous tumors will be affected differently, depending on the characteristics of the heterogeneities. If necrosis is present, the necrotic part will not be included in the calculated volume, which is advantageous for this PVE-correction approach. In general, however, heterogeneity will make the estimated volumes less reliable. If the maximum value is collected from a so-called hot spot within the tumor, there is a risk that the calculated threshold is too high to include parts of the tumor with relatively low uptake. Other approaches for calculating thresholds should then be considered. For example, the hot spot can be discarded from the VOI, which is used for determining the maximum value. The case of a heterogeneous background must also be considered, in which case a mean of an ROI or VOI surrounding the tumor could be used.
Figure 4A shows that the limit for reliable volume delineation in images reconstructed with SharpIR is about 0.35 mL. For a spheric volume, this would correspond to a diameter of 8.8 mm, which is approximately 2.7 times the image resolution. Smaller volumes do not have maximum values representative of the uptake (5), and the threshold values will therefore be biased. For the same reasons, balloons of these sizes cannot be correctly delineated in images reconstructed with VP HD. The largest balloon, 0.698 mL if it were spheric, would have an equivalent diameter of 11 mm. This diameter is only 1.5 times the image resolution in images reconstructed with VP HD, which is too small for correct volume determination.
Correct volume delineation is important for the PVE correction, which implies that PVE corrections have fairly accurate results for volumes larger than 0.35 mL in images reconstructed with SharpIR. The results in Figures 5A, 5C, and 5E show lower SDs for higher lesion-to-background ratios but approximately the same means for all ratios. Lower lesion-to-background ratios imply more variation in the results. Because the volumes are incorrectly delineated in images reconstructed with VP HD, the activity estimation is relatively poor. If more accurate volume estimates were available from other imaging modules such as CT or MR, the results would most likely be substantially improved, both for VP HD and for volumes smaller than 0.35 mL using SharpIR.
The method used for PVE correction is fairly sensitive to deviations in the PSF, making accurate measurements crucial. In this study, the objects were positioned in the central part of the FOV. The PSF was found to be fairly constant in this area, and a constant PSF was therefore assumed. If the studied objects are outside the central FOV, it would be recommended to use a spatially varying PSF. The SharpIR algorithm includes correction for spatial variations in the PSF, which should cause a less position-dependent resolution, making the spatially varying PSF a smaller problem, albeit still a problem. Because the image reconstruction algorithms use postfiltering, the largest contribution to the PSF comes from the filter.
The PVE-corrected values in this study are generally more accurate than the maximum values for the VOI and have less statistical variation, thus making this the superior method. Statistical variation is increased, compared with uncorrected values, due to uncertainties in volume delineation, but the mean is substantially more accurate. The method is easily implemented because both lesion delineation and PVE correction can be automated from a maximum value and a background VOI. This automation also makes the method relatively user-independent. The only subjective parameter is the background VOI, which is manually drawn in this study.
A high number of studies have earlier been made on the subject of PVEs (9,10,19). However, many of these are performed on phantoms with spheres with relatively thick walls, which reduce effects from background activity. This degradation is small when the FWHM of the PSF is as large as 6–7 mm but increases substantially with an FWHM of 3–4 mm in the case of one of the reconstruction algorithms evaluated in this study. Most studies are also made with spheres larger than 10 mm in diameter, which is fairly large, compared with the resolution using modern algorithms for image reconstruction.
CONCLUSION
The tested method for volume delineation gives accurate estimates of volumes of active lesions of 0.35 mL and larger on images reconstructed with a modern resolution recovery method. These lesions could also be correctly quantified in terms of activity. Images reconstructed without resolution recovery were not appropriate for volume delineation and thereby not for activity quantification of the small lesions in this study. The results further support implementation of resolution recovery methods in PET imaging.
DISCLOSURE
This study was jointly supported by the Swedish Cancer Society, Lion's Cancer Research Foundation, Umeå University, the Faculty of Medicine at Umeå University, and the University Hospital of Umeå. No other potential conflict of interest relevant to this article was reported.
Footnotes
Published online May 8, 2013.
REFERENCES
- Received for publication February 22, 2013.
- Accepted for publication February 24, 2013.