* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Segmentation and Alignment of 3-D Transaxial Myocardial Perfusion Images and
Aphelion (software) wikipedia , lookup
3D television wikipedia , lookup
Medical imaging wikipedia , lookup
Indexed color wikipedia , lookup
Edge detection wikipedia , lookup
Computer vision wikipedia , lookup
Anaglyph 3D wikipedia , lookup
Spatial anti-aliasing wikipedia , lookup
Image editing wikipedia , lookup
Stereoscopy wikipedia , lookup
Stereo display wikipedia , lookup
Segmentation and Alignment of 3-D Transaxial Myocardial Perfusion Images and Automatic Dopamine Transporter Quantification Leo Bergnéhr April, 2008 Reg nr: LiTH-IMT/MI30-A-EX--08/465--SE Segmentation and Alignment of 3-D Transaxial Myocardial Perfusion Images and Automatic Dopamine Transporter Quantification Master’s Thesis in Biomedical Engineering at Linköping University by Leo Bergnéhr April, 2008 Reg nr: LiTH-IMT/MI30-A-EX--08/465--SE Supervisor: Joakim Rydell Examiner: Magnus Borga Abstract Nuclear medical imaging such as SPECT (Single Photon Emission Tomography) is an imaging modality which is readily used in many applications for measuring physiological properties of the human body. One very common type of examination using SPECT is when measuring myocardial perfusion (blood flow in the heart tissue), which is often used to examine e.g. a possible myocardial infarction (heart attack). In order for doctors to give a qualitative diagnose based on these images, the images must first be segmented and rotated by a medical technologist. This is performed due to the fact that the heart of different patients, or for patients at different times of examination, is not situated and rotated equally, which is an essential assumption for the doctor when examining the images. Consequently, as different technologists with different amount of experience and expertise will rotate images differently, variability between operators arises and can often become a problem in the process of diagnosing. Another type of nuclear medical examination is when quantifying dopamine transporters in the basal ganglia in the brain. This is commonly done for patients showing symptoms of Parkinson’s disease or similar diseases. In order to specify the severity of the disease, a scheme for calculating different fractions between parts of the dopamine transporter area is often used. This is tedious work for the person performing the quantification, and despite the acquired three dimensional images, quantification is too often performed on one or more slices of the image volume. In resemblance with myocardial perfusion examinations, variability between different operators can also here present a possible source of errors. In this thesis, a novel method for automatically segmenting the left ventricle of the heart in SPECT-images is presented. The segmentation is based on an intensity-invariant local-phase based approach, thus removing the difficulty of the commonly varying intensity in myocardial perfusion images. Additionally, the method is used to estimate the angle of the left ventricle of the heart. Furthermore, the method is slightly adjusted, and a new approach on automatically quantifying dopamine transporters in the basal ganglia using the DaTSCANTM radiotracer is proposed. Sammanfattning Nukleärmedicinska bilder som exempelvis SPECT (Single Photon Emission Tomography) är en bildgenererande teknik som ofta används i många applikationer vid mätning av fysiologiska egenskaper i den mänskliga kroppen. En vanlig sorts undersökning som använder sig av SPECT är myokardiell perfusion (blodflöde i hjärtvävnaden), som ofta används för att undersöka t.ex. en möjlig hjärtinfarkt. För att göra det möjligt för läkare att ställa en kvalitativ diagnos baserad på dessa bilder, måste bilderna först segmenteras och roteras av en biomedicinsk analytiker. Detta utförs på grund av att hjärtat hos olika patienter, eller hos patienter vid olika examinationstillfällen, inte är lokaliserat och roterat på samma sätt, vilket är ett väsentligt antagande av läkaren vid granskning av bilderna. Eftersom olika biomedicinska analytiker med olika mängd erfarenhet och expertis roterar bilderna olika uppkommer variation av de slutgiltiga bilder, vilket ofta kan vara ett problem vid diagnostisering. En annan sorts nukleärmedicinsk undersökning är vid kvantifiering av dopaminreceptorer i de basala ganglierna i hjärnan. Detta utförs ofta på patienter som visar symptom av Parkinsons sjukdom, eller liknande sjukdomar. För att kunna bestämma graden av sjukdomen används ofta ett utförande för att räkna ut olika kvoter mellan områden runt dopaminreceptorerna. Detta är ett tröttsamt arbete för personen som utför kvantifieringen och trots att de insamlade bilderna är tredimensionella, utförs kvantifieringen allt för ofta endast på en eller flera skivor av bildvolymen. I likhet med myokardiell perfusionsundersökningar är variation mellan kvantifiering utförd av olika personer en möjlig felkälla. I den här rapporten presenteras en ny metod för att automatiskt segmentera hjärtats vänstra kammare i SPECT-bilder. Segmenteringen är baserad på en intensitetsinvariant lokal-fasbaserad lösning, vilket eliminerar svårigheterna med den i myokardiella perfusionsbilder ofta varierande intensiteten. Dessutom används metoden för att uppskatta vinkeln hos hjärtats vänstra kammare. Efter att metoden sedan smått justerats används den som ett förslag på ett nytt sätt att automatiskt kvantifiera dopaminreceptorer i de basala ganglierna, vid användning av den radioaktiva lösningen DaTSCANTM . Acknowledgements This project has been carried out at Exini Diagnostics AB and I would like to thank everyone there for their support and friendliness. Special gratitude goes to my company supervisor, professor Lars Edenbrandt, who initiated the project and invited me to work on it. Thank you; bringer of good confidence! Additionally, I would like to thank my university supervisor, Joakim Rydell, at the Department of Biomedical Engineering. In the beginning of this project I had a lot of questions, to which you always had enlightening answers, despite completing a PhD at the same time. Finally, I give a special acknowledgement to my family and near friends; not for helping me while formulating mathematical expressions, but for just being there when I was not. Thank you! Table of Contents 1 Introduction 1.1 Background 1.2 Objectives . 1.3 Method . . 1.4 Outline . . 1.5 Notation . . 2 Scintigraphy 2.1 Nuclear medicine physics . . . . . . . . . . . . 2.1.1 Decay of radioactive isotopes . . . . . 2.1.2 Detection of radiation . . . . . . . . . 2.2 Single Photon Emission Computed Tomography 2.2.1 Image quality . . . . . . . . . . . . . . 2.3 Scintigraphic examinations . . . . . . . . . . . 2.3.1 Myocardial perfusion SPECT . . . . . 2.3.2 DaTSCAN Quantification . . . . . . . 2.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Image Segmentation 3.1 Segmentation methods 3.1.1 Thresholding . 3.1.2 Region growing 3.1.3 Active shapes . 3.2 Image registration . . . 3.3 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 The Morphon Method 4.1 Displacement field estimation . . 4.1.1 Quadrature phase . . . . 4.1.2 Least square solution . . 4.2 Displacement field accumulation 4.3 Displacement field regularisation 4.4 The model . . . . . . . . . . . . 4.4.1 Deformation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 2 2 3 4 . . . . . . . . . . . . . . . . . . 5 5 5 6 8 8 9 9 10 11 . . . . . . . . . . . . 13 13 13 14 15 15 17 . . . . . . . 19 20 20 24 24 25 26 26 . . . . . . . . . . . . 8 Table of Contents 4.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 Rotation of Myocardial Perfusion Images 5.1 Pre-processing of images . . . . . . . 5.2 2-D Morphon segmentation . . . . . . 5.2.1 Locating the heart . . . . . . 5.2.2 Segmentation . . . . . . . . . 5.3 3-D Morphon segmentation . . . . . . 5.4 Angling of the heart . . . . . . . . . . 5.4.1 Angle estimation . . . . . . . 5.4.2 Rotation . . . . . . . . . . . . 5.5 Summary . . . . . . . . . . . . . . . 6 Dopamine Transporter Quantification 6.1 3-D Morphon segmentation . . . . . . . . . . . 6.1.1 Locating the basal ganglia . . . . . . . 6.1.2 Segmentation . . . . . . . . . . . . . . 6.2 Quantification . . . . . . . . . . . . . . . . . . 6.2.1 Weighted principal component analysis 6.3 Summary . . . . . . . . . . . . . . . . . . . . 7 Results 7.1 Segmentation and alignment of 3-D transaxial myocardial perfusion images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.1.1 Segmentation . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.1.2 Alignment . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.2 Automatic dopamine transporter quantification . . . . . . . . . . . . . 7.2.1 Segmentation . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.2.2 Quantification . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 9 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 . . . . . . . . . 29 29 30 31 31 32 33 34 35 35 . . . . . . 37 37 38 39 40 41 43 45 45 45 46 46 52 52 Discussion 8.1 Segmentation and alignment of 3-D transaxial myocardial perfusion images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.2 Automatic dopamine transporter quantification . . . . . . . . . . . . . 59 59 61 Conclusions 63 1 Introduction 1.1 Background Scintigraphy is a widely used medical imaging modality for observing physiological properties in the human body. Two examples that are considered in this thesis are scintigraphic heart images and images of the basal ganglia in the brain. At the time of writing, a scintigraphic examination of the heart is at hospitals carried out by a technologist. The main tasks for the technologist is to prepare the patient for the diagnostic procedure, and later on to perform a sequence of manual operations on the acquired images. These operations includes positioning the location of the heart to a predefined area in the images, determining the orientation of the heart and concluding whereas the image quality is sufficient for the doctor to base a diagnosis on. The reorientation of the heart is necessary due to the simple fact that different individuals do not look the same; not on the outside, nor on the inside of the body. In heart scintigraphy, this presents a problem. For a doctor to give a qualitative diagnosis, the result of an examination is compared to other images. This may occur directly, by looking at different images from other patients, or indirectly, by taking into account the vast amount of images in the past that is the experience of the doctor. However, both these methods depend on the presumption that the images have been observed from the same point of view, which would not be the case if the technologist did not manually compensate for angular and rotational differences of the orientation of the heart, between different scintigraphic examinations. However, as different technologists have different experience and slightly different ways of making these adjustments, the result will not be consistent for all examinations. An automatic process for doing this would not only hopefully make the result more consistent, but also save time that the technologist has to spend on performing these tasks manually. One method for doing this has been proposed by Germano et al. [1], hoping 2 Chapter 1. Introduction to move more and more into a fully automated process [2]. However, as this method is based on intensity of the images, a more insensitive method may be desired. Moreover, scintigraphic images of the basal ganglia in the brain are beginning to be more and more popular to diagnose Parkinson’s disease. At the time of writing, quantification is used to measure a number of properties from the images of the basal ganglia, thus being able to determine the state of the patient. This process is tedious and can present difficulties if the state of the basal ganglia is poor. An automatic method for doing this quantification would benefit in much the same ways as the case with scintigraphic heart examinations. The process would be much faster compared to manual quantification, and the variability between medical staff would be eliminated. Additionally, manual quantification is performed in two dimensions and hence it could be beneficial to quantify in 3-D. Morton et al. [3] has evaluated two computer driven methods in the same field and found that a 3-D based method gives better concordance with visual assessments. 1.2 Objectives The primary aim of this project is to investigate if an automatic reorientation of threedimensional, transaxial myocardial perfusion SPECT images is possible, and if so, to design and implement an automatic segmentation method that can distinguish, and compensate for, differences in anatomical structure of the heart; particularly the angle and position of the left ventricle. The method should be able to segment and reorient the left ventricle from transaxial images into short-axis (oblique) images. Secondarily, the same method should, after little modification, be applicable to scintigraphic images of the basal ganglia. These should be segmented, and a variety of useful parameters should be extracted. These parameters should correspond to parameters that are extracted in manual quantification. 1.3 Method The work in this project has been divided into two different parts. First, the segmentation algorithm was implemented, and secondly the algorithm was applied on the two problems at hand, which first required some pre-processing in each case. ComputaR tions and simulations have been carried out with the help of M ATLAB! , version 6 and 7, from The MathWorks, Inc. The main part of the functions and methods used has been implemented with the help of M ATLAB-included primitive functions. However, to decrease computation time, a C-compiled version of convolution has been used, with courtesy of Gunnar Farnebäck at the Department of Biomedical Engineering, Linköping 1.4 Outline 3 university. To decrease the time of implementation, some functions from the M ATLAB Image Processing Toolbox have also been used. These are the bwlabel() and regionprops() for some simple morphological operations, and tformarray() for rotating 3-D volumetric objects. All the images used for development of the algorithm have been provided by Swedish hospitals. The scintigraphic heart images originate from Sahlgrenska university hospital in Gothenburg, Sweden, while the DaTSCAN-images of the basal ganglia have been provided by The university hospital of Norrland in Umeå, Sweden. 1.4 Outline Here follow a short summary of the contents of each chapter of this thesis. Chapter 2 The basics of nuclear physics is presented along with the essential parts of a nuclear medical examination. In this chapter, relevant facts about myocardial perfusion examinations and DaTSCAN examinations are also introduced. Chapter 3 In this chapter, image segmentation and registration is introduced. Some commonly used techniques are presented along with their good and poor properties. Chapter 4 The main method in this thesis, the Morphon method, is gone through in some detail in this chapter. Chapter 5 Here the problem with the estimation of angles of the heart in myocardial perfusion images is addressed with the help of the earlier presented methods and some complementary techniques. Chapter 6 This chapter involves applying the Morphon method on the problem of automatic quantification of dopamine transporters and introduces some additional tools for solving this. Chapter 7 The results of both parts of the thesis are presented in this chapter. Chapter 8 The results of the previous chapter are discussed and some future development of the methods is proposed. Chapter 9 This chapter concludes the whole thesis, its results and possible effects on the problems at hand. 4 1.5 Chapter 1. Introduction Notation This thesis inevitably involves some mathematics. The notations used are presented below. s, S v v̂ |v| M v T , MT M−1 s∗ (s1 ∗ s2 )(x) E[x] Scalar Vector Unit vector Norm of vector Matrix. Transpose of vector and matrix Inverse of matrix Complex conjugate of the complex signal s Convolution of s1 and s2 Expectation value of the stochastic variable x 2 Scintigraphy To be able to understand the process of generating scintigraphic images using nuclear medical methods such as SPECT (Single Photon Emission Computed Tomography), the basic physics behind it is laid forward in this chapter. Details on the physical aspects and more information regarding different isotopes can be found in [4] and [5, 6] respectively. 2.1 Nuclear medicine physics Nuclear physical applications in medicine have been in clinical use for many decades. The first experiments started somewhere around 1935, when Chiewitz and de Heversy [7] tested the technique on rats. At first, nuclear medicine was mainly used in therapeutical treatments, such as that of cancer, introduced by Lindgren [8]. However, as acquisition instruments and radiotracers improved, it also became a very popular way of examining physiological and anatomical properties of many parts of the human body, for example the heart [9] and the brain [10]. 2.1.1 Decay of radioactive isotopes In all imaging modalities, some kind of conceptual probe is used to gather information about the observed object at hand. This probe can be very different in different situations. For example, at an ultra sound examination, mechanical waves are used to gather information about the object, by propagation and reflection between different acoustic impedances, and in magnetic resonance imaging, radio frequency fields, interacting with fundamental properties of matter, are the probes at hand. In nuclear imaging, radiation is used to acquire information about an object. Radiation, which originates from radioactive isotopes1 . 1 An isotope of an element is a specific variation of that element, where the number of neutrons in the nucleus differ between different isotopes. 6 Chapter 2. Scintigraphy There are a variety of radioactive isotopes used in nuclear medicine. Depending on the type of examination and the region of interest in the body, the energy of the decaying particles from the isotopes has to be taken into consideration. For example, in SPECT, which is the modality of interest in this thesis, the most common isotope used is 99m Tc (technetium), which is suitable due to, among other things, its half-life and the energy of its emitted photons [11]; that is about the same as for conventional X-ray γ-energies2 . A radioactive isotope (radioisotope) is, by definition, unstable. That means that the nucleus of that element can, and will, spontaneously decay with a certain probability, given a period of time. The faster the nucleus decays, the faster will it reach its half-time value; that is when 50 percent of the amount of isotope has vanished in favour of decay radiation of some sort. There is a tremendous amount of different isotopes, both occurring naturally on Earth and synthetically produced with the help of nuclear reactions. There are also many different types of decaying processes for different types of nucleus. These different types of decay have been categorised after the type of particle emitted in the process. The most known particle emission, or radiation, is γ-radiation, where the emitted particles are photons, or light. Other common types of radiation are α-, β − - and β + -radiation [4]. β- and γ-radiation is of much interest from a nuclear medical point of view since they are the key particles in SPECT, while α-radiation is attenuated quickly in most media and is more harmful to human tissue, and therefore is used sparsely in medical diagnostics [5]. β −/+ -particles are more commonly known as electrons (e− ) and positrons3 (e+ ). 2.1.2 Detection of radiation As was stated earlier, γ- and β-radiation is preferable in nuclear medical diagnostics, however the methods for collecting images with the two methods differ. In SPECT imaging, isotopes like 99m Tc, mentioned in the first paragraph of 2.1.1, are used for different purposes. While some have been stated, the one more important for the basic principles of this imaging modality is that these kind of isotopes decay with a single emission of a γ-particle. Since the nucleus decays spontaneously, it will have an equal probability of emitting a photon in any direction [4], which is illustrated in Figure 2.1. The emitted photon will have a total energy of about 140 keV, which is suitable for many photon detectors at the time of writing [6]. To detect the emitted γ-rays, different types of detectors can be used. However, particularly in nuclear medicine scintigraphy, a common method is to use a so called scintillator for this purpose. A scintillator is basically a type of 2 The Greek letter γ is in physics often used as a symbol for photon or light. The positron is the antiparticle of the electron, which means it has exactly the same properties, except for electrical charge, which is the opposite. 3 2.1 Nuclear medicine physics 7 ! Figure 2.1: The one-step process when a radioactive isotope decays with γ-radiation. crystal4 with specific characteristics. These crystals have in common that they undergo a process called fluorescence when interacting with γ-rays of an energy matching the characteristics of the crystal. Fluorescence is a process where an element will absorb a γ-ray and immediately emit light within the visible range. This is due to the photo electric effect, where electrons in an atom can be pushed to a higher energy state and then, when falling back to the original state, send out a flash of light [4]. This light can later be amplified, for example with the help of a photomultiplier, and then detected. Guerra [6] points out that while the scintillator is more sensitive and therefore preferable in many applications, other detectors are used, such as gas-filled chambers and solid state semiconductors. 1 e+ 2 ee+ ! Figure 2.2: The two-step process when a radioactive isotope decays with a positron (β-decay), which shortly thereafter reacts with an electron, creating γ-radiation. While single photon emitters (SPE) such as 99m Tc are used to detect one γ-ray at a time, 4 Common crystals that are used in nuclear medicine are: Cerium doped lutetium orthosilicate (LSO), cerium doped gadolinium orthosilicate (GSO) or yttrium aluminium perovskite (YAP) [6]. 8 Chapter 2. Scintigraphy there is another method used in nuclear medicine that utilise positron emitters (PE), such as Oxygen-15. In accordance to the name, positron emitters decay by emitting positrons; that is with β + -decay (see figure 2.2) [5]. However, as an antiparticle to the electron, the positron will readily react with electrons it encounters, annihilating both particles and sending out two photons near exactly in the opposite direction to each other. In vivo, the positron will react almost instantly with an electron, producing two photons that will radiate from the same source [4]. Consequently, even though another type of nuclear decay is the source of radiation, photons will, in the end, be detected by both type of radioisotopes. In nuclear medicine, the imaging modality using PE is referred to as PET (Positron Emission Tomography). PET-images are very similar to SPECT-images, but they will not be studied any further in this thesis due to the lack of PET-image material. 2.2 Single Photon Emission Computed Tomography The imaging modality that is the source of all images in this thesis is SPECT. In SPECT, photons from single photon emitters are detected in a manner as described in 2.1.2. SPECT is a widely used modality and has been used for some time. One of the great advantages of SPECT compared to projectional modalities like X-ray imaging is its ability to acquire images in 3-D, thereof the word tomography in the abbreviation. Larsson [12] presented one of the first working tomographic systems and the image quality has been much improved ever since. 2.2.1 Image quality What should be mentioned about the images generated in a SPECT examination is that they are low in resolution and signal to noise ratio compared to images from other imaging modalities, such as CT (computed tomography) and MRI (magnetic resonance imaging). A typical SPECT-image has a resolution of 643 voxels5 , while the other modalities can have resolutions of 5123 voxels and above [5]. The low resolution in SPECT-images originates from the fact that the photon count, i.e. the number of photons that are detected, is quite low and that SPECT registers physiological processes in the body, not anatomical structures as CT and MRI are mainly used for. Further more, even though the photon count could probably be increased with a more radiating radioisotope, it would consequently increase the dose of radiation applied to the patient. 5 The three-dimensional correspondence to an image pixel. 2.3 Scintigraphic examinations 2.3 9 Scintigraphic examinations There are many steps involved in a clinical scintigraphic examination. In this section, the process of examining myocardial perfusion as well as dopamine activity in the basal ganglia, and some problems that arrises with manual diagnoses in such examinations, will be presented. In most nuclear medical applications the radioisotope that generates the image signal is tagged to a compound of some sort, called a pharmaceutical agent, which is injected into the patient. Together, the isotope and the agent are named a radiotracer, which is used to attach itself to the organ that is to be investigated, and to make the radioisotope less hostile to the body. For example, as exemplified by Wilson et al. [13], in myocardial perfusion SPECT examinations, tert-butyl is commonly used, and for the brain images studied in this thesis, the popular DaTSCANTM radiotracer from GE Healthcare [14] has been used. 2.3.1 Myocardial perfusion SPECT Myocardial perfusion SPECT examinations are widely used as a routine examination for patients with chest pain. Some time after the technologist has injected the radiotracer into the patient, he/she will be placed in a camera station, often referred to as γ-camera, with a detector suitable for SPECT γ-energies [6]. After the camera has finished with the acquisition the image is reconstructed using some reconstruction technique, e.g. as proposed by Perkins [5]; filtered back-projection or iterative reconstruction. This will Figure 2.3: A 3-D volume image displayed as iso-surfaces (left) and two slices (center and right), with the left ventricle of the heart marked with an arrow. yield the images, which are tomographic 3-D volume data, i.e. each voxel in space has three space-coordinates (x, y, z) as well as an intensity level. In practice, the 3-D volume consists of a number of slices. This makes it convenient to display the volume in the form of a stack of 2-D images, which is often used clinically. Strauss and Miller 10 Chapter 2. Scintigraphy [15] describes the procedure in more detail. An example of a 3-D volume acquired using SPECT can be found in Figure 2.3. The next task for the technologist is to rotate the acquired images of the heart into predefined angles. This is done by choosing two slices from the volume, which the technologist finds good for estimating the angle of the heart [15]. An example is given in Figure 2.4. Figure 2.4: The manual rotation of the heart performed by the technologist. The heart is here illustrated before (first row) and after (second row) rotation, for slices through each axis respectively. It is obvious that the centre line of the left ventricle, which is commonly seen as the line that should align with the axes after rotation [15], cannot be exactly defined for such an asymmetrical object as the left ventricle. Even more so when the images acquired are of poor quality, or if the uptake of radiotracer in the left ventricle is poor, due to, e.g., myocardial infarction. This poses a problem for myocardial perfusion SPECT images, and is discussed in this thesis. 2.3.2 DaTSCAN Quantification DaTSCAN quantification is another SPECT-application that is very commonly used to diagnose Parkinson’s disease. If a patient suffers from Perkinson’s disease, the basal ganglia are often degenerated, i.e. the number of dopamine receptors are reduced. Moreover, the degeneration is generally first observed from the same place of the ganglia. However, the quantification methods that are used introduces great inter- and intraoperator variability, which is discussed by Tatsch et al. [16]. Often, a two-dimensional quantification method with predefined or adjustable regions of interest (ROI) is used, 2.4 Summary 11 which requires operator interaction throughout the whole quantification process. Visual interpretation of the images are also common, which introduces even more sources of error, e.g. the use of different colour maps; noted in [16] as: Use of non-continuous color tables may overestimate findings due to abrupt color changes. Tatsch et al. [16] As with myocardial perfusion SPECT images (section 2.3.1), the variability is great between different images. An example of how the basal ganglia are manually quantified is given in Figure 2.5, where a typical ROI has been drawn around each of the basal ganglia, as well as for the background signal of the image. The quantification of the Figure 2.5: Illustrating a typical ROI of the basal ganglia and background area (4) of a manually quantified DaTSCAN-image. images is commonly based on measuring the signal inside the ROI (or different parts of it) of the basal ganglia, and comparing it to some kind of background signal. The background signal has traditionally, with 2-D quantification, been chosen as the visual cortex (see nr. 4 in Figure 2.5), but there is no absolute standard [16]. The quantification procedure can take a significant amount of time for an inexperienced operator and there is little standardisation in the way that the different ROI:s are selected and adjusted. 2.4 Summary After this chapter, the reader should be sufficiently informed on how SPECT-imaging works, both theoretically, physically and on a more practical level, which has been exemplified with the two applications that are involved in this thesis; myocardial perfusion 12 Chapter 2. Scintigraphy SPECT and DaTSCAN quantification. Problems that both applications have in common regarding inter- and intraoperator variability as well as dimensional reduction approximations from 3-D to 2-D, have also been introduced. 3 Image Segmentation Image segmentation is in principle a method for bringing out certain, more interesting, parts of a signal, and has been used in medical applications for many years. The signal may for example consist of a two dimensional image, or as in the case of this thesis, a three dimensional volume. Both cases are common situations in medical informatics. In this chapter, a brief overview of common image segmentation techniques will be covered, and since the principles are the same for signals of different dimension, examples will mainly concern two dimensional images, since they are easier to visualise. Image segmentation in more detail can be found in [17]. 3.1 Segmentation methods There are many different image segmentation methods. From the very simple and intuitive, to techniques based on advanced mathematical concepts. An important issue to consider is therefore what type of segmentation method to apply, given a certain situation. This often depends on the image modality used to acquire the image, as well as the complexity of the part of the image that is to be segmented. 3.1.1 Thresholding In a very simple and clear image, simple and intuitive segmentation methods can often be sufficient. In such images, with a good signal to noise ratio, so called thresholding is often a good choice. Segmentation by thresholding works simply by keeping the part (pixels/voxels) of the image that lies in a certain intensity interval. This is demonstrated in Figure 3.1 b), where the pixels above a certain intensity level in the original image has been set to 1, and the rest of the image to 0, and can also be described mathematically 14 Chapter 3. Image Segmentation with Isegmented ! 1, a < I < b = , 0, o.w. (3.1) where I is the original image and a and b are the thresholds. However, if the image signal is bad and mixed with noise, thresholding will often yield a bad result. This is shown in Figure 3.1 c, where noise has been added to the original image. Figure 3.1: a) (left): An X-ray image of the hand of Wilhelm Röntgen’s wife. b) (centre): The image after thresholding. c) (right): The image after thresholding where noise has been added to the original image. Furthermore, another problem with thresholding is choosing a good threshold level that is not specific to one image. This can be done by for example examining the histogram of the image, or by other more or less sophisticated guesses. At last, segmentation by thresholding is often not a good choice where the interesting parts of an image has the same intensity levels as the rest of the image, and is only separated by structure and shape. In such situations, some kind of image registration is often required for good results. 3.1.2 Region growing A segmentation method that resembles thresholding is region growing [17]. However, in contrast to thresholding, which operates on each point in an image independently of neighbouring points, region growing takes into account the surrounding connected pixels. Region growing requires a starting point in the image. This point should be a known point within the area of interest, and consequently the method requires manual 3.2 Image registration 15 input or some kind of pre-processing. When the point is chosen, the method works iteratively. A neighbouring point is chosen. If the intensity of that point is within a certain intensity level from the starting point, the point is considered part of the final segmentation, and the process starts over, now starting at this point. There are several more or less advanced techniques for implementing region growing, however, it often gets very slow with higher image dimensions. More over, region growing can in its basic form handle images with much noise very poorly. There are ways of reducing noise, but nevertheless, region growing will not take into account the structure of the interesting parts of an image and is highly dependent on image intensity. 3.1.3 Active shapes The thought behind active shapes takes region growing a step further by introducing prior knowledge of the the object that is to be segmented. The goal in active shapes is to find a line in 2-D or surface in 3-D, that encloses the object of interest after segmentation. Active shapes works by first defining a model. The model is built from a data set of other images of the same kind of object that is going to be segmented. In each one of the images in the data set, the same landmarks (often anatomical) have been marked. By observing how the position of the landmarks differ throughout the data set, the variance of each landmark can be calculated [17]. In turn, this is used to define how the model is allowed to move while it is segmented. The actual deformation of the model can be generated in various ways. One common way is to use the energy in the image to calculate edges and borders, to where the model should deform. Compared to region growing techniques, active shapes are much more robust and insensitive to noise. This is due to the prior knowledge introduced about the object, which prevents the model from deforming in an unnatural way. The constraint of active shapes is off course the data set. If the data set is small, the model will not be able to deform sufficiently when introduced to cases that are not represented in the data set. However, with a large data set, active shapes is often a good method in more complicated segmentation situations. 3.2 Image registration In many situations, particularly in medical imaging, simple segmentation methods such as thresholding is not sufficient to yield good results. The segmentation process can then often be helped by adding image registration to the segmentation process. Image registration is in principle a method for deforming one image to align to another image. In medical informatics this is often used to map one image, with one type of information, to another image, containing other types of information. One example is to map an image of a brain acquired with computed tomography (CT) or magnetic resonance 16 Chapter 3. Image Segmentation imaging (MRI) to another image of the same brain, acquired with SPECT. Since the brain may not be in the same position at both times of examination, image registration is used to make the same areas of the two images correspond to the same areas of the brain. In this example, information about the anatomy of the brain, in the CT or MRI images, will then be complemented with physiological information from the SPECT image. Mathematically, image registration can be described by finding a deformation field, or a displacement field, v(x), that after acting on an image, I2 (x) will deform the image in such a way that it will map to the target image1 , I1 (x). The mapping in n dimensions can be described by v(x) : Rn "→ Rn . (3.2) Moreover, to be able to describe the quality of the registration process, the error $2 = $I2 (x + v(x)) − I1 (x)$2 (3.3) is introduced. If this error is zero, the registration process has been completely successful. This is not a realistic goal in practice, since the two images do not contain exactly the same information, in which case image registration would not be necessary. Consequently, v(x) describes how each point in I2 should be moved to minimise $. There are many attempts of solving this problem, and many involve minimising some variation of a least square problem. For example, if the following assumptions are made: 1. v(x) can locally be described solely by a spatial displacement. I(x, t) = I(x + ∆x, t + 1), (3.4) where t denotes the time in the discrete registration process. 2. The image can in each point be approximated with a first order Taylor expansion. I(x + v(x), t + 1) = I(x, t + 1) + ∇I T v, ∇I = [∂I/∂x, ∂I/∂y]T . (3.5) If equations (3.4) and (3.5) are combined, the equation for the so called optical flow, denoted by v, can be found. ∇I T v − ∆t = 0, ∆t = I(x, t) − I(x, t + 1). (3.6) Optical flow is one of the oldest concepts in image registration and computer vision and a lot of research has been done around it. Consequently, there are many methods for 1 Target image is usually the name of the image that is left untouched. 3.3 Summary 17 solving the differential equation (3.6). As a matter of fact, there are many solutions for v in each point in the image; solutions that are not practically usable, since image points can then have completely independent solutions of each other. This is described in more detail in [17], chapter 6:6. However, one robust solution that consider the optical flow as a continuous function is a least square problem formulation: " $2 = ([∇I(xi )]T v(xi ) − ∆t (xi )) (3.7) i By assuming different properties for v in equation (3.7), different types of solutions for the displacement field can be found, with different amount of rigidity [18]. For example, in some situations a mere translation or rotation of v is a good enough registration model, while in other more complex applications, a non-rigid model must be used, which allows for any kind of deformation of I2 . The parameterized solutions for different levels of rigidity can be found in [17]. In this section, the optical flow was found through differences in the intensity of images. While straightforward, this approach is obviously dependent on the assumption in (3.4). There are, however, other more robust procedures for estimating the optical flow in a registration process between two images. Differences in local phase is one such approach, and will be described in more detail in chapter 4. 3.3 Summary In this chapter, an overview of more and less advanced segmentation methods have been presented. From thresholding, which is easy to understand and implement, to registration based methods; derived within a rich mathematical framework and still issue to much research and algorithm optimisation. 4 The Morphon Method The Morphon method is a newly developed method for image segmentation and registration, first introduced by Knutsson and Andersson [19]. Since then, the method has been under refinement and have been applied in several medical applications [20, 21, 22, 23]. This chapter is mainly an overview of the Morphon method, based on [19] and [20]. The method can be brought to a much more advanced level, nevertheless, this simplified (and hopefully faster) version is sufficient for the subject of this thesis. The Morphon method is essentially a registration driven segmentation technique. It works by first estimating a displacement field through differences in local phase between the segmentation target image and a model image, which resembles the target image in e.g. intensity variations. The displacement field is further processed to ensure robustness of the method. The model of the object that is to be segmented is deformed according to the displacement field, and the whole process is iterated until the process is satisfactory and the model has mapped on to the target image. In the beginning of the work of this thesis, a method for solving the stated problem in a satisfactory way had not yet been proposed. After investigating several image segmentation and registration methods, the conclusion was finally that a phase based approach such as the Morphon method had several advantages in comparison, given the problematics of this project. The primary reasons for choosing the Morphon method can be summarised with the following arguments. 1. Phase: As mentioned in section 3.2, a phase based approach on image registration can be advantageous compared to classical techniques with intensity based optical flow estimation. Even more so in the images that is considered in this thesis. Section 2.3.1 shows that SPECT images of the heart can vary considerably in intensity. Not only among different images, but also between the heart and other organs. In such situations, intensity based techniques are inferior to a method such as the Morphon, which, as presented in 4.1.1, is insensitive to intensity variations. 20 Chapter 4. The Morphon Method 2. Noise: If the images are of low quality1 , which is not an unusual case with SPECT images (see section 2.2.1), assumption 2 (equation (3.5) might very well not hold. However, the Morphon method uses robust methods such as displacement field accumulation and regularisation to prevent the displacement field from diverging. 3. Model: The Morphon framework includes an easy and dynamic way for including prior knowledge about the object that is to be segmented. This makes it easy to develop further for other applications, as will be presented in chapter 7, Segmentation and quantification of DaTSCAN images. 4.1 Displacement field estimation A displacement field is essentially a vector field. In this approach, the field is dense, which means that in each point in space, there exists a vector with a certain direction and magnitude. Since the image space is discretised with pixels or voxels, the vector field will have the same size as the image that is to be deformed by it. There are, however, other applications, where the deformation field is not dense, and a vector describes the movement of a block of discrete points. To estimate a displacement field in the Morphon method, analysis of the local phase of the images is made. 4.1.1 Quadrature phase Phase can principally be described as the structure of the image. Mathematically, the phase, as it is used in this thesis, is the response from a certain set of filters that respond to edges and borders in the image. These filters are known as quadrature filters and are defined as follows. Quadrature filter 4.1.1. A filter, f (x), is a quadrature filter if its Fourier transform, F (ω), is zero on one side of a hyper plane through the origin in the Fourier domain. That is, there exists a vector, n̂, such that ! 0, n̂T ω ≤ 0 F (ω) = . (4.1) F (ω), o.w. In this thesis, one special type of quadrature filters have been used, namely lognormal2 filters, which are described, for example in [24], in the Fourier domain as 4 F (ω) = e− B2 ln2 ln 1 2( ω ) ωc (4.2) Quality can be hard to define. Here, quality mainly refers to signal-to-noise ratio and the amount of artifacts. 2 Lognormal filters are often referred to as lognorm filters. 4.1 Displacement field estimation 21 for positive values of ω. Here, B is the relative bandwidth of the filter in octaves and ω c is the centre frequency of F (ω). In Figure 4.1, a lognorm filter is shown in the spatial and frequency domain. For displaying reasons it is shown for only 1 dimension, but the principle is the same for arbitrary dimensions. Due to the quadrature filter definition in the frequency domain, quadrature filters are always complex in the spatial domain. This can be understood since F (ω) '= F (−ω), (4.3) F (ω) = F (−ω). (4.4) while a necessary requirement for f (x) to be real is that !&#" # '()* +,)-./)01 $%+ !&# $%* $%) !&!" $%( $%' ! $%& $%! !!&!" $%" $%# !!&# ! " #! #" $! $" %! $ !! !" !# $ # " ! Figure 4.1: Lognormal filter with real and imaginary parts in the spatial domain (left) and the Fourier domain (right) In computational practice, filters are discrete, and it is often desirable to adjust the finite length of the quadrature filter. A smaller filter will imply less computations while making calculations with the filter. Moreover, the size of the filter will have impact on the size of the items that the filter is able to detect. The size of the filter can, and will, therefore depend on the application at hand. A good way to design a filter kernel, that introduces much control over the filter properties, is suggested by Knutsson et al. [25] and is based on a least square formulation of the filter design problem. The main idea is to introduce a number of ideal filters in each domain where they can be easily described. Here the quadrature filter has been defined in the Fourier domain (4.1.1), which is suitable for this technique. However, it is also desirable to achieve a filter which is small in the spatial domain, i.e. ideally resembling δ(x), an impulse3 . In short, the problem can be formulated as " $2 = $Wj (fj$ − Bj f )$2 , (4.5) j 3 Mathematically known as the Dirac delta function. 22 Chapter 4. The Morphon Method where f is the desired filter in the spatial domain, Bj is the base matrix that when multiplied by f will give f described in domain i, and fj$ is the ideal filter for domain i. Wj is a weight matrix which can be used to make certain parts of the filter more important, i.e. more sensitive to errors. For this case, the only domain other than the spatial is the Fourier domain. Thus, Bj will here either be the Fourier base matrix or the unity matrix. In each case respectively, Bj f will give F (ω) and f . As mentioned above, the phase, φ, of a signal, s(x), is related to the response from these filters. More specifically, phase can be defined as the angle of the filter response between the signal and the filter, derived from q(x) = (s ∗ f )(x), (4.6) where q is the filter response and ∗ denotes the convolution operator. Assuming s(x) is real valued4 , q is complex and can be rewritten on polar form as q(x) = A(x)eiφ(x) . (4.7) Here A(x) is the magnitude of the filter response, and the phase can now be found through φ(x) = arg(q(x)). (4.8) Finally, what is left is to generate a displacement field from the phase of two images. Assuming the two images I1 and I2 , we get two different responses from the same set of filters. Since lognorm filters, from definition 4.1.1, are dependant on the direction of n̂, we actually get one response from each filter in the filter set. A filter set can consist of as many filters wanted, however, there is a minimum of filters needed given a certain dimension of the images to be able to cover the whole Fourier domain. This is due to the fact that the filters have a direction and a limited spread. For example, in two dimensions, at least three filters are needed and in three dimensions, six filters is the minimum to span over the whole domain. This is discussed in more detail for example in [26]. The six directions for 3-D given in [26], for a second order filter, and which are used in this thesis, are n̂1 n̂2 n̂3 n̂4 n̂5 n̂6 4 = c[b, −a, 0]T = c[b, a, 0]T = c[0, b, −a]T , = c[0, b, a]T = c[−a, 0, b]T = c[a, 0, b]T The signal in this context is a two or three dimensional image, which in most circumstances is real valued. 4.1 Displacement field estimation a = 2, b=1+ √ 5, c= 23 # √ 10 + 2 5, where a, b and c is only for normalisation. From above, it is therefore necessary to apply convolution between the signal and all the n number of filters in the filter set. This leads to n filter responses for each image, each one representing the phase and magnitude in the direction of filter j (see Figure 4.2 for an example), i.e. (4.9) qj = (s ∗ fj )(x). From this we have one response from I1 and one from I2 for each filter in the filter set: Figure 4.2: The original X-ray hand (left) along with the phase (centre) and magnitude (right) of it after being filtered with a quadrature filter. Here it is seen that the filter captures edges corresponding to the filter direction, giving more certainty (magnitude) to structures with a frequency within the passband of the filter. qj (I1 ) = (I1 ∗ fj )(x), qj (I2 ) = (I2 ∗ fj )(x). (4.10) Further more, the phase difference at position x between the images can be found through the product between qi, I1 and the conjugate of qi, I2 , or qj (I1 )qj∗ (I2 ) = Aj (I1 (x))Aj (I2 (x))ei(φj (I1 )−φj (I2 )) . (4.11) From this, the phase difference is, analogue to equation (4.8), equal to ∆φi = arg(qj (I1 )qj∗ (I2 )) = φj (I1 ) − φj (I2 ). (4.12) dj (x) ∝ ∆φj . (4.13) Since this is the difference between the images, it is also locally proportional to the displacement field estimate at that point: Moreover, what we now have is a set of n displacement field estimates, each one representing the estimate in direction i. 24 Chapter 4. The Morphon Method 4.1.2 Least square solution To bring the information from each one of the estimates together into one displacement field estimate, a least square problem is a suitable way of postulating this problem. Since least square problems have the great advantage of being able to weight different points with different importance, as is common in filter design, for example described in [25], this can also be used for the derivation of the total displacement field. In this case, the weight function is called the certainty of the filter responses and is defined as cj = Aj (I1 (x))Aj (I2 (x)). (4.14) The certainty will then be high where the two filter responses match each other in structure, which is sensible, and this makes for all the parts in the least square problem to calculate the whole displacement field, d = (d1 , . . . , dN ), for N dimensions. " min [cj (n̂Tj d − dj )]2 , (4.15) d j for each direction, n̂j . Further more, since the Morphon method is an iterative registration scheme, the displacement field estimate is only valid for the current iteration, and must be updated with each iteration, thus generating a new field, dk , for iteration k, which brings us to a central part of the Morphon method. 4.2 Displacement field accumulation As presented above, each iteration generates a new displacement field, solved from equation (4.15). However, the field for iteration k might not be a good estimate compared to estimates prior to that iteration. To prevent bad estimates to disrupt the registration process, a field accumulation is therefore used to make good estimates gain greater influence on the final displacement field. The accumulation also have another effect on the registration process as it prevents interpolation smoothing to be spread throughout the iterations. Since the model (I2 here) has to be deformed in each iteration and a new displacement found from the deformed model, smoothing would occur if dk would be applied directly on the model for each iteration. Instead, the newly found field is added to the field found and accumulated in iterations before, removing interpolation artefacts since the original model is now used for each iteration. The accumulated displacement field is updated through d$a = ca da + ck (da + dk ) , ca + ck (4.16) 4.3 Displacement field regularisation 25 where da denotes the accumulated field for the current iteration and d$a is the next, updated accumulated field. ca and ck both describes certainty measures, associated with the ones mentioned in equation (4.14). ck is directly coupled with the displacement estimate for each iteration, dk , and is defined as ck = n " cj , (4.17) j i.e. the sum of all certainty measures for each quadrature filter. ca , on the other hand, is another accumulated entity and is a certainty measure weighted with its own certainty, expressed as c$a = c2a + c2k , ca + ck (4.18) where c$a , analogue with above, is the updated certainty measure. To summarise; the accumulation of the displacement field implies that good displacement field estimates will have a higher effect on the accumulated (final) displacement field than poor ones. This creates a robustness of the algorithm as well as increases its reliability. 4.3 Displacement field regularisation As mentioned in section 4.1.1 about quadrature phase of images, the phase is only a local description of the image. If the accumulated field estimates from equation (4.16) were to be directly applied to the model image, it would most likely tear the image apart, diverging into something far from a successful registration operation. However, since the field represents local changes in the image, it is sensible to think that an averaging of sorts would represent a change on a more global scale. This averaging can be made more or less complicated. Often, it is done by means of a basic Gaussian averaging: dr = (d ∗ g)(x), (4.19) where g is a Gaussian kernel and dr is the regularised field, but can be made more advanced in many ways. In this thesis, a special form of normalized convolution, as first introduced by Knutsson and Westin [27], called normalized averaging, is used. Normalized averaging introduces the great advantage of allowing to include the certainty measure in the regularisation process as well, adding even more robustness and reliability to the registration. Normalized averaging is defined as dr = ((ca d) ∗ g)(x) , (ca ∗ g)(x) (4.20) where the division and multiplication are taken element wise. The effect of a small and a large standard deviation of a Gaussian kernel is presented in Figure 4.3. 26 Chapter 4. The Morphon Method Figure 4.3: A displacement field during a Morphon registration process with a small (left) and large (right) regularisation of the field. 4.4 The model The model in the Morphon method is the image that is deformed by the displacement field, and hence contains the information about the registered object after the registration has finished. However, to perform a successful registration, an appropriate model is needed. The type of model often depends on the context or problem. Nevertheless, a good model should contain approximately the same information regarding shape and intensity variations as the target object. For example, in this thesis segmentation is performed on SPECT images of the heart. A suitable model could therefore reside in a manually segmented heart from another SPECT image. 4.4.1 Deformation The model is deformed, using a suitable interpolation method, by the accumulated displacement field at the end of each iteration and the deformed model is used to generate the new accumulated deformation field, which in turn deforms the original model. This is iterated until some kind of accuracy, δ, or a maximum number of iterations is reached. The accuracy could for example be the change of the accumulated displacement field between two, in the iteration process, adjacent fields, i.e. $ $ $ dr, k − dr, k−1 $ $, $ δ=$ $ dr, k where the relative difference has been chosen as the accuracy. (4.21) 4.5 Summary 4.5 27 Summary Here, a simpler version of the Morphon method has been described in some depth. The method takes advantage in its local phase based registration scheme, which is iterated through different resolutions. A displacement field is found for each iteration and is accumulated and regularised to make the registration more robust, as well as adding the option to make the model more or less rigid and deformable. 5 Rotation of Myocardial Perfusion Images As presented in section 2.3.1, there are some problems with the image analysis in scintigraphic heart examinations as they are carried out at the time of writing this thesis. The main problems are: • Interoperator and intraoperator inconsistency, that ends in difficulties in comparing images acquired from different technologists, or even from from the same technologist. • Time consuming activities for the technologist after images have been acquired. • Difficulties in manual alignment for images with poor uptake. The approach in this thesis for designing an automatic method for doing all this is based on the Morphon from chapter 4. The automatic process involves some pre-processing, to help the later registration scheme, followed by a 2-D registration and segmentation to decrease the amount of data. After this, the 3-D image should have been decreased to a small volume, containing the left ventricle of the heart. A 3-D Morphon is then applied to the image, from which a displacement field is generated. This displacement field, with enough regularisation, is a rough description of how the model have been rigidly transformed to match the target image. By estimating the angle with the help of two lines, the rotation of the left ventricle can be known, and the target image rotated correctly. The sequence is in this chapter described in detail. 5.1 Pre-processing of images The first problem in this application that must be dealt with is that the amount of data has to be decreased, in order to increase performance of the analysis. This is primary done in a quite primitive way by cropping the 3-D image along all of its dimensions. The cropping assumes that the technologist has placed the patient correctly in the γ- 30 Chapter 5. Rotation of Myocardial Perfusion Images camera and that the heart is not positioned in the right side of the body. The cropping is illustrated in Figure 5.1. Figure 5.1: The pre-cropping that will help the registration process. To the left, the original image, and to the right the cropped image. The left ventricle is marked with a black rectangle in both images. 5.2 2-D Morphon segmentation After the image has been cropped, the first Morphon based segmentation is performed. As this is a 2-D segmentation, the 3-D volume must first be reduced. This is done by transforming the volume into a max intensity projection (MIP) image, i.e. by generating a 2-D image where each pixel is the maximum of all the voxels that after a projection would result in that pixel. This is performed along the y-axis, which is the axis in the same direction as the nose of the patient, or that is pointing out from the patient’s chest. This is illustrated in Figure 5.2 a). Figure 5.2: a) Maximum intensity projection along the y-axis (green). b) Maximum intensity projection along the x-axis (red). The resulting projections are symbolised by the purple plane. 5.2 2-D Morphon segmentation 5.2.1 31 Locating the heart Since other organs can have very similar shape and intensity variation compared to the left ventricle of the heart, it is important to help the registration as much as possible. This is done by finding a point of the heart where the Morphon can be initiated. By assuming that there are only uptake in form of noise in the parts above (in the negative z-direction, which is symbolised by the blue vector in Figure 5.2) the heart, a thresholding can be performed to remove any noise, after which the heart should contain the lowest point in the z-direction. This assumption has been made after investigating a great number of images (over 600). As presented in section 3.1.1, the threshold level can be found in different ways. Here, a dynamic way is needed to keep the thresholding from removing the heart completely. This is done by calculating the histogram of the image. This will result in a curve with some maxima and minima, where the maxima are a representation of intensity levels that are more common in the image. As this do not apply to noise to the same extent, a suitable threshold level that contains the heart can be found. However, this may not be enough if the noise level is very high. In order to separate noise from the heart, the size of the binary clusters that are found is calculated. If the size is too small, it is likely that it does not represent the heart. Additionally, a distance test is performed. This test calculates the distance from the centre of mass (COM) of the image and the point that has been found, where the COM is defined as % I(xi )xi %i , ∀i, (5.1) i I(xi ) for a pixel at position xi with intensity level I(xi ). As the intensity levels in the MIP image are much higher in the lower parts of the body, it is there that the COM will be found. The distance from the COM to the heart is therefore relatively short, and points that are found too far away from the COM can be regarded as noise. 5.2.2 Segmentation When a point of the heart has been found, the 2-D Morphon can be initiated and performed. The model used is a MIP image from another heart, which represents the intensity variations in a suitable way. Moreover, as the model may still, despite the tests from above, be initiated quite far from the heart, the registration is first performed on a lower scale of the image (naturally, with a down sampled model as well). Still, the same size of the quadrature filters is used, which implies that the model will "see" longer than on full scale. To perform the registration on different scales is a good way of preventing the registration to converge locally on details, before it has reached the heart. When the model has converged, it should be known where the heart is situated in the 2-D MIP-image. It can then be cut out along the third dimensions, resulting in a 32 Chapter 5. Rotation of Myocardial Perfusion Images Figure 5.3: The result after two 2-D Morphon segmentations. cylinder like volume which is oriented along the axis of the maximum intensity projection (y in this case). The resulting volume is then made a MIP along the x-axis, which is illustrated in Figure 5.2 b), and another 2-D Morphon is initiated. However, as much of the interfering organs around the heart has been cut away already, it is sufficient to initiate the model at the COM of the MIP image, which will result in a good registration. Consequently, when the registration has converged, even more can be cut away in the same manner as above. The result is a relatively small box, containing the heart and possibly other organs that lies very close to the heart. An example of the result can be seen in Figure 5.3. 5.3 3-D Morphon segmentation From the segmentation covered in the last section, the heart is now separated from the majority of interfering organs. The last segmentation process is now made in three dimensions to be able to gather information about the orientation of the heart. As the co-ordinates of a point that is known to have its position somewhere in the heart, this point is a sensible starting point for the model in the 3-D Morphon registration. However, as the starting point will probably not correspond exactly to the same point in the model, the 3-D registration is here as well performed in multiple resolutions. The reg- 5.4 Angling of the heart 33 ularisation of the displacement field is relatively large on the lower scales to keep the model from deforming incorrectly in the absence of the details of higher resolutions. At full resolution, on the other hand, the standard deviation of the regularisation filter is relatively smaller. This means that the model can deform into more detail, which is necessary due to the great variation between images from scintigraphic heart examinations. The model in the 3-D segmentation is as in the 2-D case a manually segmented 3D heart. The model will deform significantly between different types of hearts. An example of this is illustrated in Figure 5.4. Figure 5.4: The original model (top-left) and three examples of it after being deformed in the registration process. 5.4 Angling of the heart After the 3-D Morphon registration has converged, the model should have aligned itself to the left ventricle of the heart. The model can then be used to segment the left ventricle, which is one of the aims of this thesis. Moreover, the rotation of the heart requires some thought on how to represent the rotation of the model. 34 5.4.1 Chapter 5. Rotation of Myocardial Perfusion Images Angle estimation To represent an angle between two objects, a minimum of two lines is needed. The first line has been manually defined from the original model. The line does not need to be represented by voxel coordinates in the image, but can be mathematically defined as desired. Here, the line is situated in the middle of the left ventricle, in the cavity called the lumen, and has a length that does not exceed the length of the heart in the same direction. Mathematically, the line can be expressed as x0 vx x0 + tv = y0 + t vy , z0 vz t = 0 : l, (5.2) where x is the starting coordinate of the line, v is the unit direction vector and t is a parameter which determines the length, l, of the line. The deformation field that has been found through the 3-D registration can now be used to deform the line accordingly. However, as a small regularisation has been used on the full scale registration, the line is likely to be deformed into a discrete curve. This is not desirable and hence the field is further regularised. With a large regularisation filter, the global displacement will more and more correspond to a rigid transformation field. However, the deformed line will nonetheless become slightly bent. This is solved by fitting a line to the deformed points of the line, resulting in a linear regression problem: min α, β " i [zi − (αxi + β)]2 , x = (x, y), (5.3) with parameters α and β, or min $Ax̃ − b$, x̃ (5.4) where A is a known basis matrix, x̃ is an unknown parameter vector (here, containing α and β) and b is a known vector containing the function values (here, the z-axis). From linear algebra we know that the solution for a least square problem is: x̂ = (AT A)−1 AT b. (5.5) This will yield a straight line that has been approximately rigidly transformed from the original line, and the angle, θ, for each axis respectively can be calculated from the definition of the scalar product between two vectors, u and v, given as uT v = |u||v|cosθ. (5.6) 5.5 Summary 5.4.2 35 Rotation As the angles of rotation around each axis have now been estimated, the remaining task is to rotate the image heart with the corresponding angles. This can be done in a multitude of ways, e.g. applying a rigid rotation matrix and interpolating the image to the new position. The segmented and rotated heart can then be displayed in any way desired, however, the common clinical way is in slices through each axis. An example of a segmented heart, before and after rotation, can be found in Figure 5.5, where a surface of the volume object is displayed, and in Figure 5.6, where one slice through each axis is shown. Figure 5.5: Iso-surface plot of an automatically segmented heart (left) and the same heart after rotation (right). Figure 5.6: Three slices through the rotated heart as they are commonly displayed in clinical use. 5.5 Summary The segmentation and alignment of myocardial SPECT-images is performed with the aid of two 2-D and one 3-D registrations, based on the Morphon method from chapter 4. The angles are consequently found through deforming a line along with the deformation of the model in the method. 6 Dopamine Transporter Quantification From section 2.3.2, it is clear that there are currently some problems involved in quantification of 3-D scintigraphic brain images which aims at diagnosing e.g. Parkinson’s disease. This relies both in the variability between operators and the fact that much quantification is made in two dimensions, while the acquired images are in fact in 3-D. Thus, The 3-D segmentation method used in chapter 5 has been somewhat modified and complemented, and then applied on the quantification problem at hand. The algorithm for automatic 3-D quantification works by first dividing the SPECT-brain image into two parts and quantifying each part individually. This is followed by an initiation of the 3-D Morphon segmentation process. The segmented basal ganglia are then further analysed and quantified with methods comparable to the manual 2-D quantification commonly used clinically. To be able to do this, a method called weighted principal component analysis (wPCA) is introduced. 6.1 3-D Morphon segmentation Compared to the segmentation process of the heart in chapter 5, specifically section 5.3, very little pre-processing is necessary in the case of DaTSCAN SPECT images. In myocardial perfusion SPECT images, the image signal from the left ventricle of the heart constitutes a small part of the total image signal. This is in contrast to DaTSCAN SPECT images of the brain, where the basal ganglia (the region of interest) stand out in a much clearer way. This is illustrated in Figure 6.1. Consequently, the initiation of the 3-D segmentation can be greatly simplified compared to the pre-processing in chapter 5; e.g. the two 2-D segmentations from section 5.2.2. 38 Chapter 6. Dopamine Transporter Quantification Figure 6.1: Two iso-plots of a myocardial perfusion SPECT image (left) and a DaTSCAN SPECT image (right). The left ventricle is marked with an arrow and the basal ganglia are the red bean-shaped objects. 6.1.1 Locating the basal ganglia As mentioned above, the brain image is first divided into two halves, each containing at least one whole basal ganglia. The division is simply done by examining the intensity profile through the axis going through both of the basal ganglia and dividing the volume at the minimum between the two maxima, as illustrated in Figure 6.2. This makes it pos4 x 10 4.5 4 Intensiity 3.5 3 2.5 2 1.5 1 5 10 15 20 25 Position 30 35 40 Figure 6.2: The intensity profile of the brain (left), with marked local maxima and local minimum, taken along the x-axis, shown to the right. sible to divide the brain in approximately the two brain halves, each image containing one basal ganglia. Further more, since the basal ganglia in almost all cases have a significantly higher signal than the rest of the brain, locating the basal ganglia in each side is generally not a very difficult task. Choosing the point of maximum intensity as the starting point for 6.1 3-D Morphon segmentation 39 further segmentation is often good enough, however, to increase the robustness of the algorithm, the brain is first multiplied with a weight function, which makes intensity levels closer to the centre of the brain (where the basal ganglia are located) more important. After this, the point of maximum intensity is selected as the initiation point for the 3-D segmentation of each basal ganglia. 6.1.2 Segmentation With a point inside the basal ganglia, the model for the 3-D Morphon segmentation can be initiated. The model here is simply a healthy basal ganglia, which is mirrored when used in the other side of the brain. As explained in section 2.3.2, the basal ganglia are degenerated from the same place if a patient suffers from Parkinson’s disease. This fact can be used as an advantage in the segmentation process. If it is assumed that the relatively unaffected part of the basal ganglia is still high in image signal, the corresponding part of the model can be initiated at the highest intensity point in the image. This is illustrated in 2-D in Figure 6.3, while it is in practice performed in three dimensions. a b Degenerated basal ganglia Model Figure 6.3: a) The model (white) and a degenerated basal ganglia (gray) with the fixed initiation point of the model (square) and the maximum intensity point of the image (circle). b) The model and image after the initiation point of the model has been initiated at the maximum intensity point in the image. With the model initiated at a spot relatively close to the object that is to be segmented, the 3-D Morphon segmentation process can be performed. As with the case of the segmentation of chapter 5, 5.3, the registration process is performed sequentially in multiple resolutions to help the model to fit on a more global scale at first. One question that one might ask is that if the basal ganglia have degenerated, how can the model fit to something that is not there anymore? As a matter of fact, degenerated basal ganglia 40 Chapter 6. Dopamine Transporter Quantification tend to leave a trace, where there used to be dopamine transmitters before degeneration (see Figure 6.4). Since the Morphon is phase based, this trace is as good a structure to fit to as a healthy basal ganglia. Moreover, the model transformation is made relatively rigid to keep the model from deforming too freely in cases where little or none signal is left in some parts of the basal ganglia. Figure 6.4: The faint signal (red) that is left where the basal ganglia has degenerated still works as a structure to register on. After the registration process has completed, the model is thresholded to a small extent in order to remove possible noise or interpolation artifacts, introduced during the registration. The model has now very likely been deformed into a shape that fits well into the area of the brain where the basal ganglia are situated, even if the dopamine transporters have been degenerated. The result of two segmentation processes is presented in Figure 6.5, where the slice of maximum signal intensity of the images has been selected for displaying reasons. 6.2 Quantification After the area of interest, containing whole or parts of basal ganglia, has been successfully segmented the process of quantifying the segmented data remains. This has traditionally been done by medical staff, as described in section 2.3.2, from two-dimensional slices of 3-D volumetric scintigraphic images. However, the method used here will rely on the whole three-dimensional data that has been acquired from the segmentation process. Nonetheless, it will be influenced by the traditional methods in order to be able to compare the results with previous examinations. Quantification is presently performed by selecting multiple regions inside the ROI (the basal ganglia). These regions are then compared by calculating quotes of various kinds. This method has been compared to a manual segmentation method where the region of the basal ganglia has been divided into two parts; one front and one rear part, as can be seen from the contour in Figure 6.5. 6.2 Quantification 41 Figure 6.5: Two examples after the 3-D segmentation process. For displaying reasons, the segmented area is illustrated by a contour on 2-D slices of the 3-D volumetric images. Additionally, the segmented areas have been divided into two parts by a dividing line, in order to extract corresponding parameters. Consequently, the need for finding the long axis of the basal ganglia was needed. In order to achieve this, tools from the statistical domain were used. 6.2.1 Weighted principal component analysis Principal component analysis (PCA) can effectively be used in a dataset to find the direction where the data is varying the most. PCA can be derived in many different ways, depending on the context, however, one property of PCA is that it finds high variance in data. This suits the problem at hand well, as the direction of high variance of voxels is very likely the main axis of the basal ganglia. As maximum variance is desired, finding the local maxima of the variance can be used to derive PCA. If the empirical mean of the dataset is assumed to be zero (without loss of generality), the variance in direction ŵ can be expressed as 2 σŵ = E[xT ŵ] = E[(ŵT x)(xT ŵ)] = ŵT E[xxT ]ŵ = ŵT Cŵ = wT Cw , wT w (6.1) where E denotes the expectation value operator, x a data vector and C the co-variance matrix of x. By differentiating the variance we get 2 ∂σŵ 2 = K(Cw − σŵ w) = 0, ∂w (6.2) 42 Chapter 6. Dopamine Transporter Quantification where K '= 0. Moving around some of the terms and disposing K yields 2 Cw = σŵ w, (6.3) which is identified as an eigenvalue equation: Ce = λe. (6.4) The eigenvectors will, consequently, indicate the direction of highest variance, next highest variance and lowest variance respectively, with all vectors being orthogonal. However, in its original version, PCA on images works with the positions of the pixels (or voxels) in the image only. As low intensity noise will then be as important as the high intensity parts in the centre of the basal ganglia, the direction of variance will much likely not correspond to the long axis of the basal ganglia. To make the procedure more robust, the co-ordinates of the data (voxels) in the image are first weighted with the intensity level at the corresponding position. This can be expressed mathematically if x is the data vector with observations (pixels or voxels) ordered as columns, and the rows representing the different variables (in this case, the co-ordinates (x, y) or (x, y, z)). The weighted data, xw , for row, k, is then expressed as x(k)w = x(k)W, ∀k (6.5) where W is a diagonal weight matrix with the intensity values of the image in the diagonal. The result of this method is illustrated in Figure 6.6. Figure 6.6: Principal component analysis (left) compared to intensity weighted principal component analysis (right) with test data where a brighter color indicates higher intensity values. The solid line illustrates the eigenvector with the corresponding highest eigenvalue, i.e. the direction of highest variance or highest weighted variance. This will give the means to approximate the long axis of the basal ganglia in order to find the front and back of the organ, as illustrated by the dividing line in Figure 6.5. The front and back part of the basal ganglia is here defined as dividing the ganglia at a certain point along the long axis. If the segmentation was successful, this would approximate dividing the ganglia with a plane, having the long axis vector as the normal vector. 6.3 Summary 6.3 43 Summary The quantification of DaTSCAN-images can be performed through first dividing the brain into two halves. Each part will then be target for a single 3-D Morphon segmentation, which leads to the localisation of the basal ganglia. By finding the long axis of the two ganglia, they can then be divided into parts, and several fractions and values can be extracted. 7 Results In this chapter, the results from the automatic segmentation and alignment of the myocardial perfusion images and the quantification of DaTSCAN SPECT-images will be presented in two separate parts. Regarding the current implementation of the methods, it can be noted that they both take around 15 seconds per image to perform their respective task. 7.1 Segmentation and alignment of 3-D transaxial myocardial perfusion images The segmentation and alignment algorithm can be divided and tested in two parts; the segmentation of the left ventricle, and the angling of the left ventricle after segmentation. The segmentation is obviously a necessary step for the alignment to work properly, but can nonetheless be interesting as a separate segmentation method. 7.1.1 Segmentation In order to test the success of the segmentation algorithm implemented in this thesis, a large number of randomly selected myocardial perfusion images has been used. The original images have all been acquired immediately after image reconstruction in the γcamera workstation; often called transversal or transaxial images. No attention has been paid to different reconstruction algorithms, as they are estimated to have little effect on the final result. The material consists of images from both male and female patients, varying in age. Moreover, as only patients showing symptoms of a heart disease are examined at the hospital where the images have been acquired, the images are not representative to the perfusion of an average person, but often indicates some kind of heart disorder. 46 Chapter 7. Results 100 images have been used to test the algorithm. Since automatic evaluation can be difficult for segmentation algorithms, all the images have been visually evaluated after the segmentation. The importance of the segmentation in this context is to crop the large transversal image volumes into a small image volume containing essentially only the left ventricle. No effort has been made to remove noise and/or other organs that may be situated close to the heart, as they can be of help to medical staff when examining the images. The segmentation was evaluated to have been completely successful in 100 of the 100 cases, that is in 100 percent. Some examples from the 100 tested images can be seen in figure Figure 7.1. Another factor that may be of interest when evaluating the segmentation process is how well the 3-D model deformed into the target image. Examples of this are shown in Figure 7.2, where the deformed model, corresponding to the images in Figure 7.1, are displayed. Note that the above images have not yet been rotated, but only target for segmentation. 7.1.2 Alignment After the 100 images above had been segmented, some of these were rotated with corresponding estimated angles. Two angles was used to rotate the heart and thus the rotation around the axis of the heart was neglected as it has lower priority compared to the main, global, rotation. Manual rotation of the same images was also performed by a technologist on 60 of the 100 images. These manual operations were made in the ordinary clinical environment that is used when manually adjusting the images after image reconstruction. The manual alignment was therefore carried out completely independently of the algorithm. Angles estimated from an application at the workstation were also included to give further data to compare to. This software, however, works on already manually segmented images. The estimated angles for each of the involved parts (the algorithm used in this thesis, the manual estimation and the additional semi-automatic estimation) are presented in Figure 7.3 and 7.4. The mean difference for the automatic method was approximately 6 and 7 degrees for the two estimated angles. However, there were peaks of differences of up to 30 degrees. Additionally, in Figure 7.5 there are some examples of more and less successful rotations made by the automatic algorithm, displayed in the same way as above. 7.2 Automatic dopamine transporter quantification As there are few developed commercial applications performing automatic 3-D quantification of DaTSCAN-images, the results of this part of the thesis consist much in considering the correlation between the automatic method introduced here with manual 7.2 Automatic dopamine transporter quantification 47 Figure 7.1: Results from the automatic segmentation displayed as appropriate slices along all three axes in each image. Note the great variation between the different hearts. 48 Chapter 7. Results Figure 7.2: The deformed models corresponding to the images in Figure 7.1, displayed in the same slices. 7.2 Automatic dopamine transporter quantification 49 *! )! (! '! &! %! $! #! "! !+ ! "! #! $! %! &! ,-./01.23 415-16 ,-./!015+72889:9539 ;902!1-./01.23 ;902!1-./!015+72889:9539 + '! Figure 7.3: The first of the two angles in degrees, named HLA, estimated with different methods. The dotted and dashed horisontal lines indicate the mean difference for the automatic and semi-automatic methods respectively. 50 Chapter 7. Results "!! *! )! (! '! &! %! $! #! "! !+ ! "! #! $! %! &! ,-./01.23 415-16 ,-./!015+72889:9539 ;902!1-./01.23 ;902!1-./!015+72889:9539 + '! Figure 7.4: The second of the two angles in degrees, named VLA, estimated with different methods. The dotted and dashed horisontal lines indicate the mean difference for the automatic and semi-automatic methods respectively. 7.2 Automatic dopamine transporter quantification Figure 7.5: Slices through volumes after rotation has been performed. 51 52 Chapter 7. Results 2-D quantification methods. As in the previous section, the results from the automatic quantification can be divided into a segmentation part and a quantification part. The material that the method was tested on was made up of 70 images, taken directly after image reconstruction in the γ-camera. 7.2.1 Segmentation The segmentation process was here, as in the previous results, evaluated manually for each image and failed in one case of the 70 images. This was due to the fact that the uptake in the basal ganglia of one side of the examined brain was lower than in other parts of the brain, resulting in that the model of the segmentation process was initiated too far from the basal ganglia. The segmentation did, however, succeed for the other side of the brain. This gives a rate of success of approximately 99 percent. Images of successful segmentations are presented in Figure 7.6. Figure 7.6: Illustrating the results from the segmentation process, where the contour line around the basal ganglia indicates the segmented area. 7.2.2 Quantification As have been mentioned, there are, at the time of writing, few algorithms that perform automatic 3-D quantification of DaTSCAN-images, and the quality of the result can 7.2 Automatic dopamine transporter quantification 53 therefore be difficult to define. Nevertheless, even though the technique presented here operates in three dimensions, it should, if performing well, correlate with manual two dimensional approaches. Automatic quantification has therefore been performed on the same images as the segmentation process, and a set of different values have been calculated, which are often used in clinical quantification of this kind of images. The quantification values are defined by different fractions between whole or parts of the basal ganglia signal and the background signal in the brain, as well as fractions between the left and right basal ganglia, and the front and back parts (as defined in the last paragraph of section 6.2.1) of each ganglia. These values are explained and clarified further in Figure 7.7. With these areas, a number of different fractions, fi , can be defined A1 A2 B1 C B2 Figure 7.7: The different fractions between different parts of the basal ganglia and the background signal are defined by the areas above. as f1 = f5 = A1 +A2 −C 2 −C , f2 = B1 +B , C C f1 A2 B2 , f6 = B1 , f7 = f2 , A1 f3 = A2C−C , f8 = ff34 f4 = B2 −C C (7.1) where C is the mean signal of the whole brain, excluding the basal ganglia. The different fractions are illustrated in Figure 7.8, 7.9, 7.10 and 7.11. 54 Chapter 7. Results 5 5 4 4 3 3 2 2 1 1 10 20 30 40 50 10 Correlation:0.89803 20 30 40 50 Correlation:0.91175 4 4 3 3 2 2 1 1 1 2 3 4 5 1 2 3 4 5 Figure 7.8: The fractions f1 and f2 with the values of manual (green) and automatic (blue) quantification as well as the relative difference (red) along with scatter plots for both sides. 7.2 Automatic dopamine transporter quantification 5 5 4 4 3 3 2 2 1 1 10 20 30 40 50 55 10 Correlation:0.91397 20 30 40 50 Correlation:0.9189 4 4 3 3 2 2 1 1 1 2 3 4 5 1 2 3 4 5 Figure 7.9: The fractions f3 and f4 with the values of manual (green) and automatic (blue) quantification as well as the relative difference (red) along with scatter plots for both sides. 56 Chapter 7. Results 1.2 1 1 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 10 20 30 40 50 10 Correlation:0.79792 20 30 40 50 Correlation:0.84629 1.1 1 1 0.9 0.9 0.8 0.8 0.7 0.6 0.7 0.5 0.6 0.8 1 1.2 0.6 0.8 1 1.2 Figure 7.10: The fractions f5 and f6 with the values of manual (green) and automatic (blue) quantification as well as the relative difference (red) along with scatter plots for both sides. The manual values available were rounded to one digit, which explains the discretization of the horizontal axis. 7.2 Automatic dopamine transporter quantification 57 %!!&!! %!! &! &! &!! ! ! !&!! !&! ! !&!! !&! !%!! !"!! !%!! !"!! ! !%&! ! "! #! $! !%!! !%&! ! ! "! "! #! #! $! $! !%!! ()**+,-./)01!2345"5 '())*+,-.(/0!12&222 '())*+,-.(/0!12&222 &! '! '! ! ! ! ! !&! !&!!'! !'! !%!! !%!! !&!! !&!! !&! ! &! !%!! !%!! !&! !"!! ! !&!! "! # ()**+,-./)01!2345 &! !%!! ! !&! Figure 7.11: The fractions f7 and f8 with the values of manual (green) and automatic (blue) quantification as well as the relative difference (red) along with scatter plots for both sides. !%!! !"!! !&!! 8 Discussion In this chapter, some general comments will be made to the results in chapter 7. Furthermore, possible future improvements to the presented methods that, due to the time limit of this project, were not implemented, will be proposed. 8.1 Segmentation and alignment of 3-D transaxial myocardial perfusion images From the results it is quite obvious that the automatic segmentation scheme presented in this thesis is near to flawless when applied to myocardial perfusion SPECT-images. The relatively large dataset along with the great variation among these kind of medical images, gives great credibility to this local phase based registration scheme for segmenting the left ventricle of the heart. The initial 2-D segmentation is probably essential in that it removes other organs that can resemble the left ventricle to quite some extent from time to time. Additionally, the 3-D segmentation fine-tunes the model to the target image, and if desired, an even tighter cropping could probably be performed than the one illustrated in Figure 7.1. As noted though, the surrounding organs of the left ventricle can be helpful for medical staff when diagnosing the images. Performing the registration with different models have not been performed to any substantial extent in this thesis. However, from the results it is obvious that the choice of model can not be crucial when it comes to segmentation; backed up by the great variance among different hearts. However, in some cases, where the target heart has been overly large, the 2-D segmentation can cut away some parts of the heart due to restraints in rigidity. While this does not generally have any impact on the final result after the 3-D segmentation has been performed, it does increase the possibility of a bad angle estimation. The results from the alignment can, compared to that of the segmentation, be directly 60 Chapter 8. Discussion compared to other data, such as the manual angle estimations. As the angling is a more complex process, the automatically estimated angles do vary some with the manual measurement and as could be seen, the semi-automatic method have a lower mean difference than the fully automatic method used in this thesis. It should be noted though, that the difference between the manual and the semi-automatic methods sometimes reaches a value of zero. This can be explained due to the fact that the semi-automatic method is built in to the workstation where the operator performs the manual estimations. The operator is first presented with angles from the semi-automatic application, and could possibly deem the presented angles as sufficient; therefore not changing them further manually. Moreover, as the deformation of the model to the target images seems very well performed, the flaw of the angle estimation might be founded in the way that the difference in angles is described and defined. Not uncommonly, the left ventricle of the examined image is somewhat curved, in which case a line description for describing the orientation of the object could be insufficient. Tests with two lines as describing the angle difference has been performed, but without any obvious improvement. Since the deformation field of the 3-D registration is known and probably a good description of some kind of difference between the model and target image, future implementations of angle estimation can be easily done. Moreover, performing the registration with a different kind of model might prove itself useful when it comes to the angle estimation since the difference between the target image and the model could correlate better with the difference in angle for another model. Furthermore, the absolute angle of the left ventricle may not be important in an absolute, single estimation, but is probably more important when comparing different images from the same patient. One solution to this could then be to register the left ventricle in one examination to the same ventricle in another examination. Since the two images will probably be more alike compared to a general model, the angle difference estimation between the hearts of the different examinations could be easier determined. It is, however, difficult to determine how much the rotation of a single heart can vary from a "perfect" rotation and still be sufficient for a doctor to base a diagnose on. Generally though, a difference of more than 10 degrees often results in a poor rotated image. Furthermore, a perfect rotation is hard to define, as different operators will rotate the same heart differently. Therefore, to gain a full understanding on how successful the algorithm of this thesis is in rotating the images, a study involving more human operators is desirable. It is also difficult to compare this method to already proposed methods, since information about image type and quality is not available in enough detail for such studies. 8.2 Automatic dopamine transporter quantification 8.2 61 Automatic dopamine transporter quantification The automatic quantification of DaTSCAN-images showed itself to be quite successful when it comes to correlation to manual 2-D quantification. There is very high correlation in some cases of the different fractions, however, a relevant question to ask is if the correlation should be higher. Probably not, since the two methods differ very much. The automatic method uses volume quantification, while the manual quantification is performed on a number, or sometimes only one, two dimensional slices of the volume. The segmentation process was successful in most cases, and could probably be even more so if a more sophisticated method for initiating the registration model was developed. A pre-registration of the whole brain could primarily be performed, which would give a better estimation of where the basal ganglia should, or more important should not, be located. For very low uptake, the registration model could sometimes float away, in lack of structure to register on. This behaviour could be removed to some extent if one of the two basal ganglia have a higher uptake. By first registering on the side with high uptake, a rough approximation of the other basal ganglia could be generated. By then increasing the regularisation kernel (increasing the rigidity of the model), and mirroring the result from the first registration, the registration of the basal ganglia with low uptake could probably be helped; taking into account that the two basal ganglia of the same brain do not differ much from each other. Coinciding with suggestions of the earlier section, other registration models should be evaluated to investigate if that could yield a better result. However, as the uptake of the basal ganglia is due to a physiological process, this is unlikely. If CT and/or MRI images of the same brain are available, the registration could be performed on these images (with much higher resolution) and later mapped onto the SPECT-images of the dopamine activity in the basal ganglia. 9 Conclusions In this thesis, it has been showed that a phase based registration scheme can be very successful when performing fully automatic segmentation of myocardial perfusion SPECTimages. Such a method has been implemented and when performed in three dimensions, this segmentation can be used to estimate the angle of the left ventricle. To reduce variability between different operators, such a method can further more be used to take one step closer to a fully automatic examination of myocardial perfusion images. Additionally, a method for automatically quantifying DaTSCAN-images in three dimensions was presented with promising results. Manual quantification can be difficult and tedious and is often performed on slices instead of whole volume data. This method can therefore increase the quality of examinations and vastly decrease the time that it takes to quantify each image. As the research on automatic algorithms for dopamine transporter quantification is currently quite sparse, the hope that this method will give medical staff future tools for increasing examination quality is likely well founded. Bibliography [1] G. Germano, P.B. Kavanagh, Hsiao-Te Su, M. Mazzanti, H. Kiat, R. Hachamovitch, K.F. Van Train, J.S. Areeda, and D.S. Berman. Automatic reorientation of three-dimensional, transaxial myocardial perfusion spect images. The Journal of Nuclear Medicine, 1995. [2] G. Germano, P.B. Kavanagh, J. Chen, P. Waechter, Hsiao-Te Su, H. Kiat, and D.S. Berman. Operator-less processing of myocardial perfusion spect studies. The Journal of Nuclear Medicine, 1995. [3] R.J. Morton, M.J. Guy, R. Clauss, P.J. Hinton, C.A. Marshall, and E.A. Clarke. Comparison of different methods of datscan quantification. Nuclear medicine communications, pages 1139–1146, 2005. [4] P.A. Tipler and R.A. Llewellyn. Modern Physics. W. H. Freeman and Company, New York, USA, fourth edition, 2004. [5] A.C Perkins. Nuclear Medicine in Pharmaceutical Research, chapter 2. CRC Press, London, UK, fourth edition, 1999. URL http: //site.ebrary.com.lt.ltag.bibl.liu.se/lib/linkoping/ Doc?id=10054842&ppg=5. [6] A. Del Guerra. Ionizing Radiation Detectors for Medical Imaging. World Scientific Publishing Company, Incorporated, Singapore, third edition, 2004. URL http://site.ebrary.com.lt.ltag.bibl.liu.se/ lib/linkoping/Doc?id=10082183&ppg=5. [7] O. Chiewitz and G. de Heversy. Radioactive indicators in the study of phosphorus metabolism in rats. Nature, 1935. [8] E. Lindgren. Versuche mit radioaktiven isotopen bei leukämibehandlung. Acta Radiologica, 1944. [9] H.W. Strauss, K. Harrison, and J.K. Langee. Thallium-201 for myocardial imaging. relation of thallium-201 to regional myocardial perfusion. Circulation, 1975. [10] B. Bodforss. Radioisotope scanning of the brain. (32), 1965. [11] P.V. Harper, G. Andros, and K. Lathrop. Technetium-99 as a biological tracer. The Journal of Nuclear Medicine, 1962. 66 Chapter 9. Conclusions [12] S.A. Larsson. Gamma camera emission tomography. development and properties of a multi-sectional emission computed tomography system. 1980, Acta Radiologica. [13] C.G. Wilson, Y.P. Zhu, and M. Frier. Nuclear Medicine in Pharmaceutical Research, chapter 8. CRC Press, London, UK, fourth edition, 1999. URL http://site.ebrary.com.lt.ltag.bibl.liu.se/ lib/linkoping/Doc?id=10054842&ppg=5. [14] GE Healthcare. Datscan.com, January 2008. URL http://www.datscan. com. [15] H.W. Strauss and D.D. Miller. Society of Nuclear Medicine Procedure Guideline for Myocardial Perfusion Imaging. Society of Nuclear Medicine, 3.0 edition, June 2002. [16] K. Tatsch, S. Asenbaum, P. Bartenstein, A. Catafau, C. Halldin, L.S. Pilowsky, and A. Pupi. European Association of Nuclear Medicine Procedure Guidelines For Brain Neurotransmission SPET using 123 I-labelled Dopamine Transporter Ligands. European Association of Nuclear Medicine, October 2001. [17] G.H. Granlund and H. Knutsson. Signal Processing for Computer Vision. Kluwer Academic Publishers, 1995. ISBN 0-7923-9530-1. [18] B. Svensson, J. Pettersson, and H. Knutsson. Bildregistrering: Geometrisk anpassning av bilder, May 2007. URL http://www.imt.liu.se/edu/ courses/TBMI45/pdfs/registration.pdf. [19] H. Knutsson and M. Andersson. Morphons: Paint on priors and elastic canvas for segmentation and registration. In Proceedings of the Scandinavian Conference on Image Analysis (SCIA), Joensuu, June 2005. [20] J. Pettersson. Automatic generation of patient specific models for hip surgery simulation. Lic. Thesis LiU-Tek-Lic-2006:24, Linköping University, Sweden, April 2006. Thesis No. 1243, ISBN 91-85523-95-X. [21] B. Rodrìguez-Vila, J. Pettersson, M. Borga, F. Garcìa-Vicente, E.J. Gómez, and H. Knutsson. 3d deformable registration for monitoring radiotherapy treatment in prostate cancer. In Proceedings of the 15th Scandinavian conference on image analysis (SCIA’07), Aalborg, Denmark, June 2007. [22] J. Pettersson, K.L. Palmerius, H. Knutsson, O. Wahlström, B. Tillander, and M. Borga. Simulation of patient specific cervical hip fracture surgery with a volume haptic interface. IEEE Transactions on Biomedical Engineering, 2007. 67 [23] A. Parraga, A. Susin, J. Pettersson, B. Macq, and M. De Craene. Quality assessment of non-rigid registration methods for atlas-based segmentation in headneck radiotherapy. In Proceedings of IEEE International Conference on Acoustics, Speech, & Signal Processing, Honolulu, Hawaii, USA, April 2007. [24] H. Knutsson, C-F. Westin, and G.H. Granlund. Local multiscale frequency and bandwidth estimation. In Proceedings of the IEEE International Conference on Image Processing, pages 36–40, Austin, Texas, November 1994. IEEE. (Cited in Science: Vol. 269, 29 Sept. 1995). [25] H. Knutsson, M. Andersson, and J. Wiklund. Advanced filter design. In Proceedings of the 11th Scandinavian Conference on Image Analysis, Greenland, June 1999. SCIA. [26] M. Andersson and H. Knutsson. Controllable 3-D filters for low level computer vision. In Proceedings of the 8th Scandinavian Conference on Image Analysis, Tromsø, May 1993. SCIA. URL http://www.imt.liu.se/mi/ Publications/Papers/scia93_ak.pdf. [27] H. Knutsson and C-F. Westin. Normalized convolution: A technique for filtering incomplete and uncertain data. In Proceedings of the 8th Scandinavian Conference on Image Analysis, Tromsö, Norway, May 1993. SCIA, NOBIM, Norwegian Society for Image Processing and Pattern Recognition. Report LiTH–ISY–I–1528. Linköpings tekniska högskola Institutionen för medicinsk teknik Rapportnr: LiTH-IMT/MI30-AEX--08/465--SE Datum: 2008-04-04 Svensk titel Segmentering och uppvinkling av tredimensionella, transaxiella myokardiska perfusionsbilder och automatisk dopaminreceptorkvantifiering Engelsk titel Segmentation and Alignment of 3-D Transaxial Myocardial Perfusion Images and Automatic Dopamin Transporter Quantification Författare Leo Bergnéhr Uppdragsgivare: EXINI Diagnostics Rapporttyp: Examensarbete Rapportspråk: Engelska Sammanfattning Abstract In this thesis, a novel method for automatically segmenting the left ventricle of the heart in SPECT-images is presented. The segmentation is based on an intensity-invariant localphase based approach, thus removing the difficulty of the commonly varying intensity in myocardial perfusion images. Additionally, the method is used to estimate the angle of the left ventricle of the heart. Furthermore, the method is slightly adjusted, and a new approach on automatically quantifying dopamine transporters in the basal ganglia using the DaTSCAN™ radiotracer is proposed. The results for both applications are promising. The segmentation for myocardial perfusion images succeeded in 100 of 100 cases with a mean difference to manually estimated angles of around 8 degrees. The automatic quantification algorithm showed good correlation with manual quantification, with a correlation coefficient of up to 0.95 for 70 images. Nyckelord Keyword Segmentation, Morphon, SPECT, Myocardial, DAT Bibliotekets anteckningar: