Download Image Analysis of Gene Locus Positions within

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Cell nucleus wikipedia , lookup

Mitosis wikipedia , lookup

Amitosis wikipedia , lookup

List of types of proteins wikipedia , lookup

JADE1 wikipedia , lookup

Chromosome wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Transcript
Image Analysis of Gene Locus Positions within
Chromosome Territories in Human Lymphocytes
Karel Štěpka1 and Martin Falk2
1
2
Centre for Biomedical Image Analysis,
Faculty of Informatics, Masaryk University,
Brno, Czech Republic
[email protected]
Department of Cell Biology and Radiobiology,
Institute of Biophysics of ASCR,
Brno, Czech Republic
[email protected]
Abstract. One of the important areas of current cellular research with
substantial impacts on medicine is analyzing the spatial organization of
genetic material within the cell nuclei. Higher-order chromatin structure
has been shown to play essential roles in regulating fundamental cellular
processes, like DNA transcription, replication, and repair. In this paper,
we present an image analysis method for the localization of gene loci with
regard to chromosomal territories they occupy in 3D confocal microscopy
images. We show that the segmentation of the territories to obtain a
precise position of the gene relative to a hard territory boundary may lead
to undesirable bias in the results; instead, we propose an approach based
on the evaluation of the relative chromatin density at the site of the gene
loci. This method yields softer, fuzzier “boundaries”, characterized by
progressively decreasing chromatin density. The method therefore focuses
on the extent to which the signals are located inside the territories, rather
than a hard yes/no classification.
1
Introduction
The study of the spatial organisation of the genetic material within the nuclei
of eukaryotic cells is one of the most important avenues of current intracellular
research. ToDo: mention what can be studied
Three-dimensional images of the genetic material can be acquired using confocal fluorescence microscopy. The objects of interest (e.g. the whole nucleus,
individual chromosomes, their parts or individual gene loci) are fluorescently
stained, so that they appear as bright areas or spots in the acquired images.
The fluorescent staining can be performed using fluorscent proteins or antibodies attached to proteins, or, in the case of individual gene loci, using bacterial
artificial chromosomes (BAC). These DNA fragments are first prepared to match
the target DNA, fluorescently tagged, and then hybridized with the target in a
process called fluorescence in situ hybridization (FISH). [1]
Human chromosome 5 (HSA5) is one of the human autosomes. Conditions
related to the chromosomal aberrations of HSA5 include the cri du chat syndrome, familial adenomatous polyposis, myelodysplastic syndromes, or Crohn’s
disease [2], [3], [4], [5]. Understanding of the spatial arrangement of the chromosome and the ways it can interact with other chromosomes is therefore of high
interest.
2
Image Data
ToDo: describe the cells we have
The chromosomes are actually tangled and looped strands of chromatin, i.e.,
DNA and proteins. In some places, mostly at the center of the territory occupied
by the chromosome, these loops are densely packed. In other places, chromatin
may be more decondensed, and the chromatin strand may be arranged more
loosely.
However, due to the fact that the width of the strand is below the diffraction
limit for optical microscopes, we cannot examine (or even reliably detect) the
individual loops of the strand – when passing through the optical system, the
light gets distorted by the point spread function (PSF) of the system, and the
resulting image does not contain areas thinly or thickly populated by the loops of
the chromatin, but only areas of low or high total fluorescence. The image areas
with higher total fluorescence intensity correspond to the regions containing
more chromatin loops, more densely packed.
The 3D images of a sample cell can be seen in Fig. 1. The three images show
the individual channels: the cell nucleus, the two HSA5 territories, and the two
gene loci, one per each chromosome territory. All three channels were aligned. We
can see that the nucleus is partially visible even in the channels reserved for the
territories and the loci, as a result of fluorescence bleed-through (also referred to
as crosstalk or crossover), a common problem in fluorescence microscopy. We can
also note that while the cell nucleus has a relatively well-defined boundary, and
the gene loci appear as point-like particles that can be sufficiently represented
by their center of mass, the chromosome territories are of more irregular shapes,
and do not have a definite boundary that would allow for a clear segmentation.
There have been methods introduced specifically to segment chromosome
territories, such as [6] or [7]. However, as noted in [8], when used to help determine the relative positions of gene loci within the territories, these methods
suffer from the bias introduced by arbitrarily selecting the threshold value for
the hard territory boundary.
It has been shown that genes positioned more peripherally within the territory are more active than those closer to the center [8], [9]. Therefore, to avoid
the bias caused by a binary “inside boundary/outside boundary” classification
of genes that are of such high interest, we propose an approach in which the gene
loci are related to the spatial density of the chromatin loops, with higher density
usually located at the territory center, and lower density at the periphery. This
will remove the arbitrary thresholds between the loci deep inside the chromo-
Fig. 1. The individual channels of the acquired images. From left to right: cell nucleus,
chromosome territories, gene loci. (xy-, xz-, and yz-planes; the ticks at the image borders
indicate the position of the cutting planes)
some territories, the loci in the areas of lesser density chromatin, and the loci
which seem to be completely outside the territories (in such cases, it is assumed
that the gene locus is on a chromatin strand that extends relatively far from the
center of the territory, and whose fluorescence is not high enough for the strand
to be detectable on its own).
3
3.1
Analysis Method
Nucleus Segmentation
As the basis for the further analysis, in each image, the cell nucleus was segmented. The cell nuclei in the data set were counterstained with DAPI, and
they were approximately spherical and relatively regular, with very few to none
non-convex areas. Segmenting cell nuclei is a common task in biomedical image
analysis, and most of the common approaches are reliable when used on regularly
shaped cells with enough contrast.
For our study, we selected the method described by Gué et al. in [10]. This
approach first median-filters the image to suppress noise, then determines an
intensity threshold using the ISODATA algorithm [11]. Finally, the nucleus mask
is smoothed using a 3D mathematical morphologic closing, followed by opening.
3.2
Gene Loci Detection
To detect small, point-like particles in fluorescent images, several methods have
been proposed, whose properties and performance have been discussed in comparison studies such as [12].
From these methods, we selected the one proposed by Matula et al. in [13],
based on the 3D morphological extended maxima (EMax) transform. After suppressing the noise with a 3D Gaussian filter, a morphological HMax transform
is computed; this transform identifies those local intensity maxima whose height
exceeds a specified threshold. The EMax image is then defined as the regional
maxima of the result. After the computation of the EMax transform, the components whose size does not fall within the range allowed for fluorescence spots
can be discarded.
An advantage of this method is the straightforward relationship between its
result and its HMax height parameter. Since the number of fluorescent spots
present in each nucleus in our data set is expected to be equal to 2, the height
threshold can be automatically adjusted for each image, so that the spots are
detected even in those images whose contrast deviates from the average contrast of the data set. This helps with non-supervised processing of large data
sets, in which the images acquired later during the session may be affected by
photobleaching.
3.3
Chromosome Territory Processing
ToDo:
In order to process the chromosome territories, it is necessary to suppress
the bleed-through from the nucleus channel, i.e., the part of the signal emitted
not by the marked chromosomes, but by the whole counterstained nucleus.
To do this, we first take that part of the territory channel which corresponds
to the areas masked by the nucleus segmentation, as obtained in section 3.1.
Within this region, we compute an intensity threshold using the Otsu algorithm [14]. This value is then subtracted from the territory channel, clamping
the lowest intensities at 0. This removes the background fluorescence caused by
the bleed-through. Following this, the territory image intensities were normalized
to h0; 1i.
For noise suppression, we used Gaussian blurring with σ = 1 voxel. Apart
from suppressing the noise, this also smoothed the territories proper, replacing
the need for averaging the intensity values around the gene loci positions detected
in section 3.2. The influence of any possible imprecisions in the localization of
the loci has also been reduced.
For each gene locus, the normalized territory intensity at its position was
obtained. Being already normalized, this value would represent the location of
the gene locus in respect to the chromosome territory – a value of 0 would mean
the locus is completely “outside”, on an otherwise invisible chromatin strand
extending from the the territory; conversely, a value of 1 would correspond to the
locus being situated in the area of the highest fluorescence, and, consecutively,
the highest chromatin density (which may sometimes, but not always, correspond
to the center of a hard segmentation of the territory).
However, in some of the images, the maximum intensities of the two chromosome territories differed significantly, and relating both gene loci to the same
maximum intensity might not have revealed all important information. Therefore, for each locus, we also calculated the ratio of the territory intensity at its
location to the maximum intensity of the territory to which this particular locus
belonged.
To determine which locus belonged to which territory, we computed rough
segmentations of the territories by searching for the lowest intensity threshold yielding two connected components. These hard masks did not necessarily
represent the ideal segmentations that would be comparable across all images.
However, within a single image (and therefore coming from the same thresholding operation), they made it possible to determine whether a gene locus was
closer to one territory or the other. To do this, we calculated the Euclidean distance transform (DT) of the rough territory mask. In the DT image, every voxel
value either corresponded to its distance from the territory mask (for voxels
outside the mask), or had the value of 0 (for voxels inside the mask). From the
DT images, it was then possible to determine which locus was closer to which
terrritory.
We can see an example of these results in Fig. 2. The line running from top to
bottom is the boundary between the influence zones of the two territories, whose
rough segmentations are also shown. The cross marks correspond to the positions
of the two gene loci; note that the left locus appears to be positioned just at the
border of the territory mask. If the threshold for the hard segmentation changed,
the position of the locus might change from “inside” to “outside” or vice versa.
Because of this, the hard classification is prone to bias related to the threshold
value.
Fig. 2. The boundary between the influence zones of the two chromosome territories,
overlaid over the original territory channel. The small closed curves around the territories show the rough boundaries, from which the DT was computed. The crosses mark
the positions of the gene loci; the loci themselves are not visible in this channel. Note
the left locus lying just at the border of the hard segmentation. (xy-plane)
However, the assignment of the loci to the territories is not negatively influenced by the fact that the segmentation may not be precise. This is illustrated in
Fig. 3. The figure shows that with different thresholds (all of them yielding two
chromosome territories), the influence zones undergo changes much less rapid
than the territory masks themselves, thus still allowing reliable assignment.
Fig. 3. Stability of the influence zones. X-axis: different intensity thresholds yielding
two territories. Y-axis: the amount of voxels which are different when compared to using
the first threshold, as percentage of the whole image. We can see that even though the
difference between the masks taken at higher thresholds grows, as the masks shrink,
the difference between the influence zones is much more stable, keeping the assignment
of the loci to the territories largely independent of the exact territory segmentation
4
Results
The results measured can be seen in Fig. 4. The figure shows the histograms of
the chromatin intensities at the gene loci, relative to the maximum intensity of
the territories assigned to each locus. In each nucleus, a pair of gene loci was
present, one locus per each of the chromosome territories. The grey bars show
the data for the less intensive loci of each pair, the black bars show the data for
the more intensive loci.
We can clearly see that the two populations are different, suggesting than
in each nucleus, one of the two copies of the gene tends to be located more
centrally, while the other is located more peripherally, possibly allowing the
gene more interaction with its surrounding.
To investigate the relationship between the chromatin intensities at the gene
loci, and the distances of the loci from the rough segmentation boundaries of
the chromosome territories, we calculated the Pearson’s coefficient according to
ρX,Y =
cov(X, Y )
,
σX σY
(1)
Fig. 4. Histogram of the chromatin intensities at the gene loci, relative to the maximum intensity of the assigned chromosome territory. The grey bars represent the less
intensive loci of each pair, the black bars represent the more intensive ones
where cov is the covariance, and σX is the standard deviation of X. The value of
the correlation coefficient was below 0.39, which suggests that there is indeed a
positive relationship between the values, but the distance from the hard segmentation boundary does not capture all details of the chromatin structure inside
the territory (such as in the cases when the territory contains interior areas with
lower chromatin density).
5
Conclusion
We have studied ToDo: some cells, chromosomes and genes, which were fluorescently stained somehow.
After a relatively straightforward segmentation of the cell nuclei and the
detection of the gene loci, we focused on the analysis of the locus positions in
relation to their chromosome territories.
As noted in the literature, segmentation of chromosome territories is difficult,
mainly because of the fact that they have no definite, hard boundary. To avoid
the bias caused by the arbitrary selection of such boundary, we analyzed the
gene loci not in relation to the territory segmentation, but rather to the fluorescence intensity, corresponding to the density of the chromatin at the specified
location. As an additional benefit of this, we were able to take into account the
variations of the chromatin density in the areas that would otherwise fall inside
the hard segmentation boundary. These areas would then be counted as being
“hidden” in the deep interior, while in reality, the chromatin strands may be
more decondensed there, allowing for more interaction with their surrounding.
Our approach enabled us to determine the intensities at the gene loci and
observe that in each nucleus, one gene locus of the pair tends to be located in
an area of high fluorescence, while the other locus of the pair is located more
peripherally. From the medicine and biology point of view, this is related to
the amount of interaction with the neighboring chromosomes that is possible
for such locus, and may be of high interest to further studies focusing e.g. on
chromosomal breakpoints.
References
1. Tanke, H. J., Florijn, R. J., Wiegant, J., Raap, A. K., Vrolijk, J.: CCD microscopy
and image analysis of cells and chromosomes stained by fluorescence in situ hybridization. The Histochemical Journal (1995), vol. 27, no. 1, 4–14.
2. Van den Berghe, H., Cassiman, J.-J., David, G., Fryns, J.-P., Michaux, J.-L.,
Sokal, G.: Distinct Haematological Disorder with Deletion of Long Arm of No. 5
Chromosome. Nature (1974), vol. 251, no. 5474, 437–438.
3. Carlock, L. R., Wasmuth, K. J.: Molecular Approach to Analyzing the Human
5p Deletion Syndrome, Cri du Chat. Somatic Cell and Molecular Genetics (1985),
vol. 11, no. 3, 267–276.
4. Siddiqi, R., Gilbert, F.: Disease Genes and Chromosomes: Disease Maps of the
Human Genome, Chromosome 5. Genetic Testing (2003), vol. 7, no. 2, 169–87.
5. Huff, C. D., Witherspoon, D. J., Zhang, Y., Gatenbee, C., Denson, L. A., Kugathasan, S., Hakonarson, H., Whiting, A., Davis, C. T., Wu, W., Xing, J.,
Watkins, W. S., Bamshad, M. J., Bradfield, J. P., Bulayeva, K., Simonson, T. S.,
Jorde, L. B., Guthery, S. L.: Crohns Disease and Genetic Hitchhiking at IBD5.
Molecular Biology and Evolution (2012), vol. 29, no. 1, 101–111.
6. Rinke, B., Bischoff, A., Meffert, M.-C., Scharschmidt, R., Hausmann, M.,
Stelzer, E. H. K., Cremer, T., Cremer, C.: Volume Ratios of Painted Chromosome
Territories 5, 7 and X in Female Human Cell Nuclei Studied with Confocal Laser
Microscopy and the Cavalieri Estimator. Bioimaging (1995), vol. 3, no. 1, 1–11.
7. Eils, R., Dietzel, S., Bertin, E., Schröck, E., Speicher, M. R., Ried, T., RobertNicoud, M., Cremer, C., Cremer, T.: Three-Dimensional Reconstruction of Painted
Human Interphase Chromosomes: Active and Inactive X-chromosome Territories
Have Similar Volumes but Differ in Shape and Surface Structure. Journal of Cell
Biology (1996), vol. 135, 1427–1440.
8. Dietzel, S., Schiebel, K., Little, G., Edelmann, P., Rappold, G. A., Eils, R., Cremer, C., Cremer, T.: The 3D Positioning of ANT2 and ANT3 Genes within Female
X Chromosome Territories Correlates with Gene Activity. Experimental Cell Research (1999), vol. 252, 363–375.
9. Kurz, A., Lampel, S., Nickolenko, J. E., Bradl, J., Benner, A., Zirbel, R. M., Cremer, T., Lichter, P.: Active and Inactive Genes Localize Preferentially in the Periphery of Chromosome Territories. Journal of Cell Biology (1996), vol. 135, no. 5,
1195–1205.
10. Gué, M., Messaoudi, C., Sheng Sun, J., Boudier, T.: Smart 3D-FISH: Automation
of Distance Analysis in Nuclei of Interphase Cells by Image Processing. Cytometry
Part A (2005), vol. 36, no. 4, 279–293.
11. Ridler, T., Calvard, S.: Picture Thresholding Using an Iterative Selection Method.
IEEE Transactions on Systems, Man, and Cybernetics (1978), vol. 8, no. 8, 630–632.
12. Smal, I., Loog, M., Niessen, W., Meijering, E.: Quantitative Comparison of Spot
Detection Methods in Fluorescence Microscopy. IEEE Transactions on Medical
Imaging (2010), vol. 29, no. 2, 282–301
13. Matula, P., Verissimo, F., Wörz, S., Eils, R., Pepperkok, R., Rohr, K.: Quantification of Fluorescent Spots in Time Series of 3-D Confocal Microscopy Images of Endoplasmic Reticulum Exit Sites Based on the HMAX Transform. In: Molthen, R. C.,
Weaver, J. B. (eds.): Medical Imaging 2010: Biomedical Applications in Molecular,
Structural, and Functional Imaging, vol. 7626. Society of Photo-Optical Instrumentation Engineers, San Diego (2010).
14. Otsu, N.: A Threshold Selection Method from Gray-Level Histograms. IEEE Transactions on Systems, Man, and Cybernetics (1979), vol. 9, no. 1, 62–66.