ANEXO III

Neste anexo são mostrados dois artigos, que foram apresentados em eventos relacionados ao tema em estudo, decorrentes da pesquisa. O primeiro artigo, "Classifying Chromosomes Using Radial Basis Function Networks", foi apresentado no First International Congress of Industrial Engineering and XV National Congress of Production Engineering - ENEGEP, em São Carlos - SP, no período de 03 à 07 de Setembro de 1995. Este artigo se encontra nos anais do Congresso, no volume 2, páginas 1137 à 1141. O segundo artigo, "Using Radial Basis Function Networks to Classify Human Chromosomes", foi apresentado no II Simpósio Brasileiro de Redes Neurais - II SRRN, também em São Carlos - SP, no período de 18 à 20 de Outubro de 1995. Este artigo também pode ser encontrado nos anais do Simpósio, nas páginas 133 à 138.

CLASSIFYING CHROMOSOMES USING RADIAL BASIS FUNCTION NETWORKS

José Leomar Todesco

Edgar Augusto Lanzer

Hugh F. VanLandingham

Abstract: This paper presents an approach to the automatic classification of G-banded human metaphase chromosomes using Radial Basis Function Networks (RBFN). The features chosen to represent the chromosome were length, centromere position, and density. These features are the input to the network that classifies the chromosomes according to the Denver group (7 outputs).

Different topologies and free parameters of the network were tested confirming that with the use of this approach classification performance and accuracy of the RBFN compares favorably with a well-developed parametric classifier and multi-layer perceptrons (MLP), used in previous studies.

We now are investigating the use of this approach for the classification of 24 classes of chromosomes (1 - 22 pairs of "outosomes" and two sex chromosomes). We intend to use the output of the Denver classification as input to the next processing stage, together with the features from banding pattern.

Keywords: Artificial Neural Network, RBFN, Automated chromosome classification.

1 - INTRODUCTION

The microscopic analysis of human metaphase chromosomes has as its objective to classify the chromosomes (karyotyping) to evaluate their structural integrity. The microscopic study is delayed and requires the extreme experience of a professional, making it a highly skilled work. The inspection of chromosomes is an essential procedure in many fields of investigation such as detecting genetic abnormality, damage due to environmental factors, cancer diagnosis, or pre-natal diagnosis. For clinical purposes, karyotyping is required in which chromosomes must be assigned to one of 24 classes (Paris conference in 1971). The karyotyping requires great ability of the cytogenecist; this task contains substantial elements of a tedious and repetitive nature, resulting in some interest in recent years in developing automated karyotyping systems [1, 2, 13, 14, 16, 17, 21].

To form a karyotyping of the 46 chromosomes in a normal human cell, some features of chromosomes are needed. The length, centromere position and band pattern along the longitudinal axis have been used for more than 20 years [2]. Piper selected 28 available features, and says that it is impractical to make a exhaustive search for the optimal feature subset [16]. Various classification methods have been applied to such band transition sequences [8], weighted density distributions [14], structural band descriptions [4], Markov networks [3,5] and more recently neural network [1, 19].

Neural network classifiers have been shown to be highly adaptable and capable of generalizing on classes based on training data [6, 11]. RBFN have been applied in classification tasks where classical pattern recognition methods and MLP, usually trained with backpropagation, have not been applied or have been unsuccessful [7].

In the following sections a description of the data base used and the RBFN designare are given.

2 - CHROMOSOME DATABASE AND FEATURES

Three data bases of annotated measurements from G-banded chromosomes, originating in Copenhagen, Edinburgh and Philadelphia, have been used in this study. In each data set correct classification into one of the 24 human chromosome classes was provided by experienced cytogeneticists to permit evaluation of the approach. These three data sets have been largely used in previous classification studies [1]. Details of these data are given in Table 1.

The first data base was collected at Rigshospitalet, Copenhagen in 1975-1978 and consists of 180 G-banded peripheral blood metaphase cells. These chromosomes were carefully measured by densitometry of photographic negatives from selected cells of high quality. The second data base obtained at the MRC, Edinburgh in 1984, contains 125 human male peripheral blood cells. The third data base was obtained at the Jefferson Medical College, Philadelphia, in 1987 and contains 130 cells.

The different preparation techniques and digitization result in obvious differences in chromosome morphology and banding in the three data bases. So, a standardization or normalization is needed to minimize the length and gray level variation between cells (caused mainly for illumination and staining). The features used in our study were length, centromere position, and density. The density was taken from the average points in the longitudinal axis of the chromosome and normalized after that.

Data set

Tissue of Origin

Digitization method

No. in set

Data quality

Copenhagen

Peripheral blood

Densitometry from photographic negatives

8,106

Good

Edinburgh

Peripheral blood

TV Camera

5,469

Fair

Philadelphia

Chorionic villus

CCD line scanner

5,817

Poor

Table 1 - Details of the three chromosome data sets used. [16]

3 - RADIAL BASIS FUNCTION NETWORK (RBFN)

There exists a variety of different ways that neural networks (NNs) can be used in pattern classification [6]. The backpropagation algorithm for training a MLP (supervised) could be seen as an application of a method of optimization known in statistics as stochastic approximation [6, 15]. We can visualize the design of a NN as an approximation problem to find the best result (curve-fitting) in a space of high dimension. Looking at the problem this way, learning is equivalent to finding a surface in a multi-dimensional space proportioned by the best adaptation of the parameters of training, with the criterion of "the best adaptation" measured by some statistical method [6]. Correspondingly, generalization is equivalent to using this multi-dimensional surface to interpolate the test data. This is the approach used in RBFN, which was initially introduced in the problem solution of real multi-variate interpolation.

Broomhead and Lowe, in 1989, were among the first to explore the use of RBFN in the NN field. The RBFN basically consists of an input layer, a hidden layer, and the output layer. Each node in the hidden layer employs radial basis functions to produce a localized output with respect to the input signals. The outputs are combinations of weighted inputs that are mapped by an activation function that is radially symmetric. Each activation function requires a "center" and a scale parameter. The most common radial basis function is the Gaussian function, so that given an input vector X, the output of a single node will be

(3.1)

where the function could be

(3.2)

The values of , j = [1,n], are used in the same manner as with "normal" probability densities to provide "dispersion" scales in each component direction.

Another common variation on the basis functions is to increase their functionality using the Mahalanobis distance in the Gaussian function. The above equation becomes:

(3.3)

where K-1 is the inverse of the covariance matrix of X associated with hidden node C .

Given p-exemplar n-vectors, representing p-classes, the network can be initiated with knowledge of the centers (locations of the exemplars). If cj represents the jth exemplar vector, then we can define the weight matrix C as follows:

C = [c1 c2 ... cn] (3.4)

such that the weights in the hidden node j are the components of the "center" cj. Thus, a hidden-layer node calculates the expression of Eq. (3.2).

The output layer is a weighted sum of the hidden-layer outputs. When presenting an input vector to the network, the network implements

(3.5)

where represents the vector of functional outputs from the hidden layer, and the corresponding center vector. Given some training data with desired responses, the output weights W can be found using the LMS interactively or non-interactively, like descent gradient and pseudo inverse techniques, respectively.

Learning in the hidden-layer is performed using an unsupervised method, typically a clustering algorithm, heuristic clustering algorithm, or supervised algorithm to find the cluster centers (hidden node C). The most common clustering algorithm used to train the hidden layer RBFN is the generalized Lloyd algorithm or the K-means clustering algorithm [7]. Some studies have also used supervised learning of locations of the centers and self-organizing learning of the centers [10, 22].

A simple way of choosing the scaling factors for the Gaussian functions is to set them equal to the average distance between all training data

(3.6)

where is the set of training patterns grouped with cluster center Cj, and Mj is the number of patterns in .

Another manner of choosing the parameters is to calculate the distances between the centers in each dimension and use some percentage of this distance for the scaling factor. In this way, the p-nearest neighbor algorithm has been used. Sometimes, to improve the radius of the Gaussian function, it is interesting to multiply this variance by a constant. The objective is to increase the radius and consequently the amplitude or range of the neuron [18].

4 - CLASSIFICATION RESULTS

After normalization of the data in each data set, half of data were used to train and the other half to test. Then, we further divided the training set into a subtraining set and a cross-validation set. Alternative values for the user-specified parameters were then tried while training on the subtraining set and testing on the cross-validation set. The best-performance parameter values were then employed to train a network on the full training set. The generalization performance of the resulting network is then measured on the test set. In a subsequent experiment the roles of the training data and testing data were interchanged.

A comparison was made between the best performance using our approach and the MLP approach used by Errington & Graham, shown in the Table 2. To train and test a whole data set on a SUN sparc 10 takes about 1 minute. The results have proved that the RBFN have excellent characteristics as classifiers and have an advantage in reduced training time. Finally, it is possible to continually perform the learning process with sequential incoming data, therefore, the network always improves its performance [19].

Data Set

Error rate for RBF (%)

Error rate for MLP (%)

Improvement (%)

Copenhagen

4.1

5.4

24

Edinburgh

9.9

10.1

2

Philadelphia

10.4

14.6

29

Table 2 - Comparison of error rates for the best Denver classifiers used.

5 - CONCLUSION

We have presented in this paper the application of radial basis function networks to the classification of human chromosomes in the Denver group. The results have been compared and have shown improvement the conventional techniques and MLP, trained with back propagation. We are studying the use of this approach in the classification of human chromosomes into 24 classes, with the objective to separate the chromosomes and to form karyotyping.

ACKNOWLEDGMENT

The authors would like to thank all those who helped with this research, specially Dr. Jim Piper at MRC Cytogenetics Unit in Edinburgh, Scotland, to let us use his chromosome databases. The authors also would like to thanks the Brazilian Research Council (CNPq) for support to conduct this research.

REFERENCES

[1] Errington, P. and Graham, J.

Classification of Chromosomes using a Combination of Neural Networks

IEEE International Conference on Neural Networks, 1236-1241, 1993.

[2] Graham, J.

Automation of Routine Clinical Chromosome Analysis I - Karyotyping by Machine.

Analytical and Quantitative Cytology and Histology, vol. 9, No. 5, 383-390, 1987.

[3] Granum, E. & Thomason, M.

Automatically Inferred Markov Network Models for Classification of Chromosomal Band Pattern Structures

Cytometry, vol. 11, 26-39, 1990.

[4] Groen, F.C.A.; ten Kate, T.K.; Smeulders, A.W.M. and Young, I.T.

Human chromosome classification based on local band descriptors

Pattern Recognition Letters, vol. 9, No. 3, 211-222, 1989.

[5] Guthrie, C; Gregor, J. & Thomason, M. G.

Constrained Markov Networks for Automated Analysis of G-Banded Chromosomes Comput. Biol. Med., vol. 23, No. 2, 105-114, 1993.

[6] Haykin, Simon

Neural Networks: A Comprehensive Foundation

Macmillan College Publishing Company, New York, 1994

[7] Hush, Don R. & Horne, B. G.

Progress in Supervised Neural Networks: What's New Since Lippmann?

IEEE Signal Processing Magazine, 8-39, January, 1993.

[8] Johnston, D. A; Tang, K. S. & Zimmermann, S.

Band Features as Classification Measures for G-Banded Chromosome Analysis

Comput. Biol. Med., vol. 23, No. 2, 115-129, 1993.

[9] Kirby, S.P.J. and Theobald, C.M.

Some two-stage procedures for the calculation of discriminant scores in the automated allocation of human chromosomes

Pattern Recognition Letters, vol. 14, No. 3, 221-227, 1993.

[10] Lee, Suhhan and Kil, Rhee M.

A Gaussian Potential Function Network with hierachically Self-Organizing Learning.

Neural Networks, vol. 4, 207-224, 1991.

[11] Lippmann, Richard P.

An Introduction to Computing with Neural Nets

IEEE ASSP Magazine, April, 1987.

[12] Lippmann, Richard P.

Pattern Classification Using Neural Networks.

IEEE Communications Magazine, Nov., 47-64, 1989.

[13] Lundsteen, C.; Gerdes, T & Maar, J

Automatic Classification of Chromosomes as Part of a Routine System for Classification Analysis

Cytometry, vol. 7, 1-7, 1986.

[14] Lundsteen, C.; Gerdes, T.; Granum, E.; Philip, J. and Philip, K.

Automatic chromosome analysis II. Karyotyping of banded human chromosomes using band transition sequences

Clinical Genetics, vol 19, 26-36, 1981.

[15] Moody, J. and Darken, C. J.

Fast learning in Networks of locally-tuned processing units.

Neural Computation, vol. 1, 281-294, 19889.

[16] Piper, J. and Granum, E.

On Fully Automatic Feature Measurement for Banded Chromosome Classification Cytometry, vol. 10, 242-255, 1989.
[17] Piper, J.

Classification of chromosomes constrained by expected class size

Pattern Recognition Letters, vol. 4, No. 5, 391-395, 1986.

[18] Saha, Avijit and Keeler, J. D.

Algorithms for Better Representation and Faster Learning in Radial Basis Function Networks.

Advances in Neural Information Processing System 2, D. S. Touretzki editor, 482-489, 1990.

[19] Sweeney Jr., W. P.; Musavi, M. T. and Guidi, J. N.

Probabilistic Neural Network as Chromosome Classifier.

Proceedings of 1993 International Joint Conference on Neural Networks, vol. 2, 1993.

[20] Tso, M.; Kleinschmidt, P; Mitterreiter, I. and Graham, J.

An efficient transportation algorithm for automatic chromosome karyotypeng

Pattern Recognition Letters, vol. 12, No. 2, 117-126, 1991.

[21] van Vliet, L; Young, I. T. and Mayall, B. H.

The Athena Semi-Automated Karyotyping System

Cytometry, vol. 11, 51-58, 1990.

[22] Wettschereck, D. and Dietterich, T.

Improving the Performance of Radial Basis Function Networks by Learning Center Locations.

Advances in Neural Informations Processing System 4, J. E. Moody, S. J. Hansen and R. L. Lippmann editors, 1133-1140, 1992.