Download Tetrahedron template

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Clinical decision support system wikipedia , lookup

Knowledge representation and reasoning wikipedia , lookup

Expert system wikipedia , lookup

Transcript
Expert System with Application
journal homepage: www.elsevier.co m/locate/eswa
Thyroid Diseases Diagnosis Expert System Using Decision Tree
Vivi and Derwin Suhartono
Computer Science Department, Bina Nusantara University, Jakarta, Indonesia
ARTICLE INFO
ABSTRACT
Keywords:
Expert system
Thyroid
Disease
Diagnosis
Decision Tree
Thyroid gland is an important organ of the endocrine system. It releases hormones
which control body metabolism. However, around two hundreds million
populations worldwide had suffered from thyroid disease. Clinicians often
experience difficulties in the diagnosis of thyroid diseases due to the inherent
complexity of the clinical process and lack of comprehensive automated
diagnostic tools. Thus, a decision support system based on expert system has been
projected according clinical symptoms as predictive variable in decision tree. The
aim of this study is to help improve the performance of thyroid clinical
assessment, specifically from the clinical decision support perspective. A
retrospective data analysis of the clinical studies evaluating sixteen symptom risk
factors implemented for the development of diagnosis models to distinguish six
kinds of thyroid related diseases. The results of the thyroid diseases diagnosis
expert system evaluation had reached 86, 67% accuracy which matched with
those of the physicians’ decisions.
1. INTRODUCTION
Thyroid hormones affect various metabolic events. As a
the changes of these hormones is required a hormones test
which can be costly in the developing country.
result, occurrence of function changes in the various
The increasing need to develop a more basic and
organs is resulted if thyroid disease begins. The symptoms
accessible approach need to be taken in hand in order to
vary from palpitation, dyspnea, exophthalmoses and
cut down the cost as well as the time has drawn the
dysphagia. Its pathogenesis involves both genetic and
attention of a diverse array of fields, including artificial
environmental factors. The long-term persistence of
intelligence and biomedical engineering, explaining why
thyroid diseases can cause susceptibility to specific
related technologies such as expert systems and decision
complications depending on the degree of thyroid disorder.
tree have been adopted for clinical decision support
Although there is a risk in every age and gender, thyroid
system.
disease is often observed as the person gets older and
For instance, Chang et al. (2012) developed a hybrid
women are 7 times more prone to thyroid gland disease
decision support model to discover informative knowledge
(National Women’s Health Information Centre, 2013).
in diagnosing acute appendicitis. This study had develop a
Thyroid gland insufficiency is easily defined with the
simple and reliable hybrid decision support model by
measurement of free T3 (FT3), free T4 (FT4), TSH, TT3
combining statistical analysis and decision tree algorithms
and TT4 hormones before the treatments begins. However,
to ensure high accuracy of early diagnosis in patients with
suspected acute appendicitis and to identify useful decision
Sindh University Research Journal published in 2011
rules. Statistical analysis approaches were used as a feature
featured a knowledge based expert system for symptomatic
selection process in the design of decision support models,
automated healthcare which developed by Soomro et al. It
including the Chi-square test, Fisher’s exact test, the
is a web application which can be a good solution for the
Mann-Whitney U-test (p < 0.01), and Wald forward
medical field, where the doctors and patient are too far
logistic regression (entry and removal criteria of 0.01 and
with each other, but the system which interact to patient be
0.05, or 0.05 and 0.10, respectively).
a computer and performs the diagnosing of disease which
The final decision support models were constructed
is disease recognition by selecting symptoms, and then
using the C5.0 decision tree algorithm of Clementine 12.0
symptoms are checked from the knowledge base for the
after preprocessing. A statistically significant difference
recognition of the disease and suggests treatment of
was detected in the pair wise comparison of ROC curves (p
disease.
< 0.01, 95% CI, 3.13-14.5; p < 0.05, 95% CI, 1.54-13.1).
It also includes the information portion disease
The larger induced decision model was more effective for
information containing list of diseases where user can view
identifying acute appendicitis in patients with acute
the disease, it also contains the laboratory test portion,
abdominal pain, whereas the smaller induced decision tree
which physically doesn’t test, but by inserting the resulted
was less accurate with the test data.
values to a particular test, it will check the parameter
Takada et al. (2012) developed a prediction of axillary
lymph node metastasis in primary breast cancer patients
values of report and make a process to identify disease and
tell a solution for the disease from knowledge base.
using a decision tree-based model. This study aim to
Motivated by the worldwide increasing number of
develop a new data-mining model to predict axillary lymph
research in clinical decision support system each year,
node (AxLN) metastasis in primary breast cancer by
researchers aim to help health care professionals in
decision tree-based prediction method—the alternating
diagnosis of thyroid disease. Using the basic characteristic
decision tree (ADTree). Clinical datasets for primary breast
symptom of thyroid related diseases will be much more
cancer patients who underwent sentinel lymph node biopsy
efficient. These characteristic symptom form great data
or AxLN dissection without prior treatment were collected
stacks when used in each person’s diagnosis. Therefore, a
from three institutes (institute A, n = 148; institute B, n =
decision support system which will be helpful for the
143; institute C, n = 174) and were used for variable
diagnosis of thyroid related disease
selection,
model
training
and
external
validation,
respectively.
2. LITERATURE REVIEW
The models were evaluated using area under the
In this part, the information related with the easier
receiver operating characteristics (ROC) curve analysis to
conception of the decision support system’s construction in
discriminate node-positive patients from node-negative
thyroid disease diagnosis is introduced in the shape of sub-
patients. The ADTree model selected 15 of 24 clinic
classifications.
pathological variables in the variable selection dataset. The
2.1 THYROID
resulting area under the ROC curve values were 0.770
Thyroid gland is one of the important endocrine
[95% confidence interval (CI), 0.689–0.850] for the model
organs in human body. It is located at the front of the
training dataset and 0.772 (95% CI: 0.689–0.856) for the
neck under Adam’s apple where collarbones meet. The
validation dataset, demonstrating high accuracy and
gland produces hormones like triiodothyronine (T3)
generalization ability of the model. The bootstrap value of
and thyroxine (T4) that control the way every cell in the
the validation dataset was 0.768 (95% CI: 0.763–0.774).
body uses energy or metabolism (National Institutes of
Health, 2013). Controlling metabolism is essential for
Expert system is defined as one, which contains
regulating weight, body temperature, mental and
information obtained from a human expert, and
physical energy levels. As time goes by, thyroid gland
represents that information in the form of rules, such as
might degrade and encounters some age-related
IF–THEN. The rule can then be used to perform
problem.
operations on data to inference in order to reach
When the body releases too much thyroid hormone,
appropriate conclusion. These inferences are essentially
the condition is called hyperthyroidism. And an
a computer program that provides a methodology for
underactive thyroid gland leads to hypothyroidism.
reasoning about information in the rule base or
Several thyroids related medical conditions are De
knowledge base, and for formulating conclusions
Quervain’s thyroiditis, Hashimoto thyroiditis, Grave’s
(Hemmer, 2008).
disease and Plummer’s disease. These thyroids related
medical condition occasionally quite similar in some
basic symptoms fortunately it is still distinguishable by
few dominant characteristic symptoms.
Figure 2. The Structure of Expert System
2.3 DECISION TREE
A decision tree takes as input an object or situation
described by a set of properties, and outputs a yes or no
Figure 1. The Anatomy of Thyroid Gland
decision. Decision trees therefore represent Boolean
functions. Functions with a larger range of outputs can
2.2 EXPERT SYSTEM
Expert system is a branch of applied artificial
intelligence, an area of computer science focusing on
creating machines that can engage on behaviors that
humans consider intelligent and was developed by the
artificial intelligence community in the mid-1960s.
The basic idea behind expert system is simply that
expertise, which is the vast body of task-specific
knowledge, is transferred from a human to a computer.
This knowledge is then stored in the computer and
users call upon the computer for specific advice as
needed. The computer can make inferences and arrive
at a specific conclusion. Then like a human consultant,
it gives advices and explains, if necessary, the logic
behind the advice (Russell & Norvig, 2010).
also be represented, but for simplicity we will usually
stick to the Boolean case. Each internal node in the tree
corresponds to a test of the value of one of the
properties, and the branches from the node are labeled
with the possible values of the test. Each leaf node in
the tree specifies the Boolean value to be returned if
that leaf is reached (Kendall & Creen, 2007).
After the each node in decision trees are filled by a
statement, the rules can be written down on the tree
from root to the leaf (IF-THEN rules). Getting decision
in this way provides the confirmation of the result of
the work of inference mechanism.
These rules can be checked in terms of application
by showing them to an expert whether they are
meaningful or not. The decision trees are used in the
analysis and applications of classifying various cases
into low-mid or high risk groups; creating rules in order
to predict the future events; definition of the relations of
the certain sub-groups; the unit of the categories and
getting the most effective decisions by the help of the
medical observation (Poole & MackWorth, 2010).
3. METHODOLOGY
Developing an expert system involves tasks such as
acquiring knowledge from an acknowledged domain
expert, documenting it and organizing it, generating
knowledge net to check the relationships between different
knowledge sources, checking for consistency in the
Yes
No
knowledge and finally transforming the knowledge net into
Figure 3. Decision Tree.
a computer program using appropriate tools.
The expert system operation is achieved via the decision
trees structure which is related as one of disease diagnosis
techniques. The basic characteristic symptom of thyroid
related diseases will be inquire and assess with the rule
stored in knowledge base. This expert system uses the
knowledge of the domain which acquired from thyroid
expert to ensure its credibility and reliability.
Below are the symbol descriptions corresponding with
those shown in decision tree. Table 1 describes the
symptoms of thyroid diseases which represented in
decision tree node by Sn symbol. Table 2 explains about
thyroid diseases where in decision tree node is represented
by Dn symbol.
Later this knowledge included disease and symptom
Table 1. Symptoms of Thyroid Diseases.
related toward condition was coded in knowledge base and
through inference engine, a specified control strategy or
search techniques to arrive at solutions which search
through the knowledge base to arrive at decisions.
Symbol
S01
Weight loss
S02
Sleeping difficulty (insomnia)
Basically, knowledge base is the state space and the
inference mechanism is a search process.
Joint and muscle pain OR limb
S03
The expert system diagnoses thyroid disease by posing
series of questions about the characteristic symptom which
S05
S06
identified by following the path from the topmost node, the
root, to a node without children, a leaf, according to the
answers that most reflect the symptom.
sensitive toward light OR protruding
eyes (exophthalmoses)
thereby form a hierarchy, encoded as a decision tree. In the
node has a ‘yes’ child and a ‘no’ child. A disease is
jaw area OR ear area (tender goiter)
Swelling around the eyes OR
contained in a node, and every node points to child node
simplest form is ask yes-or-no questions, and each internal
numbness
Tenderness around neck area OR
S04
will be used as predictive variable. Each question is
for each possible answer to its question. The questions
Symptom
S07
Inflammation around neck area OR
jaw area OR ear area (swelling
goiter)
Increased appetite OR frequent
bowel movement (diarrhea) OR
hand tremor
Table 2. Thyroid related Diseases.
Symbol
Disease
D1
De Quervain’s Thyroiditis
D2
Hyperthyroid
D3
Grave’s Disease
D4
Plummer’s Disease
D5
Hashimoto Thyroiditis
D6
Hypothyroid
D7
Undefined disease
Figure 5. Disease List.
The established expert system was further evaluated its
prediction accuracy. For the ensemble procedure, the data
IF-THEN rules are simple but powerful forms of
knowledge representation providing the flexibility of
combining declarative and procedural representation for
using them in a unified form. IF-THEN rule has a set of
were randomly sampled patients with variety in gender and
diseases. Of 30 patients as random samples evaluated in
this study, 26 patients can be detected and identified the
kind of thyroid disease by expert system. The detected
antecedents and a set of consequents. The antecedents
thyroid related diseases are De Quervain’s thyroiditis,
specify a set of symptoms and the consequents a set of
hyperthyroid,
diseases. For instance Grave’s disease is detected through
the IF-THEN rule below.
IF (weight loss = yes) AND (sleeping difficulty
/insomnia = no) AND (swelling around the eyes OR
sensitive toward light OR protruding eyes/exophthalmoses
= yes) THEN Grave’s disease.
4. RESULT
Grave’s
disease,
Plummer’s
disease,
Hashimoto thyroiditis and hypothyroid.
Due to some of the characteristic symptoms especially
eyes irritations that found in Grave’s disease appear to be
more dominant in women than men. As a result, there are 4
undetected patient samples who suffer Grave’s diseases are
all happen to be men who doesn’t experience swelling
around the eyes, neither sensitive toward light or
Following are the screenshot of the established thyroid
diseases diagnosis expert system using decision tree.
protruding eyes (exophthalmoses).
From the above experiments, prediction accuracy of
thyroid disease diagnosis expert system can be calculated
from the ratio of total sample that can be detected by the
expert system with the total sample which is randomly
taken.
By using existing patient data which had been validate
by physician decision to estimate the diagnostic accuracy
of thyroid disease diagnosis expert system. The result
showed decision tree was effective in detecting thyroid
related
Figure 4. Main Menu.
diseases.
The
diagnostic
accuracy
was
approximately 86, 67% from 30 random sample patients.
total sample detected by the expert system
the total sample which is randomly taken
References
X 100%
Brent, G. A. (2010). Thyroid Function Testing. USA:
Springer.
26
X 100%
Cooper, D. S. (2009). Medical Management of Thyroid
= 86, 67%
30
Disease, Second Edition. UK: Taylor & Francis.
5. DISCUSSION AND CONCLUSION
De Groot, L. J., & Jameson J. L. (2013). Endocrinology
Most of the symptoms used in decision tree node are
reduced clinical parameters set by choosing some basic
dominant characteristic symptom found in the thyroid
related disease. This reduces the time required to perform
the logical inference to detect and identify thyroid related
disease
and
it
makes
the
resulting
rules
more
Adult and Pediatric: The Thyroid Gland. USA:
Elsevier Health Sciences.
Hemmer, B. (2008). Expert Systems and Applied Artificial
Intelligence. USA: Prentice Hall.
Kendall, S., & Creen, M. (2007). An Introduction to
Knowledge Engineering. USA: Springer.
comprehensible, thereby increasing the resulting accuracy.
Monaco, F. (2012). Thyroid Diseases. UK: CRC Press.
However, these clinical parameters set quality may even be
National Institutes of Health. (2013). Endocrinology:
increased by selecting informative features from a more
Thyroid. Accessed November 07th, 2013 from
gender and age specific dataset.
http://www.nlm.nih.gov/medlineplus/ency/article0
This study developed a simple and reliable hybrid
00174.htm
decision support model based on decision tree algorithm to
Poole, D., & MackWorth, A. (2010). Artificial Intelligence
provide a reliable accuracy on diagnosis of patients with
Foundations of Computational Agents. Canada:
suspected thyroid related diseases. The experimental
Cambridge University Press.
results show that thyroid disease diagnosis expert system
using decision tree is an effective decision support model
Russell, S., & Norvig, P. (2010). Artificial Intelligence A
Modern Approach. USA: Pearson Education Inc.
with its diagnostic accuracy approximately 86, 67% and
Chang, S. S., Jang, B. K., Seo, S. T., Kim, M. S., & Kim,
thus demonstrated its feasibility for detecting thyroid relate
Y. N. (2012). A hybrid decision support model to
diseases. Therefore, the decision model developed in our
discover informative knowledge in diagnosing
study can be applied to support the initial decision of
acute appendicitis. Journal of BMC Medical
clinicians and increase vigilance when detecting suspected
Informatics and Decision Making 2012, 12:17.
thyroid related diseases.
Soomro, A. A., Memon, N. A., & Memon, M. S. (2011).
Knowledge Based Expert System for Symptomatic
Acknowledgements
Automated Healthcare. Sindh University Research
The author want to express the gratitude to Dr. Simon
Journal (Science Series) 2011, Vol.43 (1-A) 79-
Forehan, PhD., Dr. Pradana Soewondo, PhD., and Dr.
84.
Dante Saksono, PhD. for providing the necessary data for
Takada, M., Sugimoto, M., Naito, Y., Moon, H. G., Han,
this study and for their roles in revising this work to be
W. S., Noh, D. Y., Kondo, M., Kuroi, K., Sasano,
more according to issues. For kindness and patient.
H., Inamoto, T., Tomita, M., & Toi, M. (2012).
Prediction of axillary lymph node metastasis in
primary breast cancer patients using a decision
tree-based model. (2012). Journal of BMC
Medical Informatics and Decision Making 2012,
12:54.