Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Linking Formal Ontologies: Scale, Granularity and Context Alan Rector Medical Informatics Group, University of Manchester www.cs.man.ac.uk/mig www.opengalen.org img.cs.man.ac.uk [email protected] 1 Manchester Medical Informatics Group OpenGALEN Why use Logic-based Ontologies? because Knowledge is Fractal! & Changeable! 2 Manchester Medical Informatics Group OpenGALEN Four Roles of Terminology/Ontologies • Content of Databases and Patient Records – Structural linkage within EPR/EHR & messages – Content of EPR/EHR & messages • Capturing information - the user interface • Linkage between domains – – – – – Health and Bio Sciences Macro, Micro, and Molecular scales Contexts: Normal / abnormal; species; stage of development Healthcare delivery and Clinical research Patient Records and Decision Support • Indexing Information – Metadata and the semantic web • www.semanticweb.org www.w3c.org 3 Manchester Medical Informatics Group OpenGALEN Logic based ontologies • The descendants of frame systems and object hierarchies via KL-ONE • “is-kind-of” = “implies” – “Dog is a kind of wolf” means “All dogs are wolves” – Therefore logically computable • Modern examples: OIL, DAML+OIL (“OWL”?) – Underpinned by the FaCT family of Description Logic Reasoners • Others LOOM, CLASSIC, BACK, GRAIL,... • www.ontoknowledge.org/oil www.semanticweb.org 4 Manchester Medical Informatics Group OpenGALEN Logic-based Ontologies: Conceptual Lego gene hand protein cell extremity expression body Lung chronic inflammation acute infection bacterial abnormal deletion normal 5 polymorphism ischaemic Manchester Medical Informatics Group OpenGALEN Logic-based Ontologies: Conceptual Lego “SNPolymorphism of CFTRGene causing Defect in MembraneTransport of ChlorideIon causing Increase in Viscosity of Mucus in CysticFibrosis…” “Hand which is anatomically normal” 6 Manchester Medical Informatics Group OpenGALEN What’s in a “Logic based ontology”? • Primitive concepts - in a hierarchy – Described but not defined • Properties - relations between concepts – Also in a hierarchy • Descriptors - property-concept pairs – qualified by “some”, “only”, “at least”, “at most” • Defined concepts – Made from primitive concepts and descriptors • Axioms – disjointness, further description of defined concepts • A Reasoner – to organise it for you 7 Manchester Medical Informatics Group OpenGALEN Logic Based Ontologies: A crash course Thing Feature Structure pathological Heart Thing red + feature: pathological MitralValve Encrustation * ALWAYS partOf: Heart * ALWAYS feature: pathological Structure + feature: pathological + involves: Heart red + partOf: Heart Encrustation + involves: MitralValve 8 + (feature: pathological) Manchester Medical Informatics Group OpenGALEN Bridging Bio and Health Informatics • Define concepts with ‘pieces’ from different scales and disciplines – “Polymorphism which causes defect which causes disease” • Define concepts which make context explicit – “ ‘Hand which is anatomically normal’ has five fingers” • Separate properties for different contexts/views – “Abnormalities of clinical parts of the heart” • includes pericardium 9 Manchester Medical Informatics Group OpenGALEN Species Bridging Scales and context with Ontologies Genes Protein Function Gene in Species Disease Protein coded by gene in species Function of Protein coded by gene in species Disease caused by abnormality in Function of Protein coded by gene in species 10 Manchester Medical Informatics Group OpenGALEN Representing context and views by variant properties Organ is_part_of is_clinically_part_of Pericardium OrganPart Heart is_structurally_part_of Disease of (is_part_of) Heart CardiacValve Disease of Pericardium 11 Manchester Medical Informatics Group OpenGALEN The cost: Ontologies are not Thesauri A Mixed Hierarchy organ heart heart valve aortic valve } kind } p art } kind } p art aortic valve cu sp Works for navigation by humans Works for “Disease of…’ and ‘Procedure on…’ Fails for “Surface of…” How can the computer know the difference? 12 Manchester Medical Informatics Group OpenGALEN From a thesaurus to a logic-based ontology Untangle part-whole and is-kind-of in anatomic ontology Link Clinical Ontology with Anatomical ontology Add rule that “Disorder of part disorder of whole” Reasoner can then create automatically: A logic-based is-kind-of (subsumption) hierarchy disorder of organ disorder of heart disorder of valve in heart disorder of aortic valve in heart disorder of cusp in aortic valve in heart 13 Manchester Medical Informatics Group OpenGALEN Examples common in Bio Ontologies Golgi membrane Integral protein Plasma membrane Apical plasma membrane Is part of Is part of 14 Manchester Medical Informatics Group OpenGALEN The Cost: Normalising (untangling) Ontologies Structure Structure Part-whole Function Function Part-whole 15 Manchester Medical Informatics Group OpenGALEN The Cost: Normalising (untangling) Ontologies Making each meaning explicit and separate PhysSubstance Protein ‘ProteinHormone ProteinHormone’ Insulin Enzyme ‘Enzyme’ Steroid SteroidHormone ‘SteroidHormone’ ‘Hormone’ Hormone ProteinHormone^ ‘ProteinHormone’ Insulin^ SteroidHormone^ ‘SteroidHormone’ Catalyst ‘Catalyst’ ‘Enzyme’ Enzyme^ ...and helping keep argument rational and meetings short! … ActionRole PhysiologicRole HormoneRole CatalystRole … Hormone ProteinHormone SteroidHormone Catalyst Enzyme = = = = … Substance BodySubstance Protein Steroid … Substance & playsRole-HormoneRole Protein & playsRole-HormoneRole Steroid & playsRole-HormoneRole Substance & playsRole CatalystRole ?=? Protein & playsRole-CatalystRole 16 Manchester Medical Informatics Group OpenGALEN The Cost • You can’t say everything you want to – Expressiveness costs computational complexity • More inference takes more time – Scaling for complex tasks still being investigated • Many other kinds of reasoning needed It doesn’t make the! Coffee! 17 Manchester Medical Informatics Group OpenGALEN Other benefits • Limit combinatorial explosions From “phrase book” to “dictionary + grammar” Avoid the “exploding bicycle” – – – – 1980 - ICD-9 (E826) 8 1990 - READ-2 (T30..) 81 1995 - READ-3 87 1996 - ICD-10 (V10-19) 587 • V31.22 Occupant of three-wheeled motor vehicle injured in collision with pedal cycle, person on outside of vehicle, nontraffic accident, while working for income – and meanwhile elsewhere in ICD-10 • W65.40 Drowning and submersion while in bath-tub, street and highway, while engaged in sports activity • X35.44 Victim of volcanic eruption, street and highway, while resting, sleeping, eating or engaging in other vital activities 18 Manchester Medical Informatics Group OpenGALEN Other benefits • Index and assemble information Hypertension Idiopathic IdiopathicHypertension Hypertension` Inour ourcompany’s company’sstudies studies In Study a Study a phase 2 Phase Phase22 19 Manchester Medical Informatics Group OpenGALEN Summary: Logic based ontologies because Knowledge is Fractal • Link “Conceptual Lego” – at all levels • indefinitely – Spanning scales, genotype, phenotype, etc. • Model context and views – Express differences explicitly • Manage combinatorial explosion • Index information efficiently Next step: Larger scale demonstrations in Genotype to Phenotype 20 Manchester Medical Informatics Group OpenGALEN