Download Methodology for Pattern Discovery, Validation, and Hypothesis

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Saethre–Chotzen syndrome wikipedia , lookup

Polycomb Group Proteins and Cancer wikipedia , lookup

Epigenetics of neurodegenerative diseases wikipedia , lookup

Epigenomics wikipedia , lookup

Minimal genome wikipedia , lookup

Long non-coding RNA wikipedia , lookup

Ridge (biology) wikipedia , lookup

Gene therapy of the human retina wikipedia , lookup

Gene therapy wikipedia , lookup

Biology and consumer behaviour wikipedia , lookup

Genomic imprinting wikipedia , lookup

Genetic engineering wikipedia , lookup

Gene desert wikipedia , lookup

Non-coding DNA wikipedia , lookup

Epigenetics of diabetes Type 2 wikipedia , lookup

Epigenetics in learning and memory wikipedia , lookup

Genome evolution wikipedia , lookup

Gene nomenclature wikipedia , lookup

Point mutation wikipedia , lookup

Genome (book) wikipedia , lookup

Primary transcript wikipedia , lookup

Gene expression programming wikipedia , lookup

Genome editing wikipedia , lookup

Nutriepigenomics wikipedia , lookup

Site-specific recombinase technology wikipedia , lookup

History of genetic engineering wikipedia , lookup

RNA-Seq wikipedia , lookup

Vectors in gene therapy wikipedia , lookup

Gene wikipedia , lookup

Epigenetics of human development wikipedia , lookup

Transcription factor wikipedia , lookup

Gene expression profiling wikipedia , lookup

Helitron (biology) wikipedia , lookup

Microevolution wikipedia , lookup

NEDD9 wikipedia , lookup

Designer baby wikipedia , lookup

Artificial gene synthesis wikipedia , lookup

Therapeutic gene modulation wikipedia , lookup

Transcript
Mahmuda Khan
METHODOLOGY FOR PATTERN
DISCOVERY, VALIDATION, AND
HYPOTHESIS DEVELOPMENT FROM
THE ANNOTATED BIOLOGICAL WEB
Goal
 To obtain training data – sentences from the
literature – to validate patterns involving
triplets of Arabidopsis thaliana genes, GO
terms and PO terms
Validation of Triplets
What is a triplet?
- (gene, GO, PO)
1. Arabidopsis gene
2. GO: Gene Ontology- universal biological
process (BP) or cellular component (CC) or
molecular function (MF)
1. PO: Plant Ontology- plant structure
Examples of Triplets
- (HAP1 , pollen tube guidance, sperm cell)
- (SEP1, DNA binding, carpel)
- (PFS2, petal morphogenesis, stamen)
- (AP1, protein binding, shoot apex)
- (PHOT1, vacuole, cauline leaf)
Photomorphogenesis Genes
http://dbserv2.informatik.uni-leipzig.de:8080/dsggs/?analysis
http://pattaran.umiacs.umd.edu
Flowering Time Genes
http://dbserv2.informatik.uni-leipzig.de:8080/dsggs/?analysis
Photosynthesis Genes
http://dbserv2.informatik.uni-leipzig.de:8080/dsggs/?analysis
Example of imprints for
triplets
(AG, sequence- specific DNA binding transcription factor, stamen)

AG encodes a transcription factor of the MADS-box family that is
expressed in stamen and carpel primordia.

The MADS-box transcription factor AGAMOUS (AG) is an important
regulator of stamen and fruit identity as well as floral meristem
determinacy in a number of core eudicots and monocots.

The Arabidopsis homeotic gene AGAMOUS (AG) is necessary for the
specification of reproductive organs (stamens and carpels) during the early
steps of flower development.

The floral homeotic C function gene AGAMOUS (AG) confers stamen and
carpel identity and is involved in the regulation of floral meristem
termination in Arabidopsis.
Example of imprints for
doublets – Padmini – please
provide
some
examples
(AG, sequence- specific
DNA binding
transcription factor, stamen)

AG encodes a transcription factor of the MADS-box family that is
expressed in stamen and carpel primordia.

The MADS-box transcription factor AGAMOUS (AG) is an important
regulator of stamen and fruit identity as well as floral meristem
determinacy in a number of core eudicots and monocots.

The Arabidopsis homeotic gene AGAMOUS (AG) is necessary for the
specification of reproductive organs (stamens and carpels) during the early
steps of flower development.

The floral homeotic C function gene AGAMOUS (AG) confers stamen and
carpel identity and is involved in the regulation of floral meristem
termination in Arabidopsis.
What Mahmuda did:
 Read scientific articles.
 Retrieved imprint sentences for about 136
triplets and doublets.
 Participated in an experiment to determine
the effectiveness of Manjal (automated)
retrieval of the imprint sentences.
Thanks for listening 