* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Hunting disease provoking genes using thousands of computers
Ridge (biology) wikipedia , lookup
Epigenetics of human development wikipedia , lookup
Medical genetics wikipedia , lookup
Genome (book) wikipedia , lookup
Minimal genome wikipedia , lookup
Gene expression profiling wikipedia , lookup
Public health genomics wikipedia , lookup
Heritability of IQ wikipedia , lookup
Hunting disease provoking genes using thousands of computers Abstract: Linkage analysis is a tool used by geneticists for mapping disease-susceptibility genes in the study of Mendelian and complex diseases. However analyses of large inbred pedigrees with extensive missing data are often beyond the capabilities of a single computer. We present a distributed system called Superlink-online for computing multipoint LOD scores of large inbred pedigrees. It achieves high performance via efficient parallelization of the algorithms in Superlink, a state-of-the-art serial program for these tasks, and through utilization of thousands of resources residing in multiple opportunistic grid environments. Notably, the system is available online, which allows computationally intensive analyses to be performed with no need for either installation of software, or maintenance of a complicated distributed environment. The main algorithmic challenges have been to efficiently split large tasks for distributed execution in highly dynamic non-dedicated running environment, as well as to utilize resources in all the available grid environments, providing nearly interactive response time for shorter tasks while simultaneously serving massively parallel ones. The system is being extensively used by medical centers worldwide, achieving speedups of up to three orders of magnitude and allowing analyses which have not been feasible previously. This work has been done as a part of PhD studies in the Technion under joint supervision of Prof. Assaf Schuster and Dan Geiger. It has been published in American Journal of Human Genetics and presented at High Performance Distributed Computing conference in 2006.