Download Metagenomics and Metrics on Spaces of Probability Measures

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Helitron (biology) wikipedia , lookup

Transcript
The University of Chicago
Department of Statistics
Statistics Colloquium
STEVEN EVANS
Department of Statistics
University of California, Berkeley
Metagenomics and Metrics on Spaces of Probability
Measures
MONDAY, May 14, 2012, at 4:00 PM
133 Eckhart Hall, 5734 S. University Avenue
Refreshments following the seminar in Eckhart 110.
ABSTRACT
Metagenomics attempts to sample and study all the genetic material present in a community of micro-organisms in environments that range from the human gut to the open
ocean. This enterprise is made possible by high-throughput pyrosequencing technologies
that produce a “soup” of DNA fragments which are not a priori associated with particular
organisms or with particular locations on the genome. Statistical methods can be used to
assign these fragments to locations on a reference phylogenetic tree using pre-existing information about the genomes of previously identified species. Each metagenomic sample thus
results in a cloud of points on the reference tree. In seeking to answer questions such as
what distinguishes the vaginal microbiomes of women with bacterial vaginosis from those
of woment who don’t, one is led to consider statistical methods for distinguishing between
such clouds. I will discuss joint work with Erick Matsen from the Fred Hutchinson Cancer
Research Center in which we use ideas based on distances between probability measures that
go back to Gaspard Monge’s 1781 “Mémoir sur la théorie des déblais et des remblais” as well
as some familiar objects (e.g. reproducing kernel Hilbert spaces) from the world of Gaussian
processes.
For further information and inquiries about building access for persons with disabilities, please
contact Dan Moreau at 773.702.8333 or send him an email at [email protected]. If
you wish to subscribe to our email list, please visit the following website:
https://lists.uchicago.edu/web/arc/statseminars.
Related documents
Geometric and Topological Methods for Data Science
Geometric and Topological Methods for Data Science
On Coincidences
On Coincidences
Introduction to the Holonomic Gradient Method in Statistics
Introduction to the Holonomic Gradient Method in Statistics
Large Deviations with Applications to Random Matrices and Random Graphs
Large Deviations with Applications to Random Matrices and Random Graphs
A Phase Transition in Molecular Evolution with Applications to the Assembly of the Tree of Life
A Phase Transition in Molecular Evolution with Applications to the Assembly of the Tree of Life
Efficient Approximation, Error Estimation, and Adaptive Computation for Randomly Perturbed Elliptic Problems
Efficient Approximation, Error Estimation, and Adaptive Computation for Randomly Perturbed Elliptic Problems
Inference for Partially Identified Econometrics
Inference for Partially Identified Econometrics
Computational Complexity of Two and Multistage Stochastic Programming Problems
Computational Complexity of Two and Multistage Stochastic Programming Problems
Plug and Play Inference for Stochastic Dynamical Systems
Plug and Play Inference for Stochastic Dynamical Systems
Three Essays in Mathematical Finance
Three Essays in Mathematical Finance
A Universal Sampling Formula in Population Genetics
A Universal Sampling Formula in Population Genetics
Point Pattern Modeling for Degraded Presence-Only Data Over Large Regions
Point Pattern Modeling for Degraded Presence-Only Data Over Large Regions
On $ell_1$ Regularization and Sparsity in High Dimensions
On $ell_1$ Regularization and Sparsity in High Dimensions
Parameter Estimation for Linear Gaussian Covariance Models
Parameter Estimation for Linear Gaussian Covariance Models
Permutation-Based Methods for Assessing Significance in Genetic Association Studies with Binary Traits and Related Individuals
Permutation-Based Methods for Assessing Significance in Genetic Association Studies with Binary Traits and Related Individuals
Weighted Analysis of Olfactory Dysfunction Data
Weighted Analysis of Olfactory Dysfunction Data
A Moderate Deviation for Stationary Causal Processes
A Moderate Deviation for Stationary Causal Processes
Decoupled Shrinkage and Selection: A New Tool for Posterior Summaries in Linear Regression
Decoupled Shrinkage and Selection: A New Tool for Posterior Summaries in Linear Regression
Efficient Sampling Methods for Protein Structure Refinement
Efficient Sampling Methods for Protein Structure Refinement
An Empirical Analysis in Risk Management: Comparison of Portfolio Value at Risk with Different Approaches
An Empirical Analysis in Risk Management: Comparison of Portfolio Value at Risk with Different Approaches
Removing Unwanted Variation from High-Throughput Omic Data
Removing Unwanted Variation from High-Throughput Omic Data
Bayesian Curve Fitting and Neuron Firing Patterns
Bayesian Curve Fitting and Neuron Firing Patterns