General Database Statistics Using Entropy Maximization

Clustering178winter07

... • In supervised learning we were given attributes & targets (e.g. class labels). In unsupervised learning we are only given attributes. ...

Equational reasoning for conditioning as disintegration

... probability (such as setting a continuous variable to an observed value). This popularity contrasts with the scary pitfalls (such as Borel’s paradox) that beset rigorous treatments of conditioning. In general, conditional expectations may arise that do not correspond to any conditional distribution ...

- Catalyst

... the number of other stations within 0.75km. The data set was then narrowed by combining similar predictors into logical groups (food, nightlife, health services, tourism, etc.) and removing variables that were empty, duplicative, ambiguous, or with limited number of observations. Next the data was p ...

Analysis and Enhancement of Process Model Using

Statistical and Machine-Learning Data Mining

... experienced problems in predictive modeling and analysis of big data. The common theme among these essays is to address each methodology and assign its application to a specific type of problem. To better ground the reader, I spend considerable time discussing the basic methodologies of predictive m ...

IOSR Journal of Computer Engineering (IOSR-JCE)

analyse input data

Sparrow2011

Optimal Sample Size for Multiple Testing: the Case of Gene

... nature central to our discussion, was formalized within a Bayesian framework as early as 1961 through the work of Raiffa and Schlaifer (1961). (See also Lindley, 1997 or Adcock, 1997 and references therein for discussions of sample size determination.) Following this paradigm, we present a general d ...

Secure Bayesian Model Averaging for Horizontally Partitioned Data

meta-learning architecture for knowledge representation and

... NIPS 2003 Challenge in Feature Selection [7, 6] or WCCI Performance Prediction Challenge [8] in 2006. The competitions results are an evidence that in real applications, optimal solutions are often complex models and require atypical ways of learning. Problem complexity is even more clear when solvi ...

Change-Point Detection in Time-Series Data by Direct Density

spatio-temporal structures characterization based on multi

... a method based on Multivariate Information Bottleneck in order to estimate the optimal number of clusters and characterize spatio-temporal structures. In order to detect or recognize spatio-temporal patterns, it is essential to characterize information in a low-dimensional space. Features are extrac ...

Seismic Hazard Bayesian Estimates in Circum

... The theory of Bayesian probability expresses the formulation of the inferences from data straightforward and allows the solution of problems which otherwise would be intractable. Assuming the Poisson model, BENJAMIN (1968) was the ®rst to deal with the Bayesian approach to investigate the problem of ...

Finding Behavior Patterns from Temporal Data using

clinical decision support for heart disease using predictive models

Mining Noisy Data Streams via a Discriminative Model

Dependent Species Sampling Models for Spatial Density Estimation

Modelling of essential fish habitat based on remote sensing

A Bayesian Model for Supervised Clustering with the Dirichlet Process Prior

... known to solve them. The primary disadvantages of these approaches are the largely adhoc connection between the classifier and the clustering algorithm, the necessity of training over O(n2 ) data points, and the potential difficulty of performing unbiased cross-validation to estimate hyperparameters ...

Statistical Inference, Multiple Comparisons, Random Field Theory

... – Our model is shown to be a sub-optimal in the bound restriction – In traditional SNB , there is no evidence that show it is optimal or suboptimal ...

A Bayesian Model for Supervised Clustering with the Dirichlet

... sifier and the clustering algorithm, the necessity of training over O (n2 ) data points, and the potential difficulty of performing unbiased cross-validation to estimate hyperparameters. The first issue, the ad-hoc connection, makes it difficult to make state precise statements about performance. Th ...

A Characterization of Interventional Distributions in Semi

... over X, Y with an index t. For this set of distributions to be induced by some underlying causal BN such that each Pt (x, y) corresponds to the distribution of X, Y under the intervention do(T = t) to the causal BN, they have to satisfy some norms of coherence. For example, it must be true that Px0 ...

Evolution Strategies assisted by Gaussian Processes with improved

< 1 ... 31 32 33 34 35 36 37 38 39 ... 58 >

Mixture model

In statistics, a mixture model is a probabilistic model for representing the presence of subpopulations within an overall population, without requiring that an observed data set should identify the sub-population to which an individual observation belongs. Formally a mixture model corresponds to the mixture distribution that represents the probability distribution of observations in the overall population. However, while problems associated with ""mixture distributions"" relate to deriving the properties of the overall population from those of the sub-populations, ""mixture models"" are used to make statistical inferences about the properties of the sub-populations given only observations on the pooled population, without sub-population identity information.Some ways of implementing mixture models involve steps that attribute postulated sub-population-identities to individual observations (or weights towards such sub-populations), in which case these can be regarded as types of unsupervised learning or clustering procedures. However not all inference procedures involve such steps.Mixture models should not be confused with models for compositional data, i.e., data whose components are constrained to sum to a constant value (1, 100%, etc.). However, compositional models can be thought of as mixture models, where members of the population are sampled at random. Conversely, mixture models can be thought of as compositional models, where the total size of the population has been normalized to 1.

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Mixture model