Clustering Binary Data with Bernoulli Mixture Models

... ply becomes a matter of using Bayes’ rule to classify data as belonging to the mixture components most likely to have produced them. Parameter estimation in mixture models is not cheap, however. In the first recorded use of mixture models, Pearson (1894) sought method of moments estimators for a mi ...

Mining Incomplete Data with Many Missing Attribute Values

... set approaches to missing attribute values. Probabilistic approaches are based on imputation, a missing attribute value is replaced either by the most probable known attribute value or by the most probable attribute value restricted to a concept. In this paper, in a rough set approach to missing att ...

The Future of the Application of Artificial Intelligence Methods to

... --------CULTURE-1-------7) From what site was the specimen for CULTURE-1 taken? BLOOD 8) Please give the date when this culture was obtained. 5/9/75 The first significant organism from this blood culture will be called: ...

Data Mining in PHM

... data • noisy: contain errors or outliers • inconsistent: contain discrepancies in codes or names • Lack of quality data  results generated: inconsistent, lack robustness, do not contribute to knowledge gain! • Quality decisions must be based on quality data • Data warehouse needs consistent integra ...

A Partitioned Fuzzy ARTMAP Implementation for Fast Processing of

... databases noise in the data may create over-fitting when we repeat the presentations of the input/output pairs, so a single pass over the training set may be preferable, this situation also happened when we do online training of the network with an unlimited data source. In the performance phase of ...

Mining Astronomical Databases

... in astronomical databases, usually by looking for populations they expect to exist. The promise of applying clustering algorithms to astronomical databases lies in the application of precisely defined criteria in identifying populations, criteria that are not subject to psychological or physiologica ...

Improved Gaussian Mixture Density Estimates Using Bayesian

... has been applied previously to regression and classification tasks ([PC93]) . There are several different variants on the simple averaging idea. First, one may train all networks on the complete set of training data. The only source of disagreement between the individual predictions consists in diff ...

IoT and Machine Learning

... assign labels to new unlabelled pieces of data. This can be thought of as a discrimination problem, modelling the differences or similarities between groups. •Regression: Data is labelled with a real value rather than a label. Examples that are easy to understand are time series data like the price ...

IOSR Journal of Computer Engineering (IOSRJCE)

... Air is the element of Ai at row r ...

agent based frameworks for distributed association rule mining

... rules, which would aid in making decisions about local branches, would be lost. In such cases organization may miss out certain rules that were prominent in certain branches and were not found in other branches. The frequent patterns in distributed databases are divided into three classes [32]: (a) ...

Hubs in Nearest-Neighbor Graphs: Origins, Applications and

... - Point 2 standard deviations closer: E(||X||) – 2·Std(||X||) (full lines) = Noncentral Chi distribution with d degrees of freedom ...

Oriented k-windows: A PCA driven clustering method

... axes. We next consider the possibility of allowing the hyperrectangles adapt both their orientation and size, as means to more effective cluster discovery. Let us assume, for example, that k d-dimensional hyperrectangles of the UkW algorithm have been initialized, as in function DetermineInitialWind ...

Advances in Environmental Biology Abouzar Qorbani,

... suggested method by these researchers in detecting diabetes, breast cancer, liver diseases and liver abnormalities are 86.13, 98.86, 91.24 and 86.13 respectively. Serter Uzer and Inan [10] used bee colony algorithm hybrid method and support vector algorithm to classify diabetes disease, breast cance ...

KEEL Data-Mining Software Tool: Data Set Repository, Integration of

... Artificial Intelligence. The main motivation for applying EAs to knowledge extraction tasks is that they are robust and adaptive search methods that perform a global search in place of candidate solutions (for instance, rules or other forms of knowledge representation). They have proven to be an imp ...

An Ensemble Method for Clustering

... thus ignoring the small cluster in the middle. To illustrate the functionality of the voting algorithm we present two figures. In Figure 1 we show a typical result of a k-means run (left), the result of combining 20 cluster results (middle) and the result of combining 50 runs (right). We can see tha ...

Genetic algorithms approach to feature discretization in artificial

... of categories to be discretized using these bits. The thresholds are not used if the searched thresholds are more than the maximum value of each feature. The upper limit of the number of categories is five and the lower limit is one. This number is automatically determined by the searching process o ...

this PDF file

... tourism micro-blog, which enables the user to conveniently specify analyzing conditions. We first give an overview of the approach and then explicate its algorithms. Overview of our Approach The main idea of the analyzing system is that the system creates a interface for the users to browse and lets ...

Statistical Anomaly Detection Technique for Real Time

... point have similar density. If some neighbors of one's point can be found a single cluster, plus the other neighbors near each other another cluster and to discover the two clusters have different densities, then comparing the density of a given data point with all of that neighbors may lead to a wr ...

Incremental Ensemble Learning for Electricity Load Forecasting

... different subsets of available data. The heterogeneous learning process applies different types of models. The combination of homogeneous and heterogeneous approaches was also presented in the literature. The best known methods for homogeneous ensemble learning are bagging [6] and boosting [13]. The ...

1 - UCSD CSE

... we must first define a criterion for creating groups, and second, find an optimal grouping based on that criterion. The problem can be generalized as follows: Given a set N of n data points in d dimensional space, we must determine how to assign a set K of k points, called centers, in N so as to opt ...

Classification of Deforestation Factors Using Data Mining

... model in training data set to predict the class of future objects whose class label is not known [2][13]. There are lots of classification algorithms, for example, classification based on decision-tree, Bayesian classification based on statistics, classification based on neural network [4]. Geospati ...

KClustering

... we must first define a criterion for creating groups, and second, find an optimal grouping based on that criterion. The problem can be generalized as follows: Given a set N of n data points in d dimensional space, we must determine how to assign a set K of k points, called centers, in N so as to opt ...

On Attention Mechanisms for AGI Architectures: A Design Proposal

... AGI arrives on the scene. Natural attention is a cognitive function – or a set of them – that allow animals to focus their limited resources on relevant parts of the environment as they perform various tasks, while remaining reactive to unexpected events. Without it we could for example not stay ale ...

Cognitive Analytics: A Step Towards Tacit Knowledge?

... 2.3 OTHER ANALYTIC CONCEPTS As previously mentioned, this Advanced Analytics Taxonomy is currently being developed. In the interest of time (and space), this paper briefly describes two additional areas for consideration when assessing analytics for a KMS. System Element Topologies: The distributed ...

77

... support vector machine constructs a hyperplane or set of hyperplanes in a high- or infinite- dimensional space, which can be used for classification, regression, or other tasks. Intuitively, a good separation is achieved by the hyperplane that has the largest distance to the nearest training data po ...

< 1 2 3 4 5 6 7 8 9 10 ... 23 >

Data (Star Trek)

Lieutenant Commander Data (/ˈdeɪtə/ DAY-tə) is a character in the fictional Star Trek universe portrayed by actor Brent Spiner. He appears in the television series Star Trek: The Next Generation and the feature films Star Trek Generations, Star Trek: First Contact, Star Trek: Insurrection and Star Trek: Nemesis.An artificial intelligence and synthetic life form designed and built by Doctor Noonien Soong, Data is a self-aware, sapient, sentient, and anatomically fully functional android who serves as the second officer and chief operations officer aboard the Federation starships USS Enterprise-D and USS Enterprise-E. His positronic brain allows him impressive computational capabilities. Data experienced ongoing difficulties during the early years of his life with understanding various aspects of human behavior and was unable to feel emotion or understand certain human idiosyncrasies, inspiring him to strive for his own humanity. This goal eventually led to the addition of an ""emotion chip"", also created by Soong, to Data's positronic net. Although Data's endeavor to increase his humanity and desire for human emotional experience is a significant plot point (and source of humor) throughout the series, he consistently shows a nuanced sense of wisdom, sensitivity, and curiosity, garnering immense respect from his peers and colleagues.Data is in many ways a successor to the original Star Trek‍ 's Spock (Leonard Nimoy), in that the character offers an ""outsider's"" perspective on humanity, even briefly working with Spock in the two-part Next Generation episode, Unification.

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Data (Star Trek)