International Journal on Advanced Computer Theory and

... Is the extension of DBSCAN from three different aspects as suggested by authors, as: i) it can cluster spatial –temporal data according to its non-spatial, spatial and temporal attributes. ii) detection of noise in case of varying density can be achieved by density factor, assigned to each cluster i ...

Spatial Data Mining CSE 6331, Fall 1999

Mining Patterns from Protein Structures

Steven F. Ashby Center for Applied Scientific Computing

Visual Data Mining

... visualised by tree-like graphs. Furthermore, the user is enabled to select subspaces of distance space and select new distance metrics for them. This allows dealing with multiple similarity judgements in one retrieval process. The proposed components for visual data mining will be implemented in the ...

Combining Multiple Clusterings Using Evidence Accumulation

... hyper-edges in the hyper-graph representation. Finally, CSPA uses a pair-wise similarity, as defined by Fred and Jain [55], and the final data partition is obtained by applying the METIS algorithm of Karypis and Kumar to the induced similarity measure between patterns. In this paper we further explo ...

A K-means-like Algorithm for K-medoids Clustering and Its

The Discovery by Data Mining of Rogue Equipment

... A). Semiconductor processes require very complex phenomena such as: (a) ion implantation (ION) where dopant ions accelerated to KeV (or higher) energies become embedded with tightly controlled dosages and depths in well-defined areas of semiconductor materials; (b) reactive ion etching (RIE) where c ...

Clustering of time series data—a survey

... if xk = vi Otherwise, set j k = 1 (0) if j = ( =)i. (4) Compute = U (l+1) − U (l) . If > , increment l and go to Step 2. If , stop. This group of algorithms works better with time series of equal length because the concept of cluster centers becomes unclear when the same cluster conta ...

Print this article

Dynamic Cluster Formation using Level Set Methods ∗

... Recent computer, internet and hardware advances produce massive data which are accumulated rapidly. Applications include sky surveys, genomics, remote sensing, pharmacy, network security and web analysis. Undoubtedly, knowledge acquisition and discovery from such data become an important issue. One ...

Example - Databases and Information Systems Group

Regional Co-locations of Arbitrary Shapes

Programming Large Dynamic Data Structures on a

... multicore machines. Users can overcome this limitation by treating each processor core as a node in the DSM system and run multiple parallel computation tasks on each machine. However, due to the limited network resources on each machine, the performance may not scale well with the number of cores u ...

Automatically Detecting Avalanche Events in Passive Seismic

... used an OCSVM to detect anomalous network intrusions. Speciﬁcally, we trained and tested an OCSVM using twofold cross-validation on data from the entire season that was processed using the feature extraction procedure discussed in Section III. Though the results of the OCSVM (Table I) were better th ...

Efficient Analysis of Frequent itemset Association Rule Mining

... Apriori is designed to operate on databases containing transactions. Each transaction contains set of items called itemset. It is a decisive algorithm, which uses an repetitive approach known as a level-wise search, where k-itemsets are used to explore (k+1)-itemsets. This set contains items that sa ...

Rule-Based Classifier

Mining Social Media Data - Some Lessons

Adding Knowledge Extracted by Association Rules - SEER-UFMG

... In the beginning, relational database management systems (RDBMSs) supported only numbers and small character strings, called traditional data, which can be compared using exact matching (= and 6=) and relational (<, ≤, > and ≥) operators, also called traditional operators. Traditional data can be mi ...

Data Mining for Description and Prediction of Antibiotic

... Healthcare-associated infections is the most common healthcare related injury and affect almost every tenth patient. With the purpose of reducing these infections Infektionsverktyget was developed for registration and feedback of infection data. The tool is now used in all Swedish county councils re ...

K - Department of Computer Science

data mining - Amazon Web Services

... behavior, assess risk, determine associations, or do other types of analysis. The models used for automated data analysis can be based on patterns (from data mining or discovered by other methods) or subject based, which start with a specific known subject. There are a number of common misconception ...

Data Mining

... Source: Kenneth C. Laudon & Jane P. Laudon (2014), Management Information Systems: Managing the Digital Firm, Thirteenth Edition, Pearson. ...

Enhanced Web Mining Technique To Clean Web Log File

... columns and rows where columns represent attributes and rows represent tuples. Each column contains one or more data categories while each row contains a unique instance of data for the categories defined by the columns. Relational data can be accessed by database query language such as SQL query. D ...

Theorem 1

... The equation has a single unknown and a single root x’i,1 ...

< 1 ... 79 80 81 82 83 84 85 86 87 ... 505 >

Nonlinear dimensionality reduction

High-dimensional data, meaning data that requires more than two or three dimensions to represent, can be difficult to interpret. One approach to simplification is to assume that the data of interest lie on an embedded non-linear manifold within the higher-dimensional space. If the manifold is of low enough dimension, the data can be visualised in the low-dimensional space.Below is a summary of some of the important algorithms from the history of manifold learning and nonlinear dimensionality reduction (NLDR). Many of these non-linear dimensionality reduction methods are related to the linear methods listed below. Non-linear methods can be broadly classified into two groups: those that provide a mapping (either from the high-dimensional space to the low-dimensional embedding or vice versa), and those that just give a visualisation. In the context of machine learning, mapping methods may be viewed as a preliminary feature extraction step, after which pattern recognition algorithms are applied. Typically those that just give a visualisation are based on proximity data – that is, distance measurements.

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Nonlinear dimensionality reduction