Keyword searching and browsing in databases using BANKS

... “hubs” which are connected to a large numbers of nodes. For example, in a university database a department with a large number of faculty and students would act as a hub. As a result, many nodes would be within a short distance of many other nodes, reducing the effectiveness of proximitybased scorin ...

Exploiting Frequent Episodes in Weighted Suffix Tree to Improve

... Whereas the decision engine must response as soon as possible if it detects any deviation of the programs so as to improve the whole security of computer systems. Generally speaking, an IDS wishes to have all possible normal usages of a program to improve the ability of identifying normal and abnorm ...

Privacy-by-design in big data analytics and social mining

... data analytics and privacy are not necessary enemies. The purpose of this paper is precisely to show that many practical and impactful services based on big data analytics can be designed in such a way that the quality of results can coexist with high protection of personal data. The magic word is p ...

Forecasting with Big Data - Bournemouth University Research Online

... surprising that this increasing availability of data is causing anxiety, and this is evident through the sound example presented by Smolan and Erwitt (2012) where the authors state that in the modern age, we generate 70 times the information stored in the library of congress simply within the first ...

Providing Privacy and Security for Cloud Data Using Data Mining

... Many enterprises use database audit and protection (DAP) and Security Information and Event Management (SIEM) solutions to gather together information about what is happening. But monitoring and event correlation alone do not translate into data security. At a time when regulation and compliance iss ...

Data Mining: Concepts and Techniques

Generalization-based Privacy Preservation and

Application of Temporal Descriptors to Musical Instrument Sound

Barna Saha - WordPress.com

... Google Faculty Research Award, 2016 Yahoo! Academic Career Enhancement Award, 2015 (Yearly given to top-5 young faculty doing Yahoo relevant research) Simons Research Fellowship, 2015, The Simons Institute, University of California Berkeley. NSF CISE Research Initiation Initiative (CRII) Award, 2015 ...

Searching and Mining Trillions of Time Series Subsequences

Cluster Analysis

Proceedings Template - WORD

... outlier detection for stream data more challenging than that for regular (non-stream) data. This paper discusses those research issues for applications where data come from a single stream as well as multiple streams. ...

arXiv:cs.DB/0112011 v2 5 Feb 2003

... itemset in potential bodies and heads. This can be done in a similar level-wise manner as in phase 1, based on the observation that if a head-set represents a confident rule for that itemset, then all of its subsets also represent confident rules [24]. For example, if the itemset {1, 2, 3, 4} is a f ...

Designing a Data Warehouse: Prerequisites

A Class Imbalance Learning Approach to Fraud

Performance Evaluation of Partition and Hierarchical Clustering

... [1]. Proteins are important molecules composed of amino acids and arranged in a linear chain. They perform all necessary functions and participate in all processes within and between cells. Each protein has unique structure and functions. Protein sequences are represented by combination of alphabets ...

Building Semantic Kernels for Text Classification using Wikipedia

Web Mining (網路探勘)

... • Using machine learning to generate extraction rules. – The user marks the target items in a few training pages. – The system learns extraction rules from these pages. – The rules are applied to extract items from other pages. ...

Efficient Density-Based Clustering of Complex Objects

A REVIEW ON SPATIAL DATA MINING METHODS AND

... can be used for spatial data presentation and statistical analysis besides helping in making well informed decisions. Some of the techniques useful for GRIS include spatial analysis, induction, classification and clustering, trend or spatial characteristic analysis, pattern recognition and digital m ...

7. B.Tech. IT R15 Regulations 3rd and 4th Year Course Structure

...  To know the importance of the complexity of a given algorithm.  To study various algorithm design techniques.  To utilize data structures and/or algorithmic design techniques in solving new problems.  To know and understand basic computability concepts and the complexity classes P, NP, and NP-C ...

Outlier-based Health Insurance Fraud Detection for US Medicaid Data

... normally consist of some outliers based on statistical deviation, just by chance, which cannot be filtered within a single metric. Only when fraudulent providers will take a more deviant position in the group of outliers, normal providers may shift to the non-outlying group, leaving the ‘bad guys’ s ...

Periodicity Detection in Time Series Databases

PPT - Donald Bren School of Information and Computer Sciences

... • There are heuristics to try to infer the true actions of the user: – Path completion (Cooley et al. 1999) • e.g. If known B -> F and not C -> F, then session ABCF can be interpreted as ABCBF • Anderson et al. 2001 for more heuristics ...

< 1 ... 81 82 83 84 85 86 87 88 89 ... 505 >

Nonlinear dimensionality reduction

High-dimensional data, meaning data that requires more than two or three dimensions to represent, can be difficult to interpret. One approach to simplification is to assume that the data of interest lie on an embedded non-linear manifold within the higher-dimensional space. If the manifold is of low enough dimension, the data can be visualised in the low-dimensional space.Below is a summary of some of the important algorithms from the history of manifold learning and nonlinear dimensionality reduction (NLDR). Many of these non-linear dimensionality reduction methods are related to the linear methods listed below. Non-linear methods can be broadly classified into two groups: those that provide a mapping (either from the high-dimensional space to the low-dimensional embedding or vice versa), and those that just give a visualisation. In the context of machine learning, mapping methods may be viewed as a preliminary feature extraction step, after which pattern recognition algorithms are applied. Typically those that just give a visualisation are based on proximity data – that is, distance measurements.

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Top subcategories

Nonlinear dimensionality reduction