Download Towards Semantic Web Mining

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the work of artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Towards Semantic Web Mining
Bettina Berndt
Andreas Hotho
Gerd Stumme
Semantic Web Mining



Combination of Semantic Web and Web
Mining
Improve Web Mining using Semantic
Web
Improve Semantic Web using Web
Mining
Overview






Web Mining
Extracting Semantics from the Web
Exploiting Semantics for Web Mining
Mining the Semantic Web
Closing the Loop
Conclusion/Assessment
Web Mining
•
•
•
Discovers Local and Global Structure
Structured Data
Goals
•
•
•
•
Improvement of site design
Generate dynamic recommendations
Improve marketing
Main Areas
•
•
•
Web Content Mining
Web Structure Mining
Web Usage Mining
Content Mining






Type of Text Mining
Uses Tags
Detect co-occurrences
Event detection
Reconstruction of page content
Relations in a domain
Web Structure Mining

WebPages as a whole



Uses hyperlinks
Identify relevance
Single Pages

Five types of Web Pages





Head Pages
Navigation Pages
Content Pages
Look up Pages
Personal Pages
Web Usage Mining



Request by Visitors
Additional Structure
Unintended Relationships
Web Mining
Disadvantages of Web Mining

Content/Structure





False positives
Unused
Human understandable
Large amount of data
Usage



Usage tracked by urls
General concepts
Multiplicity of events and urls
TOP
cooperateswith(X,Y)
NAME
cooperateswith(Y,X)
TITLE
PERSON
PROJECT
COOPERATES-WITH
Ontology
WORKS-IN
RESEARCHER
Semantic
Web Mining
Andreas
Hotho
WORKS-IN
DAMLPROJ
URI-SWMining
URI-AHO
Relational
Metadata
WORKS-IN
COOPERATES WITH
URI-GST
WWW
Outline






Web Mining
Extracting Semantics from the Web
Exploiting Semantics for Web Mining
Mining the Semantic Web
Closing the Loop
Conclusion/Assessment
Extracting Semantics

Ontology Learning


Learn structures of Ontologies
Instance Learning

Populates the Ontologies
Extracting Semantics

Ontology Learning


Semi-automatic approach
Merging



FCA-Merge
TITANIC
Instance Learning

Information Extraction
Outline






Web Mining
Extracting Semantics from the Web
Exploiting Semantics for Web Mining
Mining the Semantic Web
Closing the Loop
Conclusion/Assessment
How can the semantic web help
with web mining ?
Web Content/Structure Mining

Content Mining




Preprocess the input data
Apply heuristics
Creates a cluster
Web Structure Mining



Page Rank
Keyword Analysis
CLEVER
Conceptual Clustering of Emails (and Bookmarks)
using IE and Formal Concept Analysis for supporting navigation and retrieval.
Web Usage Mining

Goal


Better understand user’s tendencies
Problem


Dynamic pages
How to take advantage of this?



Generate queries
Create usage paths
Classification scheme
Advantages


Structured Model
Improve queries



Analyze single pages
Analyze ontologies
Users history
Outline






Web Mining
Extracting Semantics from the Web
Exploiting Semantics for Web Mining
Mining the Semantic Web
Closing the Loop
Conclusion/Assessment
How can web mining help
build the semantic web?
Semantic Web/Structure
Mining


Intertwined
Relational Data Mining



Looks for patterns
Classification, regression, clustering and
associations
Challenges


Scalability
Distributed
Semantic Web Usage Mining

Goal



Requested page = ontology entity
Log files
Advantages



Understand search strategies
Improve navigation design
Personalize
Outline






Web Mining
Extracting Semantics from the Web
Exploiting Semantics for Web Mining
Mining the Semantic Web
Closing the Loop
Conclusion/Assessment
Mining to Learn Ontologies

Establish a concept hierarchy


OntEx
Determine Association rules

Discover combinations of concepts
Mining to Learn Ontologies
Filling the Ontologies
Use Ontology to Mine
Conclusion/Assesment
•
•
•
•
•
Semantic Structures in the Web can
help Web mining
Web Mining can build the Semantic Web
Combine the two together
Different Idea
Combination of Products
Questions?