Download Web Mining - CENG METU

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

URL redirection wikipedia , lookup

Transcript
Semantic Web and Web
Mining:
Networking with Industry and
Academia
İsmail Hakkı Toroslu
IST EVENT 2006
WWW Related Fields
Browsers and User Interfaces: Web user interfaces will
need to become richer and more interactive.
Pervasive Web and Mobility: In near future most of the
devices accessing the Web will be mobile.
Search: Search tools help users to explore and access
complex and unstructured information on the Web
(keyword and link-based search paradigms).
WWW Related Fields
Security and Privacy:
Web Engineering: Systematic approaches are needed
in order to develop high-quality Web-based systems and
applications.
XML and Web Data: XML has become the language of
the Web and XML technologies have become the basis
for many Web-based applications. XML is widely used
for data exchange and to publish data from database
systems to the Web by providing input to content
generators for Web pages.
WWW Related Fields
Semantic Web: .The main idea of the Semantic Web is
to extend the current human-readable web by encoding
the semantics of web-resources in a machineinterpretable form. This is needed in order to be able to
automatically integrate data from different sources, to
perform actions on behalf of the user, and to search for
information based on its meaning rather than its
syntactic form.
Web Services: Web Services technology has become a
widely-used solution for intra- and inter-enterprise
application integration, including e-commerce
WWW Related Fields
Web Mining: Research in web data mining aims to
develop techniques to derive knowledge from data
published in web-accessible formats. Due to
heterogeneity and lack of structure in web data,
automated discovery of targeted or unexpected
knowledge is a challenging task. Related fields are: data
mining, machine learning, natural language processing,
statistics, databases, and information retrieval.
Search
Search engine design and architecture
Basic search engine infrastructure: crawling,
indexing, and query processing
Personalized search - location, context and
activity-aware search
Query and search-user modeling
Search interfaces, natural language interfaces to
search, summarization, post processing tools
and feedback
Search-motivated characterizations of the web
Meta-search and rank aggregation
Web services
Service contract and metadata
Orchestration, choreography and composition of
services
Large scale XML data integration
Tools and technologies for Web Services
development, deployment and management
Software methodologies for Service-Oriented
Systems
Using formal methods on Web Services
XML query processing and data management
Models and query languages for Web data
Semantic Web
Ontologies and representation languages
Semantic annotation and metadata
Semantic brokering, integration and
interoperability
Semantic search and retrieval
Semantic web services
Semantic web mining, ontology learning
Web applications that exploit semantics
Web Mining
Classification or clustering of web data
Mining web content and link structure
Web log mining and web traffic analysis
Building user profiles and providing
recommendations
Change detection and monitoring methods for
web data
Entity and relationship extraction and
disambiguation
Privacy issues in web mining
Data integration and data cleaning
Integrating linguistic and domain knowledge in
web mining
FP7 ICT Workprogramme ?
Challenge 1: Pervasive and Trusted Network
and Service Infrastructures
The target infrastructures has to support interoperable devices
and services, a variety of content formats and a multiplicity of
delivery modes and support context awareness and the dynamic
behavior needed for applications with time and context varying
requirements.
– Objective 3.1.1.2: Service and Software Architectures,
Infrastructures and Engineering (140 M€, Call 1)
FP7 ICT Workprogramme ?
Challenge 4: Digital Libraries and Content
content should be made available in digital form through
digital libraries and its long term accessibility and usability
must be ensured.
we need more effective technologies for intelligent content
creation and management, and for supporting the capture of
knowledge and its sharing and reuse.
individuals and organizations have to find new ways to
acquire and exploit knowledge, and thereby learn.
– Objective 3.4.2.1: Intelligent Content Creation and Management
(101 M€, Call 2)
FP7 ICT Workprogramme ?
Objective 3.1.1.2: Service and Software
Architectures, Infrastructures and Engineering
Service architectures, technologies, methods and tools that
enable service discovery, advertising and dynamic composition.
Service/software engineering approaches and tools for
dynamically composed systems
Strategies and technologies enabling mastery of complexity,
dependability, and behavioral stability in complex systems and
in systems evolving over time without central design.
FP7 ICT Workprogramme ?
Objective 3.1.1.2: Service and Software
Architectures, Infrastructures and Engineering
Virtualization tools, system software and network-centric
operating systems that orchestrate unlimited, heterogeneous and
dynamic resources distributed across multiple platforms as a
single entity, and provide platform-independent access and
sharing of knowledge, processing, communication, storage and
content. They also enable the definition and execution of tasks and
workflows for collaboration and operation across multiple domains
and optimize usage of distributed resources.
Enabling the integration of dynamic service architectures in
the “networked enterprise”, catering for enterprise
interoperability, collaboration, highly distributed operations,
reduced life cycle cost.
FP7 ICT Workprogramme ?
Objective 3.4.2.1: Intelligent Content Creation
and Management
Advanced authoring environments for the creation of novel forms
of interactive and expressive content encouraging multimodal
experimentation and exploration of the design space.
Collaborative workflow environments to manage the lifecycle of
media and enterprise content from the acquisition of reference
materials to the versioning, packaging and repurposing of complex
products.
FP7 ICT Workprogramme ?
Objective 3.4.2.1: Intelligent Content Creation
and Management
Architectures and technologies for personalized distribution,
presentation and consumption of self-adaptive content that
detect and exploit emergent ambient intelligence and take full
advantage of the intelligence built into content objects and
rendering equipment in terms of dynamic device adaptation,
contextual support of user goals, and cultural or linguistic
preferences.
Empirical studies on how to foster user-produced content. Privacy
preserving algorithms for mining both social and human-device
interactions.
FP7 ICT Workprogramme ?
Objective 3.4.2.1: Intelligent Content Creation
and Management
Semantic foundations: probabilistic, temporal and modal
modeling and reasoning through objective-driven research moving
beyond current knowledge representation formalisms.
Advanced knowledge management systems capable of
extracting meaning and structure from an analysis of structured
and unstructured information and work patterns, and of making
that structure available for activities ranging from document search
to decision making. Such systems will exploit semantics
embedded in multimedia objects, data streams and ICT-based
processes, and rely on formal policies to manage user access to
knowledge resources, thus supporting the dynamic formation of
virtual organizations. Advances delivered through research will be
embedded within end-to-end systems using computer-tractable
knowledge in support of data and application integration,
automation of business processes, automated diagnosis and
problem-solving in a variety of knowledge-intensive domains.