Download Solutions Big Data IBM

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Forecasting wikipedia , lookup

Data assimilation wikipedia , lookup

Transcript
Isabelle Claverie-Berge, IM Architect
13 03 2012
Solutions Big Data IBM
Information Management
© 2012 IBM Corporation
The BIG Data Challenge
•
•
•
•
Manage and benefit from massive and growing amounts of data
Handle uncertainty around format variability and velocity of data
Handle unstructured data
Exploit BIG Data in a timely and cost effective fashion
COLLECT
Collect
INTEGRATE
Integrat
e
MANAGE
Manage
ANALYZE
Analyze
Information Management
By 2015 the number of networked devices will
be double the entire global population. All
sensor data has uncertainty.
9000
8000
100
7000
6000
The total number of social media
accounts exceeds the entire global
population. This data is highly uncertain
in both its expression and content.
90
80
2000
1000
40
30
20
0 10
ing
Th
of
et
or
s
rn
ns
Data quality solutions exist for
enterprise data like customer,
product, and address data, but
this is only a fraction of the
total enterprise data.
te
50
(I n
3000
Se
4000 60
s)
5000 70
Aggregate Uncertainty %
Global Data Volume in Exabytes
By 2015, 80% of all available data will be uncertain
dia nd te
e
M a
xt)
l
io
cia , aud
o
S de o
P
VoI
(vi
Enterprise Data
2005
Multiple sources: IDC,Cisco
2010
2015
3
© 2012 IBM Corporation
Information Management
Uncertainty arises from many sources
Process Uncertainty
Data Uncertainty
Model Uncertainty
Processes contain
“randomness”
Data input is uncertain
All modeling is approximate
Actual
Spelling
Intended
Spelling Text Entry
? ?
?
Uncertain travel times
Fitting a curve to data
GPS Uncertainty
?
Testimony
?
?
{Paris Airport}
Ambiguity
Semiconductor yield
Contaminated?
Rumors
{John Smith, Dallas}
{John Smith, Kansas}
Conflicting Data
Forecasting a hurricane
(www.noaa.gov)
4
© 2012 IBM Corporation
Information Management
The fourth dimension of Big Data: Veracity – handling data in doubt
Volume
Velocity
Data at Rest
Data in Motion
Scale from terabytes
to petabytes (1K TBs)
to zettabytes (1B
TBs)
Streaming data,
milliseconds to
seconds to respond
Often time-sensitive,
streaming data and
large volume data
movement
Variety
Data in Many
Forms
Structured,
unstructured, text,
multimedia
Veracity*
Data in Doubt
Uncertainty due to
data inconsistency
& incompleteness,
ambiguities, latency,
deception, model
approximations
* Truthfulness, accuracy or precision, correctness
5
© 2012 IBM Corporation
Information Management
Big Data Technologies
Variety
InfoSphere BigInsights to
perform exploratory analytics
InfoSphere Streams to score
incoming data against analytic
models
Nontraditional/
internet
data
Volume
InfoSphere
Warehouse
Traditional
Data
Netezza
Reuse Warehouse
Analytic models
Velocity
At-Rest Data
In-Motion Data
© 2012 IBM Corporation
Platform Capability
3
Improved Relevance
4
Higher campaign profitability
D I G I T A L M E D I A | Page 7
All in one
place
Segmentation
Increased Targeting Precision
Clickstream
Transactions
Events
CRM
Support calls
Clustering
Scoring
Feature Selection
Associations
Matching
2
Single view of customer
Personalized message
Matching algorithms
Matrix computations
Single Value Decomp.
Optimization
1
Consolidation
Marketeer’s objectives
Forecasting
Predictive algorithms
Decision trees
Linear Regression
Information Management
Platform Capability
2
Single view of customer
Increased Targeting Precision
Improved Relevance
4
Higher campaign profitability
Confidential
D I G I T A L MIBM
ED
I A | Page 8
Clickstream
Transactions
Events
CRM
Support calls
All in one
place
Big Data
Clustering
Scoring
Feature Selection
Associations
Personalized message
Matching algorithms
Matrix computations
Single Value Decomp.
Matching
3
Segmentation
1
Consolidation
Marketeer’s objectives
Optimization
Complex Analytics
© 2011 IBM Corporation
Forecasting
Predictive algorithms
Decision trees
Linear Regression
IBM Offers a Comprehensive Set of
Solutions for BIG Data
Non-Traditional
Data
InfoSphere
Big Insights
Traditional
Data
Streams filters
incoming data
Streams reuses
Warehouse Analytic
models
Persistent Data
In-Motion Data
Information Management
Un nouveau mode d’exploration des données
Traditional Approach
Big Data Approach
Structured & Repeatable Analysis
Iterative & Exploratory Analysis
IT
Business Users
Delivers a platform to
enable creative
discovery
Determine what
question to ask
IT
Business
Structures the
data to answer
that question
Explores what
questions could be
asked
Monthly sales reports
Profitability analysis
Customer surveys
IBM Confidential
Brand sentiment
Product strategy
Maximum asset utilization
© 2011 IBM Corporation
Information Management – Big Data
Big Data : integration points
Rules / BPM
Client and Partner Solutions
IBM Big Data Solutions
iLog & Lombardi
Data Warehouse
InfoSphere
Warehouse
Statistics
Image/Video
Financial
Mining
Geospatial
Times Series
Acoustic
Mathematical
InfoSphere BigInsights
Productivity Tools & Optimization
Management
Provisioning
Workflow
Job
Scheduling
Job
Tracking
Data
Ingestion
Admin
Tools
Configuration
Manager
Activity
Monitor
Identity &
Access Mgmt
Data
Protection
Connectors
Applications
InfoSphere MDM
Database
DB2 & non-IBM
Content Analytics
ECM
Business Analytics
Blue Prints
Manageability
Workload
Management &
Optimization
Processes
InfoSphere Streams
Master Data Mgmt
INTEGRATION
Data
Big Data Enterprise Engines
IBM & non-IBM
Information Server
Text
Warehouse Appliances
Applications
Big Data Analytics
Cognos & SPSS
Marketing
Unica
Data Growth
Management
InfoSphere Optim
© 2012 IBM Corporation
Information Management
Typical IBM Big Data Solution Use Cases
Create context
(classification, text mining)
Unstructured
data
Analyze
Semi-structured
data
Parse, aggregate
Structured data
Analyze, report
IBM Confidential
© 2011 IBM Corporation
Company Confidential
Analyze, report
Active archival
Long running queries
Page 12
Information Management

Big Data Use Case
Customer Experience
(Call Center)
IBM Confidential
© 2011 IBM Corporation
Information Management
Business Value Hypotheses - Summary
Analyze Customer Survey Data to
Advocates vs. Antagonist
 Cost Efficiencies
Analyze Customer Complaint Data to
Understand Key Customer Issues
Understand Impact of Social Networks
on Customer Behavior & Influence
Analyze Agent-to-Agent Internal
Memos
 Performance
 Revenue Lift
 Regulatory Compliance
Analyze Customer Interaction
Notes (i.e. emails, etc.)
Analyze Policy & Procedures
IBM Confidential
© 2011 IBM Corporation
Information Management
Addressing the Customer Experience Issues
1
2
CSR Agent asks
customer questions
to understand
situation and
customer need
4
3
CSR Agent researches
question by consulting
policies & procedures,
looking up data,
consulting other agents,
or transfers the call to
another agent
CSR Agent spends
time clarifying
question based on
research and
providing best
possible answer.
CSR Agent ensures
customer is satisfied
& probes for other
needs before closing
inquiry
Search Highly Organized Knowledge, Better Predict
Customer Needs & Drive Differentiated Experience
Future State Knowledge Management Solution
Phase 1
Phase 2
Phase 3
Establish a searchable
knowledge base database
organized by need with
structured actions and
procedures for CSR to
follow
Watson: Incorporate
transactions and
interactions into generating
a list of potential reasons
for call and prioritize
potential next best actions
Watson 2: Automatically provide
suggestions for specific
customer inquiries, as well as
suggestion for customer
treatments based on transaction
& interaction history
IBM Confidential
© 2011 IBM Corporation
Information Management
Phase 1: Establish Knowledge Base
Systems of
Record
Content
Loaders
Analytics
Knowledge
Interface
Agent Assist
2
Call Center Logs
• Caller Identified
Voice
Transcriptions
Search
Engine
ETL
3
1
Other Customer
Interactions
MDM Interface
Watson
Big Insights
Analytics
Data &
Text
Analytics
Documented
Policies & Rules
Transactions &
Statements
1
2
3
Knowledge Base
Search & Retrieve
• Issue(s)
Researched
• Issue Selected
• Apply Treatment
• Cross or Up Sell
• Closure
Call Center
Knowledge Base
Rules
Engine
Call Center
IVR
Account / Product specifications, Customer Transactions and Interactions, Policies and Procedures are loaded into Big Insights and
relevant information context linkages are established in an indexed Knowledge Base (KB). The KB has natural language capabilities to link
issues with treatments.
A customer transfers from IVR to an Agent, and gets identified
A customized UI helps the agent access the Content search engine to research the customer’s issue, locate appropriate treatment and
apply it.
IBM Confidential
© 2011 IBM Corporation
Information Management
Phase 2: Automate Issue Detection & Treatment
Analytics
Watson / Big
Insights Analytics
Knowledge
Interface
Search
Engine
Knowledge Base
Search & Retrieve
Predictive
Engine
Active
Profiling
• Predict Issues
• Prioritize
Treatments
Rules
Engine
Call Center
Knowledge Base
Agent
Assist
• Caller Identified
3
2
• Prioritized Potential Issue(s) Presented
to Agent
• Issue Selected, or where necessary,
researched
• Treatments for Issue are applied through
automation, or where necessary,
manually
• Cross or Up Sell
• Closure
1
Active Profile
Watson / Streaming
Analytics
Call Center
IVR
1
The Knowledge Base is compiled into an Active Profile
2
Customer transfers out of IVR to the call center and gets identified
3
Agent is provided with prioritized potential issues by the Rules Engine, together with treatments. Where needed the agent
can research the Knowledge Base. Treatment application is automated where possible, reducing manual time required
IBM Confidential
© 2011 IBM Corporation