Download Data Mining Solutions

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Cluster analysis wikipedia , lookup

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Data Mining Solutions
INTRODUCTION
CAVEATS
Vast amounts of data combined with advancements in computer
processing allow Tegra to exploit computational efficiencies on
hundreds of statistical models across thousands of variables and
down millions of records. The result is data mining – the practice of
discovering patterns and trends that go beyond simple analyses to
make better decisions.
Data mining does not provide
answers. Rather, it provides
for discoveries within a large
data set to generate actionable ideas. The idea generation process
leads to new insights into the data that can then be tested. It is
often necessary to take small steps and iterate over an analysis
many times to achieve great results.
TECHNIQUES DEPLOYED
Insights gained from data can only be as robust as the sources used.
Better conclusions can be drawn from data of higher Quality and
Quantity.
Summary Tables
Trends
Correlations
Basic
Part of the success of data mining is user-driven. While technology
can be utilized to uncover hidden relationships between variables,
the practicality and value of these relationships are framed through
understanding the data and the business.
Linear regression models
Contingency tables
Logistic regression models
Nonlinear models
Splines & Smoothers
Visualizations
Discriminant analysis
Decision trees
k-NN or nearest neighbor
k-means cluster analysis
Neural networks
Monte Carlo simulations
Custom algorithms
RETURN ON DATA INVESTMENT (RODI)
Data sources, especially prescriber level data, can be expensive and
often costs hundreds of thousands per year.
Return On Data Investment (RODI) accounts for insights gained from
data mining. Tegra can do more with your data than just reporting.
Advanced
APPLICATIONS & IMPLEMENTATION
Overall knowledge discovery in databases
Data error and anomaly investigations
Physician segmentation
Physician retention
Physician probability of prescribing
Physician future trends
Physician behavior
Account clustering
Transactional patient claims data
Upselling opportunities
Territory optimization
Identify abusers
Insurance claims
Payer spillover
METHODOLOGY
Define the business problem
Build the data mining database
Prepare the data
Perform data mining analyses
Evaluate models, Train models, Validate models
Deploy model and present results
EXAMPLES
1. Duration. An exhaustive data mining exercise was applied to call
data and prescribing patterns. Tegra performed dozens of
regression models over several time periods to reveal the
importance of duration. Reach and Frequency are important. Depth
and breadth are important. However, data mining showed that
longer duration consistently resulted in higher sales performance.
2. Migration by Specialty. Data mining revealed that PCP and ENDO
prescribers behaved differently across segments. PCPs were more
likely to try the product than ENDOs, but also more likely to only try
the product once. Tegra created models to isolate segments more
worthy of sales effort resulting in substantial efficiencies for the field
sales force and faster sales growth for a newly launched product.
3. Quality Assurance Team. A combination of classic experimental
design plus data mining identified manufacturing deficiencies and
ways to optimize the quality assurance process. Using repeated
measures statistical models on machine speed, defect type, number
of defects, time of day, operator and sequence, Tegra calculated pvalues for speed levels and critical defects. In addition, data mining
models helped gain insight into inspection times, operator
variability, and false positive rates. The combined analyses resulted
in clear actionable improvements to the quality control process and
provided analytical rigor to the FDA’s requests.