Download Chapter 3 Problem Statement

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Chapter 3
Problem Statement
This chapter describes the overall purpose of the study including the motivation,
problem statement, research objectives, research methodology, assumptions and
scope of the study.
3.1
Introduction
India being one of the ten most industrialized nations of the world, it has made
rapid advances in industrialization. But this standing has fetched with it undesirable and surprising consequences such as pollution and unexpected urbanization.
This chapter briefly outlines the motivation for the study and briefly describes the
problem statement, research objectives and questions.
3.2
Motivation
Data mining provides a promising tool for solving air pollution management problems. The necessity for predicting air pollution incidentally is more suitable when
it is known that data sets are available for free of charge in MPCB data repositories and at a nominal cost from meteorological centers. An efficient tool once built
for abstracting the large quantities of data sets that are available and deriving
useful knowledge from them can help to detect the vulnerability of the exposed
people in the survey areas. This will enable planners to utilize the information to
59
improve the health conditions of these areas for effective air pollution mitigation
and management.
3.3
Problem Statement
In order to predict air pollution conditions, it is essential to handle and manage
historical data sets of the parameters measured. Considering the vast amount
of data available and to distinguish the pattern and extent of relationships for
useful and efficient extraction of knowledge, there is a need for using data mining
techniques.
Much of the spatial data obtained are sparse in nature, for example, pollution
levels at different locations. Thus, a method for obtaining a continuous data set
from a sparse data set is a practically useful need. Kriging methods that interpolate
to recognize the spatial variation of any attribute that is continuous is too irregular
and cannot be modeled by a simple and smooth mathematical function.
The present research thesis examines the air pollution data at different locations
in Mumbai and Navi Mumbai and attempts to forecast and interpolate the values
for different locations. More specifically, this thesis uses data mining tools and
techniques of artificial intelligence like artificial neural networks to forecast the air
pollutants namely SO2 , NOx and RSPM in the monitored locations and kriging
as a technique to estimate their values in the unmonitored areas of Mumbai and
Navi Mumbai.
3.4
Research Objectives
The primary objective of the research is to offer scientific analysis of pollution
data. The analysis is to be done by applying machine learning and data mining
techniques to the pollution data. Thus, the objective is to develop data mining
techniques for pattern recognition of the air pollutants by using available data sets.
The following are some of the other objectives:
60
• To offer scientific analysis of pollution data
• To adapt methods from machine learning and data mining of spatial data
and apply them to pollution data analysis.
• To know the air pollutants in Mumbai and Navi Mumbai.
• To develop a model for estimating the air pollution level.
• To integrate methods of spatial pattern analysis using exploratory statistics,
spatial data analysis and artificial intelligence.
• To demonstrate the application of data mining techniques using available
data sets from different locations and different data sources as case study.
• To develop an application program to implement a kriging module.
• To identify the factors affecting the prediction of air pollution levels
• To identify the Air Quality Index based on the predictions of the air pollutants.
3.5
Research Questions
Research questions that required to be answered to achieve the objectives are:
• Which functions can efficiently describe air pollution pattern in datasets?
• What is the relation between air pollution pattern parameters and characteristics like SO2 , NOx , RSPM and the general air quality index?
• What relationship exists between the meteorological parameters and the air
pollutants?
• What are the meteorological elements that affect the dispersion of the air
pollutants in the geographical area?
• What is the spatial pattern of air pollution in the study area?
61