Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Chapter 3 Problem Statement This chapter describes the overall purpose of the study including the motivation, problem statement, research objectives, research methodology, assumptions and scope of the study. 3.1 Introduction India being one of the ten most industrialized nations of the world, it has made rapid advances in industrialization. But this standing has fetched with it undesirable and surprising consequences such as pollution and unexpected urbanization. This chapter briefly outlines the motivation for the study and briefly describes the problem statement, research objectives and questions. 3.2 Motivation Data mining provides a promising tool for solving air pollution management problems. The necessity for predicting air pollution incidentally is more suitable when it is known that data sets are available for free of charge in MPCB data repositories and at a nominal cost from meteorological centers. An efficient tool once built for abstracting the large quantities of data sets that are available and deriving useful knowledge from them can help to detect the vulnerability of the exposed people in the survey areas. This will enable planners to utilize the information to 59 improve the health conditions of these areas for effective air pollution mitigation and management. 3.3 Problem Statement In order to predict air pollution conditions, it is essential to handle and manage historical data sets of the parameters measured. Considering the vast amount of data available and to distinguish the pattern and extent of relationships for useful and efficient extraction of knowledge, there is a need for using data mining techniques. Much of the spatial data obtained are sparse in nature, for example, pollution levels at different locations. Thus, a method for obtaining a continuous data set from a sparse data set is a practically useful need. Kriging methods that interpolate to recognize the spatial variation of any attribute that is continuous is too irregular and cannot be modeled by a simple and smooth mathematical function. The present research thesis examines the air pollution data at different locations in Mumbai and Navi Mumbai and attempts to forecast and interpolate the values for different locations. More specifically, this thesis uses data mining tools and techniques of artificial intelligence like artificial neural networks to forecast the air pollutants namely SO2 , NOx and RSPM in the monitored locations and kriging as a technique to estimate their values in the unmonitored areas of Mumbai and Navi Mumbai. 3.4 Research Objectives The primary objective of the research is to offer scientific analysis of pollution data. The analysis is to be done by applying machine learning and data mining techniques to the pollution data. Thus, the objective is to develop data mining techniques for pattern recognition of the air pollutants by using available data sets. The following are some of the other objectives: 60 • To offer scientific analysis of pollution data • To adapt methods from machine learning and data mining of spatial data and apply them to pollution data analysis. • To know the air pollutants in Mumbai and Navi Mumbai. • To develop a model for estimating the air pollution level. • To integrate methods of spatial pattern analysis using exploratory statistics, spatial data analysis and artificial intelligence. • To demonstrate the application of data mining techniques using available data sets from different locations and different data sources as case study. • To develop an application program to implement a kriging module. • To identify the factors affecting the prediction of air pollution levels • To identify the Air Quality Index based on the predictions of the air pollutants. 3.5 Research Questions Research questions that required to be answered to achieve the objectives are: • Which functions can efficiently describe air pollution pattern in datasets? • What is the relation between air pollution pattern parameters and characteristics like SO2 , NOx , RSPM and the general air quality index? • What relationship exists between the meteorological parameters and the air pollutants? • What are the meteorological elements that affect the dispersion of the air pollutants in the geographical area? • What is the spatial pattern of air pollution in the study area? 61