Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec PROCESS AND APPLICATION OF DATA MINING IN INTERNET BASED LEARNING IN HIGHER EDUCATION – A MODEL P. Karthika1 ,V. Ravikumar2 and P. Karthikeyan3 1 Assistant Professor, Department of Management Studies, MPNMJ Engineering College, Chennimalai, Erode- 638 112, Tamilnadu, India 1 Haed and Assistant professor, Department of Management Studies, MPNMJ Engineering College, Chennimalai, Erode- 638 112, Tamilnadu, India 3 Assistant Professor (Sr.Grade), School of Management Studies, Kongu Engineering College, Perundurai, Erode-638 052, Tamilnadu, India ABSTRACT Learning is a continuous and life long process. Convenient learning has become the need of the current set-up. Information and communication technologies have brought about marvelous changes in the teaching-learning process. Internet based learning system has started to play an indispensable role in a lot of activities. Availability of right content at the time is the real need of the learner. This paper provides an insight into data mining in internet based learning. The role of data mining in internet based learning environment , data mining system components, interactive data mining are discussed together with a process model for implementing data mining in internet based learning at higher education. The process model includes conceptualization and identification, Selection and filtering, Pattern extraction and evaluation and finally visualization. Keywords: Data Mining, internet-based Learning, Conceptualization and identification, Selection Pattern extraction and visualization 1.INTRODUCTION Traditionally finding the useful patterns in data has been given a various of names together with data mining, knowledge extraction, information finding, information collecting, dataselection, and data pattern processing. The word data mining has been mostly used by mathematicians, data forecasters, and the management information systems (MIS) communities. It has also enlarged attractiveness in the database arena. The word KDD was invented at the first KDD workshop in 1989 [1] to emphasize that "knowledge" is the end product of a data discovery. It has been make popular in artificial intelligence and machine learning. Data mining is a significant step in overall KDD process. Knowledge discovery from database involves the application of specific algorithms for extracting required data based on specific pattern [1]. The various additional activities in this process involve data gathering, data assortment, data cleaning, integration of suitable previous knowledge, and correct interpretation of the results of mining. All these activities ensure that useful knowledge is derived from varied sectors such as marketing, Finance, Banking, Engineering, Telecommunication and Teaching. Knowledge Detection in database is the method of outcome of knowledge from immense 75 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec amount of data where data mining is the important of this process. Data mining can be used to mine comprehensible meaningful patterns from large databases and these patterns may then be converted into knowledge. Data mining is the process of take out the information and patterns derived by the KDD process which helps in vital administrative. Data mining works with data warehouse and the whole process is divided into action plan to be performed on data: Collection, transformation, mining and results interpretation. In this paper, reviewed Knowledge Discovery perspective in Data Mining and consolidated different areas of data mining, its techniques and methods in it [2]Big data is a term that describes the growth of the amount of data organization and the potential to discover new insights when analysing the data. Big data spans three various dimensions, which include volume, velocity, and variety [3]. There is pressure in higher educational institutions to provide up institutional effectiveness [4] .Data Mining (sometimes called data or knowledge discovery) has become the area of growing significance because it helps in analysing data from different perspectives and summarizing it into useful information. There are increasing research interests in using data mining in teaching. This new emerging field, called Educational Data Mining, deal with developing methods that discover knowledge from data originating from educational [5]. [6] It has evidently captured the multi-disciplinary aspect of internet based-learning. Fig. 1 illustrates how the various fields of study interact to constitute Internet based-learning. Traning Education Learning Managment Information Technology Knowledge Internet Based Learning Fig.1 Internet based learning applied to various fileld Data mining (DM) is the process of where data is scrutinized and summarized into useful information. In short, data mining is process of developing patterns from large databases [7].DM analyses large dataset to extract hidden patterns such as similar groups of data records using clustering technique. The data is used for machine learning and projecting analysis.DM works to analyse data stored in data warehouses and results in effective decision making [8]. Weiss and Indurkhya in [9] anticipated that, “DM is the search for valuable information in large volumes of data”. According to Technology Forecast [10], and Piatetsky-Shapiro et al. [11], it is the process of extracting before unknown, useful information which include knowledge, association rules, pattern finding, numerical and statistical techniques. Query languages or graphical user interface are required to express the DM requests and discovered information , so that results obtained from the DM Engine become understandable and usable for end users. Data mining can also be used in the areas of manufacture control, customer retention, knowledge exploration, sports, astrology, and Internet Web Surf-Aid. 76 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec Knowledge discovery forms an integral part of data mining towards solving problems in a specific domain. Knowledge is valuable asset to most organizations as a substantial source to enhance organizational competency. Researchers and practitioners in the area of knowledge management, view knowledge in board sense as a condition of mind, an object, a process, an access to information, or a capacity. Today’s knowledge world needs information at a faster pace across the organization to survive in the midst of global competitive environment. Technology plays a vital role in all walks of life. It has created a great impact in various sectors. Literature revels that education sector is benefited by the usage of different technology and modern database principles. Data mining techniques have emerged as successful technique in internet based learning system. The rest of this paper is arranged as follows: Section 2 describes the survey of papers. Section 3 introduces the Data Mining process. Section 3.1 describes the data mining system component. Section 3.2 descripe the intercative data mining. Section 4 shows the experimental results and comparison made with other methods. Section 5 describes the conclusion. This paper discusses the various benefits of data mining in internet based learning environment together with an analysis of various models for data mining in internet based learning. This paper concludes with a theoretical model arrived based on literature study that could be implemented in internet based learning. 2. RELATED WORK Data mining determine knowledge assets from the data store/ warehouse repository used mainly for business intelligence and education. For example, data mining is used by e-learner to extract the desired content so as to facilitate timely learning. There are several groundwork to develop data mining in internet-based learning [12] proposed a model to systematically mine and manage useful knowledge in R&D organizations. Internet-based learning supports the computer-based and computer supported education and training system exploiting web as the representation and delivery medium. A web-based learning system is a high-tech in the field of e-learning due to the advantages of usability, interoperability and accessibility provided by the internet technology. Reusability, interoperability and accessibility are major concepts in the development of modern learning environments and extensively proposed by many researchers. [13]. designed internet-based education systems as a muti-agent architecture working with intelligent reusable learning objects. [14] made pioneering efforts in data mining that mined the knowledge of page contents and student navigation pattern. [15] focused on student enrolments in community colleges and reports a case study in which data mining was used to monitor and predict community college students’ transfer to four-year institutions. [16] presented a real life application of data mining to find weak students. [15] studied the impact of data mining on higher education. This study aided to gain insights about the exiting higher education worldwide and its improvement from data mining perspective. [17] discussed a new model for using data mining in higher educational system. [18] proposed a framework for effective educational process using data mining techniques to uncover the hidden trends and patterns making accuracy based predictions through higher level of analytical sophistication in students counselling process. [19] proposed 77 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec to shape the analysis problem as data mining task. Educational data mining (EDM) is an emerging tools and techniques to educationally related data. The discipline focuses on analysing educational data to develop models for improving learning experiences and improving Institutional effectiveness. Data mining is a significant tool for helping organizations enhance decision making and analysing new patterns and relationships among a large amount of data. A in EDM was presented, from applying data mining for understanding student retention and attrition to finding new ways of making individual student. Many opportunities be existent to study EDM from an organizational unit of analysis to individual course-levels of analysis. Some work is strategic in nature and some of the research is extremely technical. [16] visualized that the education domain offers many interesting and challenging applications for data mining. First, an educational institute often has many diverse and varied sources of information. There are the traditional database (e.g students’ information, educators’ information, class and schedule information, alumni information) online information (online web pages and course content pages) in recent times, multimedia databases. Second, there are many diverse internet groups in the educational domain that give rise to many interesting mining requirement. For example, the managers may wish to find out information such as the admission requirements and to predict the class enrolment size for timetabling. The students are wish to know how best to select courses based on prediction of how well they will perform in the courses selected. The alumni office need to know how best to perform target mailing so as to achieve the best effort in reaching out to those alumni that are likely to respond. All these application not only contribute an educational institute in delivering a better quality teaching experience, but also aid the institution in running its administrative tasks effectively. With so abundant information and so many varied needs, it is foreseeable that an integrated data mining system that is able to cater to the special needs of an educational institution will be in great demand. The literature survey had enabled us to study various papers which made significant impact on our findings from Indian perspective. A small percentage of the studies were based on traditional theories of learning, but without anchoring aspects of web-based learning [20] 3. DATA MINING PROCESS The data mining process obtain five steps is represented in Fig.2. Data repository is the general term used to refer a destination designated for data storage. The data is extracted from the data repositories is the first step of data mining process. Data cleaning and loading into data mining database is the second step in the data mining. Clustering is the task of discovering groups and structures in the data that are in some way or another "similar", without using known structures in the data. Regression is attempts to find a function which models the data with the least error. Classification is the task of generalizing known structure to new data. For example, an e-mail program might attempt to classify an e-mail as "legitimate" or as "spam". 78 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec Raw Data Data Mine Database Raw Data (b) (a) T A B L E (c) (d) (e) Fig. 2. Data Mining Process In the Fig. 2 (a) Extraction of data from repositories (b) Data cleansing and loading into data mining database (c) Data transformation (d) Pattern duscovery using algorithms such as clustering, regression and classification. (e) Data visualization and intrepretation of results 3.1 Data mining System Components The Fig.3 denotes the classification of data mining system. Which is classifies based on database technology, statistics, machine learning, information science, visualization and other discipline. Data mining applied to the internet has the potential to be quite beneficial. Online data mining is mining of data related to the online. Statistics Database Technology Machine Learning Data Mining Information Science Other Discipline Visualization Fig. 3 Data Mining System Components Statistics is at the core of data mining - helping to distinguish between random noise and significant findings, and providing a theory for estimating probabilities of predictions, etc. However Data Mining is more than stats. DM covers the entire process of data analysis, containing data cleaning and preparation and visualization of the results, and how to produce forecasts in real-time. Machine learning explores the construction and study of algorithms that can learn from and make predictions on data [21] .The process of machine learning is similar to 79 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec the data mining. Data Mining and machine learning both are systems search through data to look for patterns. However, instead of take out data for human comprehension - as is the case in data mining applications - machine learning uses that data to improve the program's own understanding. Machine learning programs spot patterns in data and adjust program actions accordingly. There are more applications identified that can benefit from data mining. These include: Life Sciences (LS), Customer Relationship Management (CRM), Web Applications, Manufacturing, Competitive Intelligence, Retail/ Finance/ Banking, and Computer/ Network/ Security, Monitoring/ Surveillance applications, education Support, Climate modelling, Astronomy, and Behavioural Ecology. Indeed, most scientific disciplines are becoming data-intensive and turning to data mining as a tool. There are no of numbers of applications identified that can benefit from data mining. Indeed, most scientific disciplines are becoming data-intensive and turning to data mining as a tool. As information technology transforms their role from passive data collection to a more active exploration and exploitation of information, they face a severe challenge: how can they handle a massive amount of data that generate, collect and store. There is a need to have a technology that can access, analyse, summarize, and interpret information intelligently and automatically [22]. 3.2 Interactive data mining Data has become a critical resource in many organizations and effectively managing the data has become a major need. Various types of data base management systems have been developed to manage the multimedia data types such as text, voice, video and the increasing no of data base system are now inter operating with one another. Extracting useful information from the data has also become urgent requirement. Data by itself will not be of much use unless one can assign means to the data. Data mining techniques enable uses to post queries and extract information from the data often previously unknown. Visualization techniques have helped the user to graphically capture the data in the data base as well as help the user in the mining process. There four data base management, visualization and the mining are three technologies that have to be integrated to provide an interactive data mining. Environment that can help the users a great deal to uncover previously unknown information. The World Wide Web also adds another dimension to interactive data mining. During recent years the explosion of the users and the internet and the increasing the no of World Wide Web services are rapidly advancing digital library technology. Numerous data bases are now scattered across the various sites. The goal is for users to access these databases in a transparent manner. Furthermore, it’s also desirable for the users to extract information from this data sources through the data mining. The data and information are extracted from the data bases have to be formatted in a way that can be understood by the users. Therefore, visualization techniques play a major role for a data management and mining on the internet. Data mining is the process of extracting information and patterns from the data often previously 80 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec unknown. Various tools are now available for data mining. Many of the data mining techniques need to be guided by the user. That is, the user steers the data mining tools so that useful results are obtained. This process is called interactive data mining as there is interaction between the user and the data mining tool. Fig.4 illustrates interactive data mining showing the relationship between the database management system, the visualization tool, and the data mining tool. Visualization Tool Visualize DBMS Response Data Mining Tool DBMS Direct Data Mining Response Fig. 4 Data Mining on Visualized Results Visualization plays a major role in interactive data mining. For example, the visualization tool may be used directly to graphically display the results extracted from the database possibly via a data base management system. The graphical display can then be used by data mining tool to extract patterns. For example, suppose an automobile company wants to sell automobiles. It then queries the database to obtain information about the various neighbourhoods in the town. The data base management system may graphically display the map of town and information about the people in the town. From this picture the data mining tool cloud determine the types of cars to sell to the people in various neighbourhoods. Visualization tools may be applied directly to the results obtained by the data mining tools. In this case the data mining tool extracts information from the database via the DBMS and this information is visualized by the visualization tool. Based on this information the data mining tools may pose additional queries to DBMS to get more data so that it can deduce additional patterns. Note that while the first approach applies data mining techniques to visualization, the second approach applies visualization techniques to data mining. That is, in the first approach, the results from the DBMS are visualized and the data mining tool is applied on the graphical displays. In the second approach visualization techniques are used in the result of the data mining tool so that the results can be graphically displayed. In both cases visualization tools guide the data mining process. In an earlier paper we have identified a third approach where visualization techniques compliment the data mining process. That is the data mining tool is used to uncover patterns; but visualization tools may still be used on the data to have a better understanding of the data [23] 4. PROCESS MODEL FOR DATA MINING IN INTERNET BASED LEARNING A study [24] states that conventional teaching methods provide the least effective method of 81 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec learning. The author compares students’ scores on achievement tests using three forms of instruction; conventional teaching, mastery teaching, and individualized tutoring. Mastery teaching is an instructional technique whereby a teaching supplements a lecture with diagnostic tests to determine where students are having problems, and adjust the instruction accordingly. The proposed model support the internet-based learning with several scheme of learning including adaptive, autonomous and collaborative learning. Leaner Instructor Communication Medium Web-Based Learning Transformed Dataset Selecting and Filtering Pattern Extraction Visualisation Conceptualization Data Mining Process DB Fig. 5 Process Model for Interent-based Learning There is no specific model followed by internet-based learning system. Four components emerge from the literature as part of a internet-based leaning system. They include the following: student model, the pedagogical module, the expert model, and the communication module or interface [25]. Based on the compontents and their work the proposed model which is suitable for the internet based learning encironment is deveopled whish is illustrated in fig 5. 82 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec Internet based technologies have led to great innovation in teaching and learning. Internet based education system constitute one of the fastest growing areas in educational technology, research and development. Internet based education is one of the innovation of technological changes and it mainly overcomes the conventional boundaries of traditional learning systems. It facilities a leaner-centric approach, 24X7 services to all activities concerning the teaching learning process are possible through internet based learning.The instructor stores and delivers information of each individual learner. For example, the model tracks how well a student is performing on the material being taught or records incorrect responses that may indicate misconceptions. Since the purpose of the student model is to provide data for the pedagogical module of the system, all of the information collected should be usable by tutor.The communication medium provides a model of the education process. For example, information about when to review, when to present new topic, and which topic to be present is controlled by this module. As mentioned earlier, the learners and instructor are used as input to this component, so the instructive decisions reflect the differing needs of each student.The subject experts contain the domain knowledge, which is the information being taught to the leaner. However it is more than just a representation of the data; it is a model of how someone skilled in a particular domain represents the knowledge. By using an expert model, the tutor can compare the leaner’s solution to the expert’s solution, pinpointing the places and is the most important since in the absebce of it, there would be nothing to teach the student. Generally, significant knowledge engineering to represent a domain so that other parts of the tutor can access it. The communication module controls interactions with a student, including the dialogue and the screen layouts. For example, it determines how the material should be presented to the student in the most effective way by providing an understanding of the application domain, the goals of the system and its users, and the relevant prior background and prior knowledge. Selecting a data set, or focusing on a subset of variables or data samples, on which discovery is to be accomplish. Pre-processing and data cleaning, removing the noise, collecting the necessary information for modelling, selecting methods for handling missing data fields, accounting for period sequence information and changes. Data reduction and projection, finding appropriate feature to represent data, using dimensionality reduction or transformation methods to reduce the number of variables to find invariant representations for data. Choosing the data mining task depends on the goal of KDD. Selecting methods and algorithms are to be used for searching the patterns in the data. Mining the knowledge: searching for pattern of interest; evaluating or interpreting the mined patterns, with a possible return to any previous steps; using this knowledge for promoting the performance of the system and resolving any potential conflicts with previously-held beliefs or extracted knowledge. These are the steps that all the data mining task process through Selecting and filtering has been presented in jain [26]. He presented taxonomy of available selection algorithms based on pattern recognition or ANN, sub-optimal or optimal, single solution or multi-solution, and deterministic or stochastic. The relevant classification result is based on prior knowledge of the classification task and useful review on relevance and selection. Visualization is useful for project in high dimensional data down to two or three dimensions. 83 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec The main concern is protecting the distance information and deployment of the original data in two or three dimensions. The traditional approach in data visualization is linear projection. A more suitable choice is projecting the data onto a graph. 5. CONCLUSION The research work in this area confirmed that data mining enhances teaching and learning process in a internet-based environment. From this study evident that internet-based learning is in practice in the higher education scanario. Data mining helps in extracting the required information for the teaching-learning process and its saves a lot of time. The theoretical model proposed in this article could serve as a basis for implementing data-mining techniques in internet-based learning scenario. ACKNOWLEDGEMENT The author P.Karthikeyan would like to thank the University Grants Commission, Hyderabad, India for its financial support to Minor Research Project No.F.MRP-4885/14 (SERO/UGC) dated March 2014. REFERENCES [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] Fayyad, U. M., G. Piatetsky-Shapiro, P. Smyth, and Ft. Uthurusamy, 1996. Advances in Knowledge Discovery and Data Mining, (AKDDM), AAAI/MIT Press Pratiyush Guleria and Manu Sood. September 2014. Data Mining In Education: A Review on the Knowledge Discovery Perspective, International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.4, No.5 IBM. (2012). what is big data? Retrieved May 16th, 2012, from http://www.01.ibm.com/software/data/bigdata/ Romero, C., & Ventura, S. (2010). Educational Data Mining: A Review of the State of the Art. Systems, Man, and Cybernetics on, 40(6), 601-618. doi: 10.1109/tsmcc.2010.2053532 C. Romero, S. Ventura, E. Garcia, "Data mining in course management systems: Moodle case study and tutorial", Computers & Education, Vol. 51, No. 1, pp. 368-384, 2008 Bowles, M.,(2004). What Is Electronic Learning? [online]. In: Bowles, Marc. Relearning to E-learn: Strategies for Electronic Learning and Knowledge. Carlton, Vic.: Melbourne University Press, 2004: 3-19. Sang Jun Lee, Keng Siau, “A review of data mining techniques, Industrial Management & Data Systems”, 101/1 [2001] 41-46. Padhraic Smyth, “Data Mining: Data Analysis on a Grand Scale”, July 6,2000 Weiss S. & Indurkhya N, “Predictive Data Mining: A Practical guide”, Morgan Kauf-. mann, 1998. Technology Forecast: 1997 (1997), Price Waterhouse World Technology Center, Menlo Park, CA 84 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY AGU International Journal of Science & Technology http://www.aguijst.com (AGUIJST) 2015, Vol. No. 1, Jul-Dec [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24] [25] [26] William J. Frawley, Gregory Piatetsky-Shapiro, and Christopher J. Matheus, “Knowledge Dis-covery in Databases: An Overview”, AI Magazine Volume 13 Number 3 (1992) (© AAAI) Li and Change, W.C(2007), “Exploiting and Tranferring Presentational knowledeg Assets in R&D organizations”, Expert System with Applications. R.P. Valderrama, L.B. Ocana, and L.B. Sheremetov, “Development of intelligent reusable learning objects for web-based education systems,” Expert Systems with Applications, 28(2), pp. 273-283, 2005. Z. Tang, J. Maclennan, and P. P. Kim. Building data mining solutions with OLE DB for DM and XML analysis. SIGMOD Record, 34:80–85, June 2005 Luan, J.(2002) “Data Mining and its Application in Higher Education”, New Directions for Institutional Research, 2002, 113(Spring)p. 1736. Ma Y.; Liu B.; Wong C.; Yu P.; and Lee S., 2000. Targeting the right students using data mining. In Proc. of 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Boston, USA, 457–464. Dharminder Kumar , Deepak Bhardwaj , “Rise of Data Mining: Current and Future Application Areas”, IJCSI International Journal of Computer Science Issues, Vol. 8, Issue 5, No 1, September 2011 ISSN (Online): 1694-0814 Ranjan, J., & Malik, K (2007). Educational process: A Data-Mining approach. VINE: The Journal of information and knowledge management systems, 37(4), 502-515 Talavera and Gaudioso, 2004.Mining student data to characterize similar behaviour groups in unstructured colla boration spaces. In Proceedings of the Artificial Intelligence in Computer Supported Collaborative Learning Workshop at the ECAI 2004. Zhang, Duan C, and Wu G.(2001) “ Reserch on Self-efficiency on distance Learning and its influence”, White paper. Ron Kohavi; Foster Provost (1998). "Glossary of terms". Machine Learning 30: 271–274 Sherry Y. Chen and Xiaohui Liu, Journal of Information Science, © CILIP 2004, The Contribution of Data Mining in Information Science Thuraisingham B . 1996 Database management and the internet, Local area network handbook, Auerbach Publications, new York. B. S. Bloom, (1984) “The 2 sigma problem: The search for methods of group instruction as effective as oneto-one tutoring.,” Educational Researcher, , no. 13, pp. 4–16. Minaei-Bidgoli, B., Kashy, D.A., Kortemeyer G., Punch, W.F., (2003) “Predicting Student Performance: An Application of Data Mining Methods with an educational Web-based System”, (IEEE/ASEE) FIE 2003 Frontier In Education, Nov. 2003 Boulder, Colorado. Jain, A. and Dues, R. (1988). Algorithms for clustering Data. Prentice-Hall. 85 AGU INTERNATIONAL JOURNAL OF SCIENCE & TECHNOLOGY