Download Application of Data mining in Electronic Commerce

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Application of Data mining in Electronic Commerce
Ling Chuanfan
Jiangxi University of Finance and Economics, P. R. China 330013
Abstract The online portfolio of the e-commerce website is enormous, and it contains a lot of
information of very important user in a large amount of business, but the contradiction between the data
explode and poor information is conspicuous day by day. Through data mining technology, enterprises
can fully utilize various kinds of effective information on Web, mine the potential market, improves the
competitiveness. In this paper, we describe the data mining architecture and the web mining technology.
We further present the e-commerce model based on web mining. We detail the web mining functions
and the data mining process in e-commerce.
Keywords Data Mining, Electronic Commerce, Web Mining
1 Introduction
We are living in a networked era now, and the communication, computer and network technology are
changing the whole human society. ISC (Internet Software Consortium) statistics show that global 233
million host computers have connected with Internet by January of 2004. In our country, the
development of Internet is swift and violent too. According to CNNIC (China Internet Network
Information Center), to June 30, 2004, online computers have attained 36.3 million in our country, and
the number of users 87 million that surf the Internet, and the domains registered under cn are 382216,
and WWW web sites are 626600.
With the development at full speed of Internet and Web technology, all kinds of e-commerce web sites
surge forward, and the development in the whole world of e-commerce is in the ascendant. The trading
value of U.S.A. B2B trade reached 2,700 billion dollars in 2003. According to the estimation by
Forrester Research, the trading value of market of China B2C will reach 27 billion dollars in 2004. The
advantage brought to enterprise, individual and society in development of e-commerce is
omni-directional, and it really brings the mankind into the information-based society.
The e-commerce web site needs to deal with a large number of data every day, but the important
information contained in resources of the data fails to get the abundant mine and use so far. In the fierce
e-commerce market competition day by day, any information related to consumer behaviour is all very
valuable for operator. In order to solve the data explode and information poor phenomenon, data mining
technology arises at the historic moment, and there are extensive application prospects in e-commerce.
2 Data Mining and Web Mining
With the rapid growth of the network information resources, people pay close attention to effectively
take out potential and valuable information from the network information of magnanimity more and
more, and make it function in managing and making policy effectively.
2.1 Data Mining
Data mining is a kind of decision support process, and it is because of such technology as artificial
intelligence, machine study, statistics, etc. mainly, analyses various kinds of existing data increasingly
automated, make summing up reasoning, excavate out the potential mode from it, predict the customer's
behavior, help the policymakers of enterprises to adjust the market tactics, reduce the risk, make correct
decision.
2.1.1 Technology Definition of Data Mining
Look technically, data mining is a process to pick up potential useful information and knowledge from a
large amount of, incomplete, to have noise, a fuzzy one, among the real data at random, draw know by,
people that imply in among them.
2.1.2 Main Technological Reason Exciting Data Mining
Exciting the development of the data mining, the main technological reason used and studied is as
follows: super large-scale database, advanced computer technology, the shortcut of the magnanimity
data, use the profound ability that calculate of statistical method to the data.
2.1.3 Data Mining System Structure
The typical data mining system includes six main parts: Database, data warehouse and other information
storehouse, the database and data warehouse server, knowledge base, the data mining engine, assess the
module in mode, visual user's interface.
Data mining system structure is depicted in figure 1.
Data collection and input
Data mining
engine
Database and data
warehouse server
Database
Data analysis mining
Data warehouse
Mode
assess
Result output
Visual user’s
interface
Knowledge base
Figure 1 Data mining system architecture
2.2 Web Mining
Internet stores a large amount of complicated, no-structuralization, dynamics and incompleteness
information. This information is unable to be handled and managed by using the existing database
management system. The main reason is that the Internet includes a large number of files, figures,
pictures, sound and large-scale commercial data, etc. Users with different backgrounds, different
interests and different purposes have abundant freedom, and they can use these information by random
to connect with any Internet web sites.
Web mining is the extraction of interesting, potential and useful model and implicit information from
artifacts or activity related to the WWW.
2.2.1 Classification of Web Mining
We can classify Web mining into three domains: content, structure and usage mining, as figure 2 shows.
Web mining
Web content mining
Mining based on text
information
Web structure mining
Mining based on
multimedia information
Web usage mining
Generic access
pattern tracking
Individuation usage
record tracking
①Web content mining。It refers the process that we obtain the useful information from the file content
Figure 2 Classification of Web mining
and its description, and it can further divide for mining based on text information and mining base on
multimedia information.
Web structure mining It is a process that we can get useful knowledge from web organization
structure and link relation.
Web usage mining
Analyzing the Web access logs and related data can help understand the browser of the web site and user
behavior model, and it can divide into two domains: generic access model track and individuation use
record track.
2.2.2 Web Mining Technology
Access Analysis. While excavating Web data with this technology, we usually adopt the graphic form,
the page on websites is defined into a node in the picture, and the ultra chain between the pages is
defined into a side in the picture. In process of Web mining, analyzing through access technology can
confirm frequent visit route of website in Web.
②
③
①
。
。
②Association Rules Mining. The purpose is to excavate the interrelation of appearing and hiding
among the data. In Web data mining, the association rules mining should excavate out user's connection
among page (or file) visiting from server during a visit when being excavated, perhaps not have direct
relation of quoting between these pages.
Sequence Model Analysis. Its emphasis point lies in analyzing causality before and after among the
data. In Web server logs, users' visit is recorded by time.
Classification Rules Discovery. It sets up outline characteristic for the specific colony with some
public attribute, and these characteristics can classify newly increased data in database.
Clustering Analysis. It can gather out customer's group with similar characteristic from visited server
information, namely get having users of the similar characteristic, data one together.
③
④
⑤
3 E-commerce Model Based on Web Mining
The online portfolio of the e-commerce website is enormous, and it contains a lot of information of very
important user in every day's a large amount of business. Each customer behavior at Web will produce
relevant data, include message of buying, the relevant data that utilize the search engine and having a look
around in the website again. All mutual data are written down by the backstage supporter's database. Utilizing
perfect database technology, enterprises can collect the information of a large number of customers more
easily. And excavate technology through Web, enterprises can utilize effective customer's information,
explore the potential market, improve the competitiveness.
3.1 Traditional E-commerce Structure Model
In traditional e-commerce structure model (as figure 3 shows), the component related with data mining
has not been introduced. Mobile devices and PC customer client user communicate Web application
server through Internet's protocol and application server and the backstage supporter's database server to
carry on information interchange. It usually works in distributing environment.
Mobile
devices
Application
Server
Data Base
Server
Internet
Client
Figure 3 Traditional e-commerce structure model
The application server offers a running environment for business. In this environment, it is various kinds of
components that every layer application is separated from commercial service, and these components
communicate each other through the network. At present, we have many application servers at choices,
such as IIS server of Microsoft Company, J2EE server of every big company, etc. Run in JVM (Java
Virtual Machine) that the application software uses the server in Web and inlays inside. The components
of these servers utilize the foundation of the network to build up the catalogue offered and security service,
pass HTTP or IIOP (Internet Inter-ORB Protocol) and the customer and other packages communication.
3.2 E-commerce Model Based on Web Mining
The e-commerce model based on web mining include the basic component in the traditional
e-commerce model on the basis of Web, and it has added the knowledge base server, as figure 4 shows.
Mobile
devices
Web application
server
Internet
Client
Data base
server
Data mining
engine
Knowledge base
server
Figure 4 E-commerce model based on web mining
Application server can carry on information interchange with the knowledge base server too except that
exchanges with the database server, after for example use the server to put forward the data to
knowledge base server to excavate asking, the knowledge base server excavates the engine through the
data, excavate and deal with the data to the database, returned and gave and used the server finally.
4 Web Mining Application in E-commerce
The change of the present economic mode, the electronic trade on Internet has changed the relation
between operator and customer. The online customer's mobility is very big. They especially pay
attention to the price of the goods while paying close attention to the goods and meeting their demands.
Enterprises need to know customer's hobbies, value orientations as many as possible, so as to ensure the
competitiveness of times in e-commerce. The potentiality that Web excavates lies in using various kinds
of data to excavate algorithms, analyses such data as the daily record and customer, products and sale on
Internet server, etc.
4.1 Excavating Information from E-commerce
In e-commerce, commercial information comes from various kinds of channels. For example, when
consumers consume with the credit card, operators can collect relevant information in the settlement
course of the credit card, note consumer go on time, place, commodities or services that interested in
that consume, like price competence and solvency, etc. data that receive; When we apply for credit card,
etc. to need to fill in the form, our personal information is stored in the corresponding business database;
Enterprises can also buy necessary information from other companies or the organization besides
collecting relevant business information by oneself . Make up this information from various kinds of
channels, use the treatment technology of information to deal with, receive the decision basis that the
operator is used for carrying on directional marketing to the specific consuming groups or the individual
from it.
For example as operator being through is it after excavating, find to business datum one bank account
holder is it apply double to unite accounts to demand suddenly to go on, and confirm this consumer is
that the first application unites accounts, the operator can infer this user may be going to marry. It can is
it is it buy house, pay children such business of long-term investment as tuition fee, etc. to use for to
promote to this user orientation. The operator may even sell this information and specialize in the
wedding celebration goods and company served. In the countries and regions with more developed
market economy, a lot of e-commerce ventures all begin to excavate and carry on deep processing to
business information through the data on the basis of original information system, in order to construct
one's own competition advantage.
4.2 Web Mining Functions
All visitors of Web website will leave the trace had a look around, and the information is stored in the
logs of Web server automatically. Web analyses tools will become meaningful information next life
through analyzing and dealing with the daily record file of Web server.
Through collecting, processing and dealing with a large amount of information related to consumer's
consumer behavior, we can confirm specific consuming groups or the individual's interest, consumption
habit, propensity to consume and consumption demand, and then infer out the corresponding consuming
groups or individual's next consumer behavior. Then based on this, we can carry on the directional
marketing of the specific content to the consuming groups recognized, save the cost, raise the result,
thus bring more profits to enterprises.
4.2.1 Understand customer's behavior
Know the visitor personal hobby.
Receive the conversion ratio from visitor to buyer.
Find the customer's turning round rate.
Grasp the customer's purchase mode and the visitor mode.
Find common characteristic , customer of group , understand customer group and relation to buy item.
Find the connection between the customer and customer.
Optimizing the management mode of the e-commerce website through understanding the visitor
①
②
③
④
⑤
⑥
dynamic behavior, the website can offer the individualized service to different customers.
4.2.2 Judge the efficiency of Web website
Grasp high purchase rate part and low part of buying, and receive reliable market feedback information.
Know the visit situation of the page.
Find the situation of clicking of the advertisement, assess and test the rate of return on investment of the
advertisement.
Web designer no longer totally depends on determining the nature to guide and design websites of the
expert, but revise and design website's structure and appearance according to the visitor information.
4.3 Data Mining Process in E-commerce
The process of data mining in e-commerce includes three stages: data preparation, excavate operation
and result expression and explanation.
4.3.1 Data Preparation
This stage can divide into such three stature steps as data integrating, data choosing, data pretreatment,
etc. further.
Data integration deals with many files or many database data of running environment to integrate, solve
the fuzzy problem of semanteme. Purposes of data choose are to distinguish data set analyzed to need,
narrow range of processing; improve the quality that the data excavate. The pretreatment of the data is
for overcoming the limitation of the instrument to dig up of data at present.
4.3.2 Data Mining
This stage carries on real excavating and operating, including mainly: determining how to produce
supposing, choosing the suitable tool, exploring the operation of knowledge, verifying the knowledge
found.
4.3.3 Result Expression and Explanation
In this stage, we analyze to the information that is drawn according to the end user's decision purpose,
and distinguish the most valuable information, and refer to a policymaker through the support tool of
decision.
5 Conclusion
Excavating and finding knowledge to the data emerging in the electronic business procedure, it can
contribute to enterprise's making the corresponding sales tactics to different products, and it can
establish the page which meet customer's demand dynamically; It also can contribute to optimizing the
structure of organization of the commercial affair website, and improve the working efficiency of
commercial websites; Contribute to launching goal marketing pointedly. In a word, the data have
excavated and supported such key commercial procedure as CRM, ERP and SCM, etc. effectively in
e-commerce, it is the important means innovated in e-commerce marketing. The application that the data
excavate in e-commerce is a brand-new field, and its development is very fast. A lot of problems await
further research and promotion at the same time.
References
[1] Han J, Kamber M. Data Mining: Concepts and Techniques. San Mateo, CA: Morgan Kaufmann,
2000.
[2] Liu Hongyan, Chen Jian, Chen Guoqing. Review of classification algorithms for data mining, Journal
of Tsinghua University (Science and Technology), 2002,42(6): 727-730[in Chinese]
[3] Mobasher B, Cooley R, Srivastara J. Automatic Personalization Based on Web Usage Mining.
Communications of the ACM, 2000, 43(8), 142-151
[4] Wilvander Aalat, Boudewijn Van Dongen, et al. Workflow mining: A Survey of Issues and
Approaches. Data & Knowledge Engineering, 2003, (47): 237-267
[5] Kanmrani A, Rong W, Gonzalez R. A genetic algorithm methodology for data mining and intelligent
knowledge acquisition, Computer & Industrial Engineering, 2001, 40(2): 361-377
[6] Chakrabarti S. Data mining for hypertext: A tutorial survey. SIGKDD Exploration, 2000, 1(2): 1-11
[7] Srivastava J. et al. Web usage mining: Discovery and application of usage patterns from Web data.
SIGKDD Explorations, 2000, 1(2): 12-23