Download DASA: Dissatisfaction-oriented Advertising based on

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Marketing ethics wikipedia , lookup

Advertising management wikipedia , lookup

Targeted advertising wikipedia , lookup

Transcript
Expert Systems with Applications 37 (2010) 6182–6191
Contents lists available at ScienceDirect
Expert Systems with Applications
journal homepage: www.elsevier.com/locate/eswa
DASA: Dissatisfaction-oriented Advertising based on Sentiment Analysis q
Guang Qiu a,*, Xiaofei He b, Feng Zhang a, Yuan Shi a, Jiajun Bu a, Chun Chen a
a
b
Zhejiang Key Laboratory of Service Robot, College of Computer Science, Zhejiang University, Hangzhou 310027, China
College of Computer Science, Zhejiang University, Hangzhou 310027, China
a r t i c l e
i n f o
Keywords:
Contextual advertising
Advertising keyword extraction
Ad selection
Sentiment analysis
Syntactic parsing
a b s t r a c t
Online advertising has become one of the major revenue sources of today’s Internet ecosystem. The main
advertising channels used to distribute textual ads are sponsored search and contextual advertising. Here
we consider the problem of contextual advertising, i.e. associating ads with a Web page. Most of previous
work only focuses on topical relevance of ads whereas the consumer attitudes are ignored. In this paper,
we propose a novel advertising strategy, called Dissatisfaction-oriented Advertising based on Sentiment
Analysis (DASA), to simultaneously improve ad relevance and user experience. Specifically, by using syntactic parsing and sentiment dictionary, we propose a rule based approach to extract topic words of opinion sentences associated with negative sentiment, which are regarded as the advertising keywords. We
also design a prototype system for product information submission for the sake of ad selection. We take
into account the consumer attitudes and promote the competitors of those products with which the consumers are not satisfied. The experimental results on advertising keyword extraction and ad selection
have demonstrated the effectiveness of the proposed approach.
Ó 2010 Elsevier Ltd. All rights reserved.
1. Introduction
The growth of the Internet signified a dawn of a new age for
advertising. Compared with that on traditional media like newspaper and television, advertising on the Internet (online advertising)
boasts advantages including wide coverage, low cost and so on.
Driven by these unique advantages of online advertising and the
rapid development of World Wide Web, advertisers have paid
increasing attention to this novel and impressive advertising media. As mentioned in (Broder, Fontoura, Josifovski, & Riedel, 2007),
the total Internet advertiser cost in US alone in 2006 is estimated
at over 17 billion dollars with a growth rate of almost 20% year
over year which means more than 29 billion dollars in 2009.
The large investment in online advertising market provides
main funding for the majority of Internet companies. For some
Web sites, especially those for-profit non-transactional ones, revenue from advertising maintains not only current performance but
also future development. Therefore, it seems that these advertising-relied Web sites should get ads published as many as possible
to gain high revenue from advertisers. However, Bhargava et al.
have show in their work (Bhargava & Feng, 2002) that advertising
would create a disutility to Web users, consequently reducing sites’
market share and consumer-side revenue, further the profits from
advertisers. Also, Mccoy et al. have drew a similar conclusion in
q
This work is supported by China National Key Technology R&D Program
(2008BAH26B00).
* Corresponding author. Tel.: +86 571 87953955.
E-mail address: [email protected] (G. Qiu).
0957-4174/$ - see front matter Ó 2010 Elsevier Ltd. All rights reserved.
doi:10.1016/j.eswa.2010.02.109
their investigation (Mccoy, Everard, Polak, & Galletta, 2007). In
other words, from the perspective of consumers, advertising
should be constrained or even boycotted. The tradeoff between
financial revenue and market share triggers the emergence of relevant advertising to emphasize the relevance between ads and
Web pages for the sake of consumers.
Broder et al. categorize relevant advertising (textual ads)
according to the distribution channels into sponsored search (SS)
and contextual match (CM) (Broder et al., 2007). SS aims to place
originating query driven ads on the result pages from a Web search
engine. This kind of advertising is used by major Web search engines like Google, Yahoo! and Microsoft Live Search. CM aims to
place the commercial ads in the content of generic Web pages.
Some major Web search engines also provide such service and
there are also many smaller players. Fig. 1 gives illustrative examples for these two types of advertising. The primary focus of this
paper is on CM which is also referred to as contextual advertising.
The dominant strategy of most of current sponsored search and
contextual match is the keyword targeted marketing (RibeiroNeto, Cristo, Golgher, & Moura, 2005). Previous approaches to targeted/relevant advertising consider short-term and long-term
interests, however suffer from privacy issues (Wang et al., 2002).
In keyword targeted marketing, keywords are extracted from
search queries (in the case of SS) or Web pages (in the case of
CM) to match against those keywords associated with ads provided
by advertisers. It would be important to note that the privacy issues can be avoided in this kind of strategy.
The problem of placing relevant ads using the keyword targeted
marketing strategy (mostly for contextual advertising) has
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
6183
Fig. 1. Examples of sponsored search and contextual match. (a) Yahoo! search results for query ‘‘Honda automobiles”, in which we can see an ad of ‘‘Honda Autos” on the right.
(b) A snapshot of the home page of automotiveforums.com. There are ads provided by Google adsense on the left.
received a lot of attentions recently (Broder et al., 2007; Chakrabarti, Agarwal, & Josifovski, 2008; Lacerda et al., 2006; Ribeiro-Neto
et al., 2005; Wu & Bolivar, 2008; Yih, Goodman, & Carvalho, 2006).
These approaches only focus on topical relevance between ads and
Web pages, which may not be optimal in some situations. For
example, for a blog written by someone who is complaining about
the safety of Honda Accord, the ads related to Honda Accord are good
matches if we simply consider the topical relevance. However, the
consumer reading this blog may have a negative feeling about Honda Accord and therefore may not click the ads. Thus, it might be
desirable to place the ads of other automobiles with high safety
standard.
In this paper, we propose a novel ad targeting mechanism,
called Dissatisfaction-oriented Advertising based on Sentiment
Analysis (DASA), to address the above mentioned problem. Our basic idea is to combine traditional keyword relevance matching with
sentiment analysis (also known as opinion mining). In fact, sentiment analysis of texts is a kind of non-topical text analysis technique which is orthogonal to the topical based technique (Esuli &
Sebastiani, 2005). By sentiment analysis, we aim to discover the
author’s underlying sentiment when he/she is writing the text,
and eventually determine his/her attitude towards the mentioned
topics. This way, we are able to promote ads which not only relate
to the topics of the Web pages but also meet the consumers’ potential information needs. Naturally, in the example given above, given the author’s negative attitude towards the safety of Honda
Accord, we may consider promoting automobiles which have higher safety standard like Volvo.
The rest of the paper is organized as follows. Section 2 discusses
previous work on contextual advertising and sentiment analysis. In
Section 3, we provide an overview of our DASA advertising strategy, along with some discussions on application issues. In Sections
4 and 5, we describe our approaches for advertising keyword
extraction and ad placement in details, respectively. In Section 6,
we describe our experimental methodology and data sets, and
demonstrate experimental results with discussions. Finally, we
provide some concluding remarks in Section 7.
2. Related work
In this section, we provide a brief description of related work on
contextual online advertising and sentiment analysis which is the
key technique employed in our approach.
6184
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
2.1. Contextual online advertising
Contextual advertising is a major type of online advertising, in
which ads are placed on Web pages according to their content.
Published literature on contextual advertising is still limited. One
reason is that it is an emerging research area; in addition, commercial secrets also limit publications of research achievements. In
existing researches, some reduce the problem to keyword matching (Wu & Bolivar, 2008; Yih et al., 2006), while others inspect relevance of the entire text on the Web page (Broder et al., 2007;
Chakrabarti et al., 2008; Lacerda et al., 2006; Ribeiro-Neto et al.,
2005).
In the first line, Yih et al. (2006) emphasize the extraction of
appropriate advertising keywords in contextual advertising and
propose a classification based approach. That is, by using documents manually annotated with advertising keywords, they apply
logistic regression to train a classifier to classify a keyword as an
advertising keyword or not. The features used to represent keywords include linguistic features, html metadata, TFIDF, query
log, etc. A similar approach is recently proposed by Wu and Bolivar
(2008) which adopts linear regression to determine if a keyword is
an advertising one. The features used in their work are similar to
those in (Yih et al., 2006). It would be important to note that, in
our work, we develop a novel strategy to extract advertising keywords in which no training process is required.
In the second line, previous work on relevance measures between Web pages and ads can be roughly categorized into syntactic based and semantic based approaches. Ribeiro-Neto et al.
provide a syntactic solution to ad placement based on the cosine
similarity measuring between Web page and ad vectors (RibeiroNeto et al., 2005). They also propose a solution to solve the impedance problem between the vocabulary of ads and Web pages. In
their follow-up work (Lacerda et al., 2006), they design a ranking
strategy for displaying ads according to their relevance by effectively leveraging all individual features using genetic programming. They aim to optimize overall precision and minimize the
number of misplacements. The semantic approach proposed in
(Broder et al., 2007) stems from the linguistic phenomena of polysemy that a word has different meanings in different semantic contexts, which is contrary to the impedance problem in (Ribeiro-Neto
et al., 2005). They use the proximity of the ad and page classes as
the semantic score. Their experimental results show that semantic
score is significant as to improving the ad quality.
However, all previous approaches above focus only on the topical relevance and ignore the sentiment information between the
lines. In this work, we propose a novel strategy which considers
both the topic relevance and sentiment coherence between ads
and Web pages.
2.2. Sentiment analysis
Another work related to our approach is sentiment analysis.
Current researches on sentiment analysis mainly focus on topic related issues (Hu & Liu, 2004; Lu & Zhai, 2008; Mei, Cai, Zhang, &
Zhai, 2008; Mei & Ling, 2007) and sentiment classification (Ding
& Liu, 2007; Hatzivassiloglou & McKeown, 1997; Pang & Lee,
2004; Turney, 2002; Turney & Littman, 2003). Here, we only summarize some of previous approaches on topic modeling and word
sentiment classification, which are the most closely related to
our work.
In the work of (Mei & Ling, 2007), Mei et al. propose a novel
probabilistic model called Topic-Sentiment Mixture to capture
the mixture of topics and sentiment simultaneously. A topic model
and two sentiment models are defined in their work based on language models to model the probabilistic distribution of words in
different topics and sentiment polarities. The work by Lu and Zhai
(2008) studies the problem of how to integrate a well-written expert review about an arbitrary topic with many ordinary opinions
expressed in a text collection. They propose a semi-supervised topic modeling approach to cast the expert review as a prior in the
probabilistic topic model PLSA and fit the model to the text
collection.
Hatzivassiloglou et al. are the earliest to tackle the problem of
determining the semantic orientation of sentiment words
(Hatzivassiloglou & McKeown, 1997). Their proposed method tries
to predict the orientation of subjective adjectives by analyzing
pairs of adjectives extracted from a large unlabeled document
set. The underlying intuition is that the act of conjoining adjectives
subjects to linguistic constraints on the orientation of the adjective
involved. Turney et al. adopt a different methodology which requires little linguistic knowledge (Turney & Littman, 2003). They
first define two minimal sets of seed terms as descriptive of the
categories Positive Sp and Negative Sn. Then they compute the
point-wise mutual information (PMI) of the target term t with each
seed term ti as a measure of their semantic association.
Sentiment analysis of sentences is a major building block of our
proposed approach. The specific task here is to extract the topic
words of sentences that the contained sentiment words are about.
In our work, we propose a rule based approach to accomplish this
extraction task.
3. Overview of DASA advertising strategy
Our contextual advertising strategy DASA works as follows:
(1) Extract topic words as the candidate topics that ads may be
about from the Web pages (the traditional way to promote
ads);
(2) Determine consumers’ sentiment on the extracted topic
words and select those towards which consumers have negative sentiment as the final advertising keywords;
(3) Match the advertising keywords with appropriate ads.
In other words, an ad is considered to be relevant to a Web page
only if it is associated with the topic words towards which consumers have negative attitudes. The most crucial steps of our approach are topic word extraction and its sentiment classification,
which eventually lead to advertising keyword selection. Fig. 2
shows the architecture of DASA.
3.1. Application issues
There are several application issues we have to address before
the detailed elaboration of our approach.
Since our advertising approach depends on the analysis of consumers’ attitudes, the first issue is that whether there exists large
scale Web environment containing plenty of personal opinions
where ads can be placed. The answer to this problem is User Generated Content (UGC), which is also known as Consumer Generated
Media (CGM), referring to various kinds of media content produced
by consumers. It has been characterized as ‘‘conversational media”
which is a two-way process encouraging the publishing of users’
own content and commenting on others’ (http://www.en.wikipedia.org/wiki/User-generated content). Typical applications include
forums and blogs, which have already been prevailing Web applications on the Internet (http://www.businessweek.com/globalbiz/
content/jan2007/gb20070109559223.htm). Therefore, UGC is ideal
environment for our advertising strategy.
The second issue is the targeted audience of our advertising. In
UGC, there are two types of users participating in Web activities:
writers and readers of the content. When a writer is complaining
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
6185
Fig. 2. Illustration of the workflow of DASA. TWn means the topic word extracted from Web pages by the topic word extraction technique. AKk is the final advertising keyword
selected from the topic words by sentiment classification technique. These two steps form the advertising keyword extraction phase as described by the dashed rectangle.
about some product in his/her article, our advertising scheme
tends to place ads which are about rival products. This can potentially help the writer find a new product that satisfies his/her
needs. Besides, this scheme also provides alternatives to the readers who have similar opinions with the writer. When there are no
personal comments or only positive opinions contained in the article, traditional advertising strategy can be applied.
4. Advertising keyword extraction
As stated above, we reduce the task of advertising keyword
extraction to topic extraction and sentiment classification subtasks, i.e. to identify topics towards which consumers have negative attitudes. Often the consumers are more concerned about
the topics on which they have personal comments. Therefore, we
only consider the sentences which contain sentiment words. And
these sentences are referred to as opinion sentences throughout this
paper. Our extraction approach can be summarized in three steps:
mar, the relation between two words A and B can be described as
A(or B) depends on B(or A). There are also other complicated relations between A and B. For example, A depends on another word C
while C depends on B. We define two categories to summarize all
the relations, which are also illustrated in Fig. 3.
Definition 1. Direct relation (DR): A direct relation means one
word depends on the other one directly or they both depend on a
third word directly. In the context of sentiment word S and topic
word T, this relation means that S(or T) depends on T(or S), or S and
T both depend on another word H.
Definition 2. Indirect relation (IDR): An indirect relation means one
word depends on the other one through another word or they both
depend on a third word indirectly. In the context of sentiment
word S and topic word T, this relation means that S(or T) depends
on T(or S) through another word H, or S(or T) depends on a word H1
which depends on the same word H which T(or S) depends on.
(1) Identify the opinion sentences in the texts;
(2) Extract the topics in the opinion sentences given the labeled
sentiment words;
(3) Select advertising keywords from the extracted topic words.
4.1. Identify opinion sentences
In this step, as we define opinion sentences as sentences that
contain sentiment words, the identification task can be tackled
by discriminating if any of the contained words are sentiment
ones. In our work, we employ a dictionary based approach to judge
if a word is a sentiment word. The dictionary used in our work is
the General Inquirer (http://www.wjh.harvard.edu/inquirer/)
which consists of 1892 negative words such as ‘‘abandon”, ‘‘abuse”,
‘‘poor”, and 1563 positive ones such as ‘‘able”, ‘‘appreciate”, ‘‘good”.
4.2. Extract topic words of opinion sentences using rules
Given opinion sentences identified from the texts and the contained sentiment words as well, topic words associated with the
sentiment words can be extracted. Here we propose our solution
using pre-set rules based on syntactic parsing of sentences.
We take advantage of Minipar (http://www.cs.ualberta.ca/lindek/minipar.htm) as the syntactic parsing tool, which uses the
dependency grammar (Tesniere, 1959). After parsing, words in a
sentence are assigned with corresponding syntactic categories
and linked to each other by certain relations. In dependency gram-
Fig. 3. Different relations between sentiment word S and topic word T. (a–c) are the
three kinds of direct relations. (d–g) illustrate the four kinds of indirect relations.
6186
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
Note that in IDR, there may be more than one H or H1 in the
dependency path. In our work, we only consider the one word situation as compact dependency relations would guarantee the association between S and T with high certainty.
Given the seven relations between S and T, we design our rules
which are able to capture the relations as well as the word syntactic categories. The specific rules are listed in Table 1.
In our rule representations, S(T, H1, H)-Category and S(T, H1,
H)-Relation denote the syntactic category and relation of the word
S(T, H1, H), respectively. We also assign priority Pi (i = 1,...,7) to
each rule in case that more than one rule can be applied to the
extraction task. In the seven rules, R1 is assigned with the highest
priority P1 and R2 to R7 get lower priorities in descending order.
We set the priority for each rule by considering the certainty of
the association between S and T that the corresponding relation
can guarantee.
For the sake of simplicity, we only consider the topological
structures implied by these rules, ignoring most of the detailed
information of relation and syntactic category. One constraint is
that the category of topic word is limited to N (assigned by Minipar) which means the part-of-speech of the topic word should be
noun. Note that Minipar parses all pronouns and numbers also as
N. We do not consider the noun phrases in our current work, since
the state-of-the-art phrase identification is still not accurate enough. There are also some exceptions we need to exclude, such
as the conj relation in R1 and R2 which means the two related
words are of equal syntactic role in the sentence.
Using these rules and their priorities, we are now able to extract
the topic word in the opinion sentence. Take R1 for example, given
the sentiment word S parsed as S-Category, word with category TCategory and relation S-Relation with S is taken as the topic word T
of S with priority P1. The algorithm shown in Fig. 4 elaborates how
to extract topic word T using these rules.
4.3. Select advertising keywords
Clearly not all the above extracted topic words can be taken as
advertising keywords directly as explained in the introduction section. We should identify users’ attitudes towards the topics and
then select appropriate topic words as advertising keywords. From
the perspective of sentiment analysis, the implied sentiment of the
topic word is classified to be positive or negative based on the
polarity of the related sentiment word. As stated in the very beginning, advertising on topics towards which consumers have negative attitudes is more helpful and targeted to users. Therefore,
we take topic words which correspond to negative sentiment
words as the advertising keywords.
5. Appropriate ad placement
After the advertising keywords are obtained through topic
extraction and sentiment classification, we can select appropriate
Table 1
Rules defined in our work. S(T, H1, H)-Category and S(T, H1, H)-Relation mean the
syntactic category and relation of the word S(T, H1, H), respectively. We use these
rules to extract topic words in opinion sentences.
No. Rules
R1
R2
R3
R4
R5
R6
R7
S-Category < S-Relation > T-Category
T-Category < T-Relation > S-Category
S-Category < S-Relation > H-Category < T-Relation > T-Category
S-Category < S-Relation-T-Crelation > T-Category
T-Category < T-Relation-S-Crelation > S-Category
S-Category < S-Relation-H1-Relation > H-Category < T-Relation > T-Category
T-Category < T-Relation-H1-Relation > H-Category < S-Relation > S-Category
Fig. 4. Algorithm for extracting topic word.
ads that are relevant to these keywords and simultaneously meet
consumers’ needs. As stated above, our DASA ad placement strategy only concerns topics towards which consumers have negative
attitudes.
In real world applications, advertising keywords can be brand
names (such as BMW in the sentence ‘‘I do not like BMW”), as
well as composed features (such as safety in the sentence ‘‘I
am not satisfied with the safety of Honda Accord”). Therefore, different strategies should be adopted to select corresponding ads
for these two situations. In our work, we propose to promote
rivals’ products when the advertising keywords are brand names
and select products which have excellent performance in certain
features when the advertising keywords are feature names. In
case of multi-candidates, WTP (willingness to pay) (Feng, Bhargava, & Pennock, 2003) can be employed to determine which one
to place.
In order to select appropriate brands efficiently for the two situations, we need to know which two brands are rivals and which
features a brand is good at. In traditional advertising systems, an
ad is always represented by its bid phrases, self-explanatory title,
short description and URL of the target Web page. These pieces
of information can only provide a general description about the
product. However, we argue that ad description should be more
targeted and expressive in terms of its advantages over its rivals
so that the ad-network can be aware of products’ advantages and
consequently select appropriate ads to meet consumers’ needs. In
our work, we design a browser based system to construct a knowledge base for different products. Fig. 5 is a snapshot of the system.
Any manufactories or dealers can provide their product information through the system. Currently, we take automobile domain
as an example. Note that our approach can also be applied other
domains.
The product information is stored in XML files as shown in
Fig. 6. The element brand in the node AUTO is the brand name of
the auto, such as BMW, BENZ, etc. The element name in the node
MODEL is the model name, such as Z, X serials of BMW. The node
RIVAL stores brand names of rivals and node ADVANTAGE_FEATURE
contains the outstanding features of the model. An example is that
750li of BMW may be labeled as rival of S600 of BENZ. Another
example is that Volvo has excellent performance in safety. Thus,
we can promote ads of S600 when the consumer is complaining
about 750li and place ads of Volvo when he is unsatisfied with
the safety feature of Honda Accord.
6187
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
Fig. 5. Snapshot of our system for product information submission. Manufactories or dealers can provide advantageous feature information and rivals of their products
through this system.
D#2 for advertising keyword evaluation. Table 2 shows the detailed information of each data set.
In D#1, the average length of the 3783 opinion sentences is 21
words. All these sentences are labeled manually with the topic
words. We develop a tool for volunteers to do the labeling with
the instruction as follows:
Fig. 6. XML format for product information storage.
6. Experiments and results
Three experiments are conducted to evaluate the performance
of our approaches to topic word and advertising keyword extraction and ad placement. We begin with a description of the evaluation corpus.
6.1. Evaluation corpus
Since our advertising approach considers not only topical relevance but also consumers’ opinions, we use Web forums as the
evaluation corpus. Particularly, we choose the popular auto forum
automotiveforums.com as the source data in our experiments. In the
forums, the users’ posts are organized as one thread if they all respond to the same post. We crawl and store these posts in unit of
threads, i.e. store all the posts of one thread in a single file. We call
this file an article hereafter. Let D#0 denote the set of articles. Another two data sets are constructed from D#0 for the first two
experiments respectively, D#1 for topic word evaluation and
Suppose you are the writer of the sentence, given the sentiment
word, please figure out in the sentence what is the topic word that
the sentiment word related to. For example, in the sentence ”The z3
was awful and z4 is not much better”, the corresponding topic word
for ”awful” should be ”z3” and that for ”better” should be ”z4”.
We collect the articles which contain the 3783 opinion sentences and the final collection D#2 consists of 647 articles. For
advertising keyword labeling, we divide the 647 articles into 10
groups, and then request 10 annotators to label the keywords in
articles whose corresponding ads they consider appropriate
(”appropriate” means not only relevant but also not annoying).
These keywords are then taken as the correct advertising keywords
for the articles.
Table 2
Statistical information for the three data sets for D#0, D#1 and D#2.
D#0
D#1
D#2
128,425 articles
3783 opinion sentences
647 articles
6188
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
6.2. Experiments on topic word extraction
Topic word extraction is the key to the extraction of advertising
keywords, therefore we conduct separate experiments on it. We
begin with the evaluation strategy and criteria.
6.2.1. Evaluation strategy and criteria
To the best of our knowledge, there is no previous work on sentiment based topic word extraction in the research area of sentiment analysis. Therefore, in our experiments, we compare our
approach with a naive distance based approach that extracts the
nearest noun of the sentiment word as the topic word. We use
the accuracy as the evaluation criteria. The accuracy is defined to
be the percentage of the sentences whose automatically extracted
topic words (either by our approach or the distance based approach) are the same as the user pre-labeled ones.
6.2.2. Results and discussions
Table 3 shows the results of the 3783 opinion sentences in D#1
by using our rule based approach, the distance based approach and
two random approaches: Random-All and Random-Noun. The Random-All approach selects any one word from the sentence randomly as the topic word. Thus its corresponding accuracy value
is calculated as the inverse of the average sentence length (21
words as shown before). Similarly, the Random-Noun approach is
to select any noun in the sentence as the topic word and thus its
accuracy value is the inverse of the average number of words
parsed as N contained in the sentence (The number is 10.25. This
number is a little bit high because Minipar parses all pronouns
and numbers in addition to actual nouns as N and meanwhile some
parsing errors also exist.). We take the -Noun case into account as
both rule and distance based approaches impose the noun
constraint.
As can be seen, both our proposed approach and the distance
based approach significantly outperform random approaches. Particularly, our approach obtains 55% accuracy, while the distance
based approach obtains 42% accuracy. There is almost 31% performance improvement. This shows that our approach is effective in
extracting the topic word. Moreover, this result reveals the fact
that the relationship between sentiment word and topic word
Table 3
Performance results of our rule based approach compared with the distance based
approach and random approaches. The accuracies of random approaches are
calculated as the inverse of average sentence length (for Random-All) or the inverse
of average number of contained words parsed as N (for Random-Noun). The
CorrectSent column is the number of sentences whose topic words are extracted
correctly.
Approaches
CorrectSent
Accuracy
Rule
Distance
Random-Noun
Random-All
2063
1581
–
–
0.55
0.42
0.10
0.05
can be better characterized by syntactic structure rather than
distance.
Table 4 gives some examples of our rule based approach compared with other approaches.
6.3. Experiments on advertising keyword extraction
In this section, we first describe our evaluation criteria and
strategy, and then show the results and discussions.
6.3.1. Evaluation criteria and strategy
In order to evaluate the performance of different advertising
keyword extraction methods, we consider the quality of advertising keywords from two aspects: the commercial value (CV) and
traditional measurements (precision, recall and F1-measure)
which are calculated based on the manually labeled results. The
commercial value of extracted keywords in an article is defined
as the percentage of extracted keywords which are of real value
in some particular online advertising environment, like Google’s
AdWords and Yahoo’s Sponsored Search. It measures the quality
of keywords from the perspective of advertisers while traditional
measurements emphasize the consumers’ interests.
For comparison, we implement another keyword extraction approach based on the famous keyword weighting schema TFIDF,
which is regarded as the most useful feature for advertising keyword extraction (Yih et al., 2006). It would be important to note
that, however, our proposed approach can be easily incorporated
into previous approaches, since most of them only consider topical
relevance while our approach considers consumers’ attitudes. Our
aim in the experiments is to show the effectiveness of our approach instead of showing superior performance over sophisticated previous approaches. Therefore, a comparison to a typical
feature in advertising keyword extraction is enough. For our rule
based approach, the advertising keywords of an article are composed of those of the opinion sentences in the article. All these keywords are not weighted. For the TFIDF based approach, we remove
the stop words and select the top 10 keywords with the highest
TFIDF values as the advertising keywords. The average number of
the labeled keywords in an article is 2.76 and that of those extracted by our rule based approach is 2.26.
One important issue in our evaluation strategy is about the recognition of the commercial value of a word. We argue that a word
is commercially valuable if and only if there is at least one advertiser that bids on it. Particularly, in order to quantitatively evaluate
it, we submit a keyword to some search engine and check if there is
any ad shown in the result page. If there is ad shown, this keyword
is considered commercially valuable. In current experiments, we
make use of Google’s search engine and its AdWords service.
Fig. 7 gives two search examples (‘‘shoes” and ‘‘black”) on Google.
From the results, we can see that there are ads in the search result
page of ‘‘shoes” but no ads for ‘‘black”. So the keyword ‘‘shoes” is
considered commercially valuable while ‘‘black” is not. This conclusion is also consistent to common sense.
Table 4
Some examples for different approaches. The Sentiment column shows the sentiment word contained in the sentence. The Nouns column shows the words which are parsed as N
by Minipar in the sentence. The Labeled column shows manually labeled topic words. The Rule and Distance columns show the extracted topic words by rule and distance based
approaches, respectively.
Sentences
Sentiment
Nouns
Labeled
Rule
Distance
So the next thing is, does anyone know where I could get a cheap dealers
manual for the car?
The coil tower on these vehicles are notorious for corrosion.
But in the end, a nice turbo setup would cost approx.
Cheap
Thing, anyone, where, manual,
dealers, car
Tower, coil, vehicles, corrosion
End, setup, turbo, approx
Manual
Manual
Dealers
Tower
Setup
Tower
Setup
Vehicles
Approx
Notorious
Cost
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
6189
Fig. 7. Google search results for words ‘‘shoes” and ‘‘black”. As can be seen, there are ads in the result page of ‘‘shoes” but no ads for ‘‘black”. This means word ‘‘shoes” has
higher commercial value than ‘‘black”.
6.3.2. Results and discussions
In our experiments, we report CV, precision, recall and F1-measure of the top 10 selected keywords as the overall results for the
TFIDF based approach. Considering that the average number of
advertising keywords manually labeled and extracted by rule
based approach is close to three, we also evaluate the performance
of TFIDF based approach for the top three selected keywords. Table
5 lists all the results of different approaches.
From the table we can see that our approach outperforms the
TFIDF based approach (both TFIDF-Average and TFIDF-Top3) in
all the four measurements, except the precision in TFIDF-Top3.
The improvement of CV (17.3% over TFIDF-Average and 8.6% over
TFIDF-Top3) means that the advertising keywords extracted by
our approach are more valuable from the perspective of advertisers
and thus ensures these keywords would be related with ads instead of being abandoned because of being meaningless for
advertisers.
The superior performance of our approach in traditional measurements precision, recall and F1-measure shows that our approach can extract more advertising keywords with high
accuracy than other approaches. Note that, the F1-measure can
be essentially taken as the targeted-ness of the keywords from
the view of consumers as it is a combination of precision and recall.
Usually the more targeted the advertising keywords are, the higher
revenue the advertisers will obtain. The reason is that with the
improvement in the targeted-ness of keywords, ad relevance
6190
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
Table 5
Results of CV, precision, recall and F1-measure for our rule based approach and the
TFIDF based approach. For TFIDF, we consider two different variations TFIDF-Average
and TFIDF-Top3 which take the average value of top 10 and top 3 keywords,
respectively.
Approaches
CV
Precision
Recall
F1
Rule
TFIDF-Average
TFIDF-Top3
0.265
0.226
0.244
0.101
0.092
0.104
0.375
0.211
0.145
0.159
0.128
0.121
increases, therefore consumers will get less disturbed by ads and
thus will remain well or even better impression on advertising
Web sites. Finally, the market share would not be ill affected a
lot by ads and advertisers and Web sites will be likely to get more
profits. Therefore, F1-measure is more important than the other
two measurements. In the exception of the results, although our
approach is a little inferior in precision compared with TFIDFTop3, we gain 31.4% improvement in the F1-measure, which
means our approach still outperforms TFIDF-Top3.
Note that all the values in our measurements are relatively low.
However, in online advertising-like applications, an improvement
of X% in ad matching can lead to an improvement of X% in the
end result unlike many other tasks where the effect of performance
enhancement on the end result is not linear (Mishne & Rijke, 2006).
Therefore, regarding the magnitude of online advertising market, a
tiny improvement would lead to a considerable increase in the final revenue for advertisers.
6.4. Experiments on ad selection
The above experiments have examined the performance of our
approach for advertising keyword extraction, which is a key step
towards ad placement. We now conduct experiments on selecting
appropriate ads. Since we do not have the corpus of real world ads,
we represent our recommendations as descriptions such as ”autos
those perform excellent in safety”. As described before, there are two
kinds of ads that would be promoted to consumers as they may
complain on a certain brand or feature. For simplicity, we focus
only on features in our current experiments.
We compare DASA with the naive strategy that extracts advertising keywords in the same way but does not consider the atti-
tudes of consumers, i.e. take all the topic words as the
advertising keywords (TE-ALL). Volunteers are requested to make
the judgment on the recommended ads (the ad descriptions indeed). In our experiments, we construct a prototype system in
which volunteers provide texts commenting on some products
(only on features of autos in current work) and then get two lists
of ads returned by DASA and TE-ALL. Volunteers are unaware of
which list is the result of which approach. They are then asked to
determine which one they think is better to meet their needs with
less annoying.
Fig. 8 shows the comparison between DASA and TE-ALL. The Yaxis is the number of times that volunteers regard the results of an
approach are better than those of the other one. An obvious observation from the figure is that our approach (DASA) outperforms TEALL. For 35 times, volunteers judge that the results of our approach
are better than those of TE-ALL, while the contrast is 22 times. The
results show that it is reasonable to take consumers’ attitudes into
account.
7. Conclusions
Targeted advertising is of great importance for Internet companies to gain revenue from both advertisers and consumers. Previous
approaches focus only on the topical relevance while the consumers’ attitudes are ignored. These approaches fail to meet the actual
needs of consumers especially when they may have negative attitudes towards the mentioned topics. In this paper, we take the attitudes of consumers into account and propose a novel advertising
strategy DASA to promote ads according to what they are unsatisfied with. We use pre-set rule based sentiment analysis techniques
to tackle topic word extraction and consumers’ attitude identification in advertising keyword extraction. We also design a prototype
system for users to provide product information for the sake of ad
selection. For evaluation, we conduct separate experiments to measure the quality of topic words, advertising keywords and placed
ads. Specifically, we propose to use commercial value to measure
the quality of advertising keywords from the perspective of advertisers. Experiments show that our strategy outperforms the TFIDF
based approach in advertising keyword extraction and the naive approach which does not consider the attitudes of consumers in ad
selection. The results indicate that it is reasonable to take the attitudes of consumers into consideration.
References
Fig. 8. Comparison results of our approach DASA and TE-ALL which does not
consider the attitudes of consumers.
Bhargava, H. K., & Feng, J. (2002). Paid placement strategies for internet search
engine. In Proc. 2002 Int. WWW Conf. (WWW’02) (pp. 117–123).
Broder, A., Fontoura, M., Josifovski, V., & Riedel, L. (2007). A semantic approach to
contextual advertising. In Proc. 2007 Int. Conf. on research and development in
information retrieval (SIGIR’07), 2007 (pp. 559–566).
Chakrabarti, D., Agarwal, D., & Josifovski, V. (2008). Contextual advertising by
combining relevance with click feedback. In Proc. 2008 Int. WWW Conf.
(WWW’08) (pp. 417–426).
Ding, X., & Liu, B. (2007). The utility of linguistic rules in opinion mining. In Proc.
2007 Int. Conf. on research and development in information retrieval (SIGIR’07) (pp.
811–812).
Esuli, A., & Sebastiani, F. (2005). Determining the semantic orientation of terms
through gloss classification. In Proc. 14th ACM Int. Conf. on information and
knowledge management (CIKM’05) (pp. 617–624).
Feng, J., Bhargava, H. K., & Pennock, D. (2003). Comparison of allocation rules for
paid placement advertising in search engines. In Proc. 5th Int. Conf. on electronic
commerce (ICEC’03) (pp. 294–299).
Hatzivassiloglou, V., & McKeown, K. R. (1997). Predicting the semantic orientation
of adjectives. In Proc. 1997 ACL (ACL’04).
Hu, M., & Liu, B. (2004). Mining and summarizing customer reviews. In Proc. 2004
Int. Conf. knowledge discovery and data mining (KDD’04) (pp. 168–177).
Lacerda, A., Cristo, M., Goncalves, M., Fan, W., Ziviani, N., & Ribeiro-Neto, B. (2006).
Learning to advertise. In Proc. 2006 Int. Conf. on research and development in
information retrieval (SIGIR’06) (pp. 549–556).
Lu, Y., & Zhai, C. (2008). Opinion integration through semi-supervised topic
modeling. In Proc. 2008 int. WWW Conf. (WWW’08) (pp. 121–130).
G. Qiu et al. / Expert Systems with Applications 37 (2010) 6182–6191
Mccoy, S., Everard, A., Polak, P., & Galletta, D. (2007). The effects of online
advertising. Communications of the ACM, 50(3), 84–88.
Mei, Q., Cai, D., Zhang, D., & Zhai, C. (2008). Topic modeling with network
regularization. In Proc. 2008 Int. WWW Conf. (WWW’08) (pp. 101–110).
Mei, Q., & Ling, X. (2007). Topic sentiment mixture: Modeling facets and opinions in
weblogs. In Proc. 2007 Int. WWW Conf. (WWW’07) (pp. 161–175).
Mishne, G., & Rijke, M. (2006). Language model mixtures for contextual ad
placement in personal blogs. In Proc. 5th Int. Conf. NLP (FinTAL).
Pang, B., & Lee, L. (2004). A sentimental education: Sentiment analysis using
subjectivity summarization based on minimum cuts. In Proc. 42nd Ann. meeting
of the association for computational linguistics (ACL’04) (pp. 271–278).
Ribeiro-Neto, B., Cristo, M., Golgher, P., & Moura, E. (2005). Impedance coupling in
content-targeted advertising. In Proc. 2005 Int. Conf. on research and development
in information retrieval (SIGIR’05) (pp. 496–503).
6191
Tesniere, L. (1959). Elements de syntaxe structurale. Paris: Librairie C. Klincksieck.
Turney, P. (2002). Thumbs up or thumbs down? Semantic orientation applied to
unsupervised classification of reviews. In Proc. 40th Ann. meeting of the
association for computational linguistics (ACL’02) (pp. 417–424).
Turney, P. D., & Littman, M. L. (2003). Measuring praise and criticism: Inference of
semantic orientation from association. ACM Transactions on Information Systems,
21(4), 315–346.
Wang, C., Zhang, P., Choi, R., & D’Eredita, M. (2002). Understanding consumers
attitude toward advertising. In Proc. 8th Americas Conf. on information system
(pp. 1143–1148).
Wu X., & Bolivar, A. (2008). Keyword extraction for contextual advertisement. In
Proc. 2008 Int. WWW Conf. (WWW’08) (pp. 1195–1196).
Yih, W., Goodman, J., & Carvalho, V. R. (2006). Finding advertising keywords on web
pages. In Proc. 2006 Int. WWW Conf. (WWW’06) (pp. 213–222).