Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
A Search-based Method for Forecasting Ad Impression in Contextual Advertising Defense Overview Background: Web and contextual advertising Motivation: importance of volume forecasting in contextual advertising Methodology: forecasting volume as an inverse of the ad retrieval Experiments Web Advertising Huge impact on the Web and beyond $21 billion industry Main textual advertising channels: Search advertising Contextual advertising Contextual Advertising (CA) CA Basics Supports a variety of the web ecosystem Selects ads based on the “context”: Interplay of three participants: Web page where the ads are placed Users that are viewing this page Publisher Advertiser Ad network Advertiser’s goal is to obtain web traffic Importance of Impression Volume Critical in planning and budgeting advertising campaigns Common questions for advertisers and intermediaries: Bid value Impact of ad variations Timing of the campaign A Challenging Problem of Impression forecasting CA platforms are complex systems Have hundreds of contributing features A moving target, dynamic Competitors and what they are willing to pay Publisher‘s content and traffic vary over time Large scale computation: billions of page views, hundreds of millions of distinct pages, and hundreds of millions of ads Dynamic bid landscape Current practice Run test ad in real traffic for a few days Simultaneously with the baseline Compare with the baseline Obvious drawbacks: Use ad serving infrastructure Expensive Inefficient Very long turn-around time Forecasting as Inverse of Ad Retrieval Ad retrieval: given a page and a set of ads find the best ads Forecasting: given an ad and a set of past impressions, find where the ad would have been shown if it were in the system This work: assumes ads selected based on similarity of features: Use the WAND (Broder et al, CIKM 2003) DAAT algorithm as page selection Similarity of ad and context feature vectors: requires monotonic scoring function – this work uses dot product Features can be based on either user of page context. Conceptual Work Flow Keep all the data used in ad retrieval for a given period For an unseen/incoming ad: Examine each impression Score the ad using the ad retrieval algorithm Compare the ad score with the score of the lowest ranking ad shown in the page view Count the impressions where the ad would have been shown Main challenge: scale In order to beat scalability problem: Index only unique pages Adaptation of the WAND algorithm for count aggregation needed in forecasting A Two-level Process Use a posting list order to allow early termination Indexing Unique Pages The revenue estimate of an ad-page pair: score(p,a) = similarity(p,a)*bid Revenue estimate for the lowest ranking ad: minScorep For repeating pages the similarity is constant However, ads and bids vary: Could change the lowest ranking ad of a unique page Only one index entry per unique page: What revenue to store for the lowest ranking ads? Save a distribution of estimates {rev1…revn} Assign median to the minScorep MinScorep is recomputed based on the current ad supply Two-level process (Impression forecasting) First phase (approximate) evaluation: maxWeightf = max{wf,p : for all p} Full evaluation: Framework Offline processing Online processing Analyzing the pages Building a page inverted index Creating a page statistics file We use the inverted page index and page statistics to forecast the # of impressions of a given ad. Output Given a ad and bid, output the # of imp Give a ad, output the curve describe the relation b/w bid and # of impressions Experiment Results Day to day forecast Week to week forecast Observations: Similar results between day-day and weekweek forecasting. The errors seems big, however, Due to the traffic fluctuation. Even with large margin of error, our result is still significant (it’s the best of its kind, and it’s still acceptable in campaigning budgeting and advertising strategy) Top row has a good prediction. Bottom row does not match well due to traffic fluctuation, but match the trend and sharp very well. Tradeoff b/w efficiency and accuracy Changing the value of minScorep will have effect on the output of the first level Ad Variation Example Subtle difference could lead to dramatic performance change Conclusion Ad retrieval algorithm is the determining factor in the CA impression volume forecasting Introduced a search-based forecasting as inverse of ad retrieval Promising experimental results Further work: combine search with learning approaches to further improve forecasting.