| Skip to main content | Skip to navigation |

Finding Advertising Keywords on Web Pages

  • Wen-tau Yih, Microsoft Research, USA
  • Joshua Goodman, Microsoft Research, USA
  • Vitor R. Carvalho, Carnegie Mellon University, USA

Full text:

Track: Data Mining

Slot: 16:00-17:30, Wednesday 24th May

A large and growing number of web pages display contextual advertising based on keywords automatically extracted from the text of the page, and this is a substantial source of revenue supporting the web today. Despite the importance of this area, little formal, published research exists. We describe a system that learns how to extract keywords from web pages for advertisement targeting. The system uses a number of features, such as term frequency of each potential keyword, inverse document frequency, presence in meta-data, and how often the term occurs in search query logs. The system is trained with a set of example pages that have been hand-labeled with "relevant" keywords. Based on this training, it can then extract new keywords from previously unseen pages. Accuracy is substantially better than several baseline systems.

Organised by

ECS Logo

in association with

BCS Logo ACM Logo

Platinum Sponsors

Sponsor of The CIO Dinner

Valid XHTML 1.0! IFIP logo WWW Conference Committee logo Web Consortium logo Valid CSS!