Research Papers Library

Automatic Extraction of Keywords for a Multimedia Search Engine Using the Chi-Square Test

We present a method to automatically acquire a set of keywords that characterise a large multimedia collection. Our method compares captions associated with pictures in the collection with a model of general English language. The words that deviate from the model are very specific of the captions and thus make appropriate keywords. Professional annotators evaluated our results and concluded that more than 97% of our top 2,000 one-word keywords were truly descriptive of the collection. We also mined the collection’s query logs and extracted keywords that reflect the most important indexing terms from the users’ perspective. Our method offers a strategy for selecting the keywords that make up the indices of multimedia search engines.

Download PDF

Get Exclusive Research Tips in Your Inbox

Receive Great tips via email, enter your email to Subscribe.
Please wait

airs logo

Association of Internet Research Specialists is the world's leading community for the Internet Research Specialist and provide a Unified Platform that delivers, Education, Training and Certification for Online Research.

Newsletter Subscription

Receive Great tips via email, enter your email to Subscribe.
Please wait

Follow Us on Social Media

Book Your Seat for Webinar GET FREE REGISTRATION FOR MEMBERS ONLY      Register Now