|
Tweets from Twitter¡¯s public timeline are collected and analyzed to identify the book popularity. Popular books are those that are mentioned frequently in tweets.
Apache Solr is used to calculate the Lucene score. Apache Solr is comprised of three main components: an indexer, a Lucene index and a server . To determine the popularity of a book, relevant tweets are collected from Twitter using Twitter Developer API and passed on as queries to the Apache Solr server. The server retrieves the book's description associated with the tweet content along with a weight that indicates the similarity between the query and the book's description for each gathered tweet. These books are sorted based on the total weight of their tweets. As a result, the books are ranked according to their popularity (indicated by Lucene score). The more popular the book is, the higher is the Lucene score.
|