11th International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services

Research Article

When Twitter meets Foursquare: Tweet Location Prediction using Foursquare

Download916 downloads
  • @INPROCEEDINGS{10.4108/icst.mobiquitous.2014.258092,
        author={Kisung Lee and Raghu Ganti and Mudhakar Srivatsa and Ling Liu},
        title={When Twitter meets Foursquare: Tweet Location Prediction using Foursquare},
        proceedings={11th International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services},
        publisher={ICST},
        proceedings_a={MOBIQUITOUS},
        year={2014},
        month={11},
        keywords={twitter location foursquare geotagging},
        doi={10.4108/icst.mobiquitous.2014.258092}
    }
    
  • Kisung Lee
    Raghu Ganti
    Mudhakar Srivatsa
    Ling Liu
    Year: 2014
    When Twitter meets Foursquare: Tweet Location Prediction using Foursquare
    MOBIQUITOUS
    ICST
    DOI: 10.4108/icst.mobiquitous.2014.258092
Kisung Lee1,*, Raghu Ganti2, Mudhakar Srivatsa2, Ling Liu3
  • 1: Georgia Insititute of Technology
  • 2: IBM T J Watson Research Center
  • 3: Georgia Institute of Technology
*Contact email: kslee@gatech.edu

Abstract

The continued explosion of Twitter data has opened doors for many applications, such as location-based advertisement and entertainment using smartphones. Unfortunately, only about 0.58 percent of tweets are geo-tagged to date. To tackle the location sparseness problem, this paper presents a methodical approach to increasing the number of geo- tagged tweets by predicting the fine-grained location of those tweets in which their location can be inferred with high confidence. In order to predict the fine-grained location of tweets, we first build probabilistic models for locations using unstructured short messages tightly coupled with se- mantic locations. Based on the probabilistic models, we pro- pose a 3-step technique (Filtering-Ranking-Validating) for tweet location prediction. In the filtering step, we introduce text analysis techniques to filter out those location-neutral tweets, which may not be related to any location at all. In the ranking step, we utilize ranking techniques to select the best candidate location for a tweet. Finally, in the validat- ing step, we develop a classification-based prediction valida- tion method to verify the location of where the tweet was actually written. We conduct extensive experiments using tweets covering three months and the results show that our approach can increase the number of geo-tagged tweets 4.8 times compared to the original Twitter data and place 34% of predicted tweets within 250m from their actual location.