Data in Brief (Feb 2024)

A tourism dataset from historical transaction for recommender systems

  • Choirul Huda,
  • Yaya Heryadi,
  • Lukas,
  • Widodo Budiharto

Journal volume & issue
Vol. 52
p. 109990

Abstract

Read online

The tourism industry has currently grown in various aspects, including the types of attractions, their quantity, and the number of tourist visits in various regions, contributing positively to both regional and global economies. Historical transactions are essential for developing recommender systems, utilizing techniques such as Collaborative Filtering and Demographic Filtering. TripAdvisor is a reputable website providing a wide range of accessible tourism information, including attractions, user profiles, and ratings. However, this unstructured raw data requires processing to create an adequate dataset for recommender systems. This study conducted a series of data processing steps on the raw data, including data restructuring, validation, content addition, integration with Google Maps, normalization, and modeling. This study successfully produced an original dataset comprising User Transaction, Item or Attraction, Attraction Type, Continent, Region, Country, City, and Visiting Mode. It also includes an entity relational model for tourism in Indonesia, particularly in Bali, Malang, and Yogyakarta regions, based on various global user experiences. This dataset is adequate and essential for developing various models of tourism recommender systems such as using Collaborative Filtering.

Keywords