JMIR Research Protocols (Jul 2015)

Collecting and Analyzing Patient Experiences of Health Care From Social Media

  • Rastegar-Mojarad, Majid,
  • Ye, Zhan,
  • Wall, Daniel,
  • Murali, Narayana,
  • Lin, Simon

DOI
https://doi.org/10.2196/resprot.3433
Journal volume & issue
Vol. 4, no. 3
p. e78

Abstract

Read online

BackgroundSocial Media, such as Yelp, provides rich information of consumer experience. Previous studies suggest that Yelp can serve as a new source to study patient experience. However, the lack of a corpus of patient reviews causes a major bottleneck for applying computational techniques. ObjectiveThe objective of this study is to create a corpus of patient experience (COPE) and report descriptive statistics to characterize COPE. MethodsYelp reviews about health care-related businesses were extracted from the Yelp Academic Dataset. Natural language processing (NLP) tools were used to split reviews into sentences, extract noun phrases and adjectives from each sentence, and generate parse trees and dependency trees for each sentence. Sentiment analysis techniques and Hadoop were used to calculate a sentiment score of each sentence and for parallel processing, respectively. ResultsCOPE contains 79,173 sentences from 6914 patient reviews of 985 health care facilities near 30 universities in the United States. We found that patients wrote longer reviews when they rated the facility poorly (1 or 2 stars). We demonstrated that the computed sentiment scores correlated well with consumer-generated ratings. A consumer vocabulary to describe their health care experience was constructed by a statistical analysis of word counts and co-occurrences in COPE. ConclusionsA corpus called COPE was built as an initial step to utilize social media to understand patient experiences at health care facilities. The corpus is available to download and COPE can be used in future studies to extract knowledge of patients’ experiences from their perspectives. Such information can subsequently inform and provide opportunity to improve the quality of health care.