Date of Graduation
Bachelor of Science
Computer Science and Computer Engineering
Committee Member/Second Reader
Twitter is a microblogging website where any user can publicly release a message, called a tweet, expressing their feelings about current events or their own lives. This candid, unfiltered feedback is valuable in the spaces of healthcare and public health communications, where it may be difficult for cancer patients to divulge personal information to healthcare teams, and randomly selected patients may decline participation in surveys about their experiences. In this thesis, BERTweet, a state-of-the-art natural language processing (NLP) model, was used to predict sentiment and emotion labels for cancer-related tweets collected in 2019 and 2020. In longitudinal plots, trends in these emotions and sentiment values can be clearly linked to popular cancer awareness events, the beginning of stay-at-home mandates related to COVID-19, and the relative mortality rates of different cancer diagnoses. This thesis demonstrates the accuracy and viability of using state-of-the-art NLP techniques to advance the field of public health communications analysis.
bertweet, transformers, twitter, cancer, nlp, sentiment
Baker, W. (2021). Using Large Pre-Trained Language Models to Track Emotions of Cancer Patients on Twitter. Computer Science and Computer Engineering Undergraduate Honors Theses Retrieved from https://scholarworks.uark.edu/csceuht/92