top of page
Forretningsmøde

Text Analysis

SENTIMENT ANALYSIS

Exploring the emotions of the subreddits

Are people using r/JoeBiden less happy than people on r/DonaldTrump or is it the other way around? Are people engaging with a subreddit on Donald Trump more susceptible to mood swings?

 

We assigned a happiness score from 1-9 to all users based on the words of their comments. This made us able to compare the sentiments of the two user groups.

​

FINDINGS

Sentiment box plot

​

  • Most of the comments in both camps were discussing in a neutral manner.

  • Comments on r/JoeBiden may be slightly more positive

  • Emotions are more extreme in the comments on r/DonaldTrump

BoxPlotDpi.png
Anker 1

TEXT ANALYSIS

Language statistics

What are the users talking about? Which groups speaks about what?

Using natural language processing techniques such as

 

  • term frequency distributions

  • term frequency-term ratio

  • term frequency-inverse document frequency

 

we were able to find the words that are specific to various user groups and visualize our findings in clouds of words.

We investigated different partitionings of users. We found some of the most informative partitionings to be

​

  • r/DonaldTrump vs r/JoeBiden

  • 3 sentiment groups in both r/DonaldTrump and r/JoeBiden ranging from most negative over neutral to most positive.

​

The comments from r/JoeBiden and r/DonaldTrump were first compiled into two documents with approximate 30,000 words in each.

​

​

 

​

​

trumpdated.png
biden_cloud_test.png
130867630_308991870293411_65317984686762

FINDINGS

Term frequency distribution

​

  • Trump is the talk of the town: the term ”Trump” was the most used on both subreddits

  • ”Fuck” or ”thank”? The users on r/DonaldTrump say ”fuck” in about the same proportion as the users on r/JoeBiden say ”thank”.

​

  • r/DonaldTrump talks more about black Americans than r/JoeBiden.

  • r/JoeBiden users eagerly discuss about voting.

 

​

Anker 2

Diving into the topics of discussion

Using NLTK (the Natural Langauge Toolkit) we also investigated the contexts of certain words to get a deeper understanding the words of the comments. In this way we learned that

 

  • users on r/DonaldTrump think Democrats and Joe Biden are racist

  • users on r/JoeBiden think Donald Trump and the people voting for him are racist.
     

​

Visualizing sentiment group differences

Using an inner quartile range filter on the sentiment data we were able to partition both subreddits into a negative, neutral and positive comment segment and compile these comment segments into six documents.

​

Negative Comments

Negative sentiment segment

r/DonaldTrump

130285066_436491291049753_51489270042796

Negative comments on r/DonaldTrump contains a lot of derogatory terms such as "bitch", ”monster”, "soyboys" and ”zuckfuck”.
 

Negative comments on r/JoeBiden are less personal: "terrorism", "fraud" and "violence” are prominent negative words but there are also personal derogatory terms such as ”turntables” and ”fuckers”.

130285066_436491291049753_51489270042796

r/JoeBiden

Neutral Comments

Neutral comments on both subreddits witness that the users engage in sober political discussions: Prominent words are ”people”, ”right”, ”vote”, ”republican” and ”democrat”.

130587687_1452321838306764_8766454161637

Neutral sentiment segment

r/DonaldTrump

130587687_1452321838306764_8766454161637

r/JoeBiden

Positive Comments

130923248_200707791647212_40433895317459

Positive sentiment segment
r/DonaldTrump

130923248_200707791647212_40433895317459

r/JoeBiden

​The positive r/DonaldTrump comments seem to mention "kim", "kimberly" and "klacik" quite a lot: This indicates there is a positive sentiment towards the Republican Kimberly Klacik on the subreddit
 

For the r/JoeBiden positive comment segment we see words such as "momma", "papa" and "heaven", ”god" and "believe". This points in the direction that family related issues and religious issues may appear more often in the r/JoeBiden comments in positive contexts

Anker 3
Anker 4

Team

Simon_edited.jpg

Simon Ankjær Tommerup

Thomas.jpg

Thomas Dahl Heshe

130974750_211447000452998_1751524863031353467_n_edited_edited.jpg

Asger Frederik Græsholt

bottom of page