I just added brief questions but you will find the questions in the PPT attached:You need R to work on this.
Tidy Text Format” – Question 1
a)You have been assigned “your author”* in:
–ITS836-46_Week12 Authors for Text Analysis.xlsx
b)Identify books for the author: www.gutenberg.orgorg/browse/authors/a”>http://www.gutenberg.org/browse/authors/a
c)Compare word frequencies as in Figure 1.3
of Jane Austen, the Brontë sisters, and “your author”
*You can chose another author: org/browse/scores/top”>https://www.gutenberg.org/browse/scores/top
Make sure it is not on the list for anyone else
“Sentiment analysis with tidy data” Question 2
a)Analyze the sentiment through multiple works (minimum 2) belonging to “your author’” as Fig 2.2
b)Comparing three sentiment lexicons through the sentiment lexicons as Fig 2.3
–AFINN from Finn Årup Nielsen,
–bing from Bing Liu and collaborators, and
–nrc from Saif Mohammad and Peter Turney.
c)Plot words that contribute to positive and negative sentiment for your authors works as in Fig 2.4
d)Create a world cloud of the most common words for your author’s works as in Fig 2.5
“Analyzing word and document frequency: tf-idf” Question 3


0 comments