4000 stories with sentiment analysis dataset

2019-03-01T16:27:08Z (GMT) by James Carney Cole Robertson
This dataset presents 4,000 short stories that have been classified in terms of their emotional content and semantic structure. Emotional content was calculated using the valence, arousal and dominance norms in Warriner et al. (2014). Semantic structure was derived using the doc2vec algorithm, which classifies each text as a 300-place vector. The authors created this dataset as part of a study of the impact of narrative literature on mental health.