Skip to main content

Table 4 Datasets combined for the study

From: Developing an online hate classifier for multiple social media platforms

Source

Platform and domain

Number of comments

Cum. count

% from total (%)

ICWSM-18-SALMINEN [5]

YouTube news media

T = 3221

3221

1.6

H = 2364 (73.4%)

NH = 857 (26.6%)

ALMEREKHI-19 [73]

Reddit 10 popular sub-communities

9991

13,212

5.1

1619 (16.2%)

8372 (83.8%)

DAVIDSON-17-ICWSM [16]

Twitter generic tweets

24,783

37,995

12.5

20,620 (83.2%)

4163 (16.8%)

KAGGLE-18 [7]

Wikipedia editor discussions

159,571

197,566

80.8

15,294 (9.6%)

144,277 (90.4%)

  1. Breakdown under number of comments shows T total comments, NH non-hateful comments, H hateful comments