Please find here our own work on “User Comment Analysis in Online Journalism”.

Tools

ToolDeveloperDescription
The Conflict of Interest Resolver (in development)German nonprofit investigative newsroom CORRECTIVThe Conflict Resolver is a tool that scrapes scientific journals and extracts the statements on conflicts of interest. It can be used by journalists to evaluate experts credibility and possible bias, but also by the public as a browser plug-in.
EditorThe New York Times Research & DevelopmentExperimental text editing interface that explores how collaboration between machine learning systems and journalists could afford fine-grained annotation and tagging of news articles.
Facebook Page Post ScraperMax WoolfTool that enables to scrape the reactions data from any public Facebook Page
News Stream (in development)Consortium of Fraunhofer IAIS, dpa, Neofonie and Deutsche Welle. Funded by the German Federal Ministry of Education and ResearchResearch tool for journalists that offers real time analysis of diverse news streams by means of big data technology.

Data Sources

InstitutionDatasetDescription
Austrian Research Institute for Artificial IntelligenceOne Million Posts Corpus“One Million Posts” corpus is an annotated data set consisting of user comments posted to an Austrian newspaper website (in German language).
Simon Fraser UniversitySFU Opinion and Comments CorpusThe SFU Opinion and Comments Corpus (SOCC) is a corpus for the analysis of online news comments. It contains comments and the articles from which the comments originated. The articles are all opinion articles, not hard news articles. The corpus is larger than any other currently available comments corpora, and has been collected with attention to preserving reply structures and other metadata. In addition to the raw corpus, they also present annotations for four different phenomena: constructiveness, toxicity, negation and its scope, and appraisal.