We demonstrate that Search Engine Optimization (SEO) attributes provide strong signals for predicting news site reliability. We introduce a novel attributed webgraph dataset with labeled news domains and their connections to outlinking and backlinking domains. Finally, we introduce and evaluate a novel graph-based algorithm for discovering previously unknown misinformation news sources.
This dataset is provided courtesy of Ahrefs.com. The associated paper is upcoming at ICWSM 2024.
Carragher, P., Williams, E., & Carley, K. (2024). Detection and Discovery of Misinformation Sources using Attributed Webgraphs. arXiv preprint arXiv:2401.02379.