UC Irvine
ML Repository
Theme

Online News Popularity

Download(7.1 MB)

About

This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social networks (popularity). * The articles were published by Mashable (www.mashable.com) and their content as the rights to reproduce it belongs to them. Hence, this dataset does not share the original content but some statistics associated with it. The original content be publicly accessed and retrieved using the provided urls. * Acquisition date: January 8, 2015 * The estimated relative performance values were estimated by the authors using a Random Forest classifier and a rolling windows as assessment method. See their article for more details on how the relative performance values were set.
Subject Area
Business
Instances
39,797
Features
61
Data Types
Multivariate
Tasks
Classification, Regression
Feature Types
Integer, Continuous

Features

NameRoleTypeUnitsMissing Values

Introductory Paper

A Proactive Intelligent Decision Support System for Predicting the Popularity of Online News
Kelwin Fernandes, Pedro Vinagre, P. Cortez. 2015.
Portuguese Conference on Artificial Intelligence

Additional Metadata

Keywords
–
Authors
Kelwin Fernandes
Pedro Vinagre
Paulo Cortez
Pedro Sernadela
Year Created
2015
License
CC BY 4.0