Student Performance
About
Predict student performance in secondary education (high school).
This data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features) and it was collected by using school reports and questionnaires. Two datasets are provided regarding the performance in two distinct subjects: Mathematics (mat) and Portuguese language (por). In [Cortez and Silva, 2008], the two datasets were modeled under binary/five-level classification and regression tasks. Important note: the target attribute G3 has a strong correlation with attributes G2 and G1. This occurs because G3 is the final year grade (issued at the 3rd period), while G1 and G2 correspond to the 1st and 2nd period grades. It is more difficult to predict G3 without G2 and G1, but such prediction is much more useful (see paper source for more details).
Subject Area
Social Science
Instances
649
Features
33
Data Types
Multivariate
Tasks
Classification, Regression
Feature Types
Integer
Features
Name | Role | Type | Units | Missing Values | Description |
---|---|---|---|---|---|
school | Feature | Categorical | - | No | |
sex | Feature | Binary | - | No | |
age | Feature | Integer | - | No | |
address | Feature | Categorical | - | No | |
famsize | Feature | Categorical | - | No | |
Pstatus | Feature | Categorical | - | No | |
Medu | Feature | Integer | - | No | |
Fedu | Feature | Integer | - | No | |
Mjob | Feature | Categorical | - | No | |
Fjob | Feature | Categorical | - | No | |
reason | Feature | Categorical | - | No | |
guardian | Feature | Categorical | - | No | |
traveltime | Feature | Integer | - | No | |
studytime | Feature | Integer | - | No | |
failures | Feature | Integer | - | No | |
schoolsup | Feature | Binary | - | No | |
famsup | Feature | Binary | - | No | |
paid | Feature | Binary | - | No | |
activities | Feature | Binary | - | No | |
nursery | Feature | Binary | - | No | |
higher | Feature | Binary | - | No | |
internet | Feature | Binary | - | No | |
romantic | Feature | Binary | - | No | |
famrel | Feature | Integer | - | No | |
freetime | Feature | Integer | - | No | |
goout | Feature | Integer | - | No | |
Dalc | Feature | Integer | - | No | |
Walc | Feature | Integer | - | No | |
health | Feature | Integer | - | No | |
absences | Feature | Integer | - | No | |
G1 | Target | Categorical | - | No | |
G2 | Target | Categorical | - | No | |
G3 | Target | Integer | - | No |
Introductory Paper
Using data mining to predict secondary school student performance
P. Cortez, A. M. G. Silva. 2008.
Proceedings of 5th Annual Future Business Technology Conference