The adult
dataset consists of many columns containing various information
about the relationship, hours worked per week, workclass, etc... and about
salary, whether more than 50K a year or not. Lots of possible protected
attributes such as sex, race age. Some columns contain level "unknown" and
these values are not removed and removing them depends on the user as they might
contain some information.
data(adult)
A data frame with 32561 rows and 15 variables:
factor, <=50K/>50K whether a person salary exceeds 50K a year or not
integer, age of person
factor, field of work
numeric
factor, completed education degree
numeric, education number in converted from education factor, the bigger the better
factor
factor, where this person works
factor, relationship information
factor, ethnicity of a person
factor, gender of a person
numeric
numeric
numeric, how many hours per week does this person work
factor, in which country was this person born
Data from UCL https://archive.ics.uci.edu/ml/datasets/adult