adult
dataset consists of many columns containing various information about relationship, hours worked per week, workclass etc... and about
salary, whether more than 50K a year or not. Lot's of possible protected attributes such as sex, race age. Some columns contain
level "unknown" and these values are not removed and removing them depends on user as they might contain some information.
data(adult)
A data frame with 32561 rows and 15 variables:
factor, <=50K/>50K whether a person salary exceeds 50K a year or not
integer, age of person
factor, field of work
numeric
factor, completed education degree
numeric, education number in converted from education factor, the bigger the better
factor
factor, where this person works
factor, relationship information
factor, ethnicity of a person
factor, gender of a person
numeric
numeric
numeric, how many hours per week does this person work
factor, in which country was this person born
Data from UCL https://archive.ics.uci.edu/ml/datasets/adult