The german dataset contains information about people and their credit risk.
A data frame with 1000 rows and 10 variables:
factor, good/bad, the risk associated with granting the credit. This is the target variable that models should predict
factor, male/female, treated as the protected attribute
numeric, job titles converted to integers from 0 to 3, where 0 is unemployed/unskilled and 3 is management/self-employed/highly qualified employee/officer
factor, rent/own/free, the housing situation of the person
factor, little/moderate/quite rich/rich/not_known, savings account status, where not_known indicates a missing value (NA)
factor, little/moderate/rich/not_known, checking account status, where not_known indicates a missing value (NA)
numeric, amount of money in credit
numeric, duration of the credit in months
factor, purpose of credit
numeric, age of the person who applied for the credit
Data from Kaggle: https://www.kaggle.com/kabure/german-credit-data-with-risk/. The original source is the UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Statlog+(German+Credit+Data).
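
The descriptions above note that two factors use the level not_known to stand for a missing value. Many R functions only recognise a real NA, so it can help to convert that level before modelling. The following is a minimal sketch, not part of the dataset or its package: the helper name not_known_to_na and the toy data frame (including its column names and values) are made up for illustration.

```r
# Hypothetical helper: replace a marker level (default "not_known")
# with a real NA in every factor column of a data frame.
not_known_to_na <- function(df, marker = "not_known") {
  for (col in names(df)) {
    x <- df[[col]]
    if (is.factor(x)) {
      # which() ignores existing NAs, so they are left untouched
      x[which(x == marker)] <- NA
      # drop the now-unused marker level
      df[[col]] <- droplevels(x)
    }
  }
  df
}

# Toy data with invented column names and values, for illustration only
toy <- data.frame(
  savings  = factor(c("little", "not_known", "rich")),
  checking = factor(c("not_known", "moderate", "little")),
  amount   = c(1000, 2500, 400)
)

cleaned <- not_known_to_na(toy)

# Count missing values per column after the conversion
colSums(is.na(cleaned))
```

Whether to convert is a modelling choice: keeping not_known as its own level lets a model treat "no information" as a category, while converting to NA makes the missingness explicit for functions that handle NA directly.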