german dataset. Contains information about credit applicants and their credit risk.

data(german)

Format

A data frame with 1000 rows and 10 variables:

Risk

factor, good/bad, the risk associated with granting the credit. This is the target variable that models should predict.

Sex

factor, male/female, considered to be the protected attribute

Job

numeric, job category encoded as an integer, where 0 = unemployed/unskilled and 3 = management/self-employed/highly qualified employee/officer

Housing

factor, rent/own/free, the housing situation of the applicant

Saving.accounts

factor, little/moderate/quite rich/rich/not_known, where not_known stands for a missing value (NA)

Checking.account

factor, little/moderate/rich/not_known, where not_known stands for a missing value (NA)

Credit.amount

numeric, amount of the credit

Duration

numeric, duration of the credit in months

Purpose

factor, purpose of credit

Age

numeric, age of the person who applied for the credit

Source

Data from Kaggle: https://www.kaggle.com/kabure/german-credit-data-with-risk. The original source is the UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Statlog+(German+Credit+Data).
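
Examples

The sketch below shows one way the columns described under Format might be used: it turns the not_known level of Saving.accounts and Checking.account into a real NA and tabulates Risk within each Sex group. The helper recode_not_known is hypothetical and not part of any package, and the toy data frame contains made-up rows that only mimic the documented columns so that the code runs on its own.

```r
# Hypothetical helper (an assumption, not a packaged function): replace the
# "not_known" level of the two account columns with a real NA.
recode_not_known <- function(df,
                             cols = c("Saving.accounts", "Checking.account")) {
  for (col in cols) {
    x <- df[[col]]
    x[x %in% "not_known"] <- NA      # mark placeholder entries as missing
    df[[col]] <- droplevels(x)       # drop the now-unused "not_known" level
  }
  df
}

# Made-up stand-in rows with the documented column names and levels.
toy <- data.frame(
  Risk             = factor(c("good", "bad", "good", "bad")),
  Sex              = factor(c("male", "female", "female", "male")),
  Saving.accounts  = factor(c("little", "not_known", "rich", "moderate")),
  Checking.account = factor(c("not_known", "little", "moderate", "not_known"))
)

toy_na <- recode_not_known(toy)

# Number of missing values per column after recoding.
colSums(is.na(toy_na))

# Share of each Risk level within each Sex group (each row sums to 1).
risk_by_sex <- prop.table(table(toy_na$Sex, toy_na$Risk), margin = 1)
risk_by_sex
```

With the real data, the same helper could be applied as recode_not_known(german) after data(german), assuming the package that ships the dataset is attached.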