This function selects subset of rows from data set. This is useful if data is large and we need just a sample to calculate profiles.
select_sample(data, n = 100, seed = 1313)
data | set of observations. Profile will be calculated for every observation (every row) |
---|---|
n | named list of vectors. Elements of the list are vectors with points in which profiles should be calculated. See an example for more details. |
seed | seed for random number generator. |
a data frame with selected rows
Note that select_subsample
function is S3 generic.
If you want to work on non standard data sources (like H2O ddf, external databases)
you should overload it.
library("DALEX2") small_apartments <- select_sample(apartments_test) head(small_apartments)#> m2.price construction.year surface floor no.rooms district #> 8946 2174 1959 123 8 4 Wola #> 4458 4319 1927 68 8 2 Ochota #> 7384 5501 1929 95 5 3 Srodmiescie #> 5450 2810 1982 124 10 5 Ochota #> 6744 1770 1982 143 9 6 Ursynow #> 6688 2796 1938 75 7 3 Wola