vignettes/novel-factor-levels.Rmd
Random forest example where the wrong thing happens because the factors are now treated as integers.
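The underlying coding problem can be sketched in base R without fitting a random forest at all. This is only an illustration of the mechanism, with made-up level names (`low`, `mid`, `high`, `very_high`) that are assumptions and not taken from any real data set: the integer code of a level depends on the level set of the factor it lives in, so the same code can mean different things in training and new data.

```r
# Training data: the level order is set explicitly.
train_f <- factor(c("low", "mid", "high"), levels = c("low", "mid", "high"))

# New data: contains a novel level, and factor() sorts levels alphabetically
# when no level order is given.
new_f <- factor(c("mid", "high", "very_high"))

# Integer codes follow each factor's own level order.
train_codes <- setNames(as.integer(train_f), as.character(train_f))
new_codes <- setNames(as.integer(new_f), as.character(new_f))

train_codes  # low = 1, mid = 2, high = 3
new_codes    # mid = 2, high = 1, very_high = 3
```

A model that only sees the integers would read the new `high` (code 1) as the training `low`, and the novel `very_high` (code 3) as the training `high`, without raising any error.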
If the model you want to work with can deal with new factor levels, please let it!
If you want to test whether method x will explode, do something like this:
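A minimal sketch of such a test, using `lm()` from base R as a stand-in for "method x". The toy data, the novel level `"d"`, and the variable names are assumptions made for illustration. The idea is to fit on training data, build new data with a level the model has never seen, and wrap `predict()` in `tryCatch()` so the failure is captured instead of stopping the session.

```r
# Toy training data with three known levels.
train <- data.frame(
  y = c(1.0, 2.1, 2.9, 4.2, 5.1, 5.8),
  f = factor(c("a", "a", "b", "b", "c", "c"))
)
fit <- lm(y ~ f, data = train)

# New data with a level ("d") that was not present at fit time.
new <- data.frame(f = factor(c("a", "d")))

# Capture the error message instead of letting predict() abort.
outcome <- tryCatch(
  predict(fit, newdata = new),
  error = function(e) conditionMessage(e)
)

outcome  # a character string if predict() failed, numeric predictions otherwise
```

For `lm()`, `predict()` stops with an error about the new factor level, so `outcome` holds the message. Swapping in another fitting function shows whether that method fails, warns, or silently predicts.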
…Here we see that things have broken. What are our options?
Convert the novel levels to missing data, if the prediction method can still make predictions with missing values.
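One way to sketch the missing-data option in base R: rebuild the new factor using only the levels seen at training time, so that anything novel becomes `NA`. The toy data and names are assumptions, and `lm()` is again only a stand-in for the prediction method. Whether this helps depends on how that method treats `NA` at prediction time.

```r
# Toy training data with three known levels.
train <- data.frame(
  y = c(1.0, 2.1, 2.9, 4.2, 5.1, 5.8),
  f = factor(c("a", "a", "b", "b", "c", "c"))
)
fit <- lm(y ~ f, data = train)

# Raw new values, including the unseen level "d".
new_raw <- c("a", "d")

# Re-create the factor with the training levels only; "d" becomes NA.
new <- data.frame(f = factor(new_raw, levels = fit$xlevels$f))

preds <- predict(fit, newdata = new)
preds  # a prediction for "a", NA for the row that held the novel level
```

Here `predict.lm()` returns `NA` for the affected row because its default `na.action` is `na.pass`. Other methods may drop the row or raise an error instead.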
The dummy variables vignette also says you can use integer encoding or `step_other()`. We don't recommend integer encoding. Example of how `step_other()` might work.
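A hedged sketch of the `step_other()` route, assuming the recipes package is installed. The toy data, the `threshold = 0.2` value, and the novel level `"e"` are assumptions chosen so that at least one training level is rare enough to be pooled. When pooling happens during `prep()`, levels in new data that were not kept are assigned to the pooled level as well.

```r
library(recipes)

# Toy training data: "c" and "d" each make up 10% of the rows.
train <- data.frame(
  y = 1:10,
  f = factor(c("a", "a", "a", "a", "b", "b", "b", "b", "c", "d"))
)

# New data containing a level ("e") that was never seen in training.
new <- data.frame(f = factor(c("a", "e")))

# Pool levels rarer than 20% of the training rows into "other".
rec <- recipe(y ~ f, data = train) %>%
  step_other(f, threshold = 0.2)

prepped <- prep(rec, training = train)
baked <- bake(prepped, new_data = new)

baked$f  # "a" is kept; the novel "e" is assigned to "other"
```

If no training level falls below the threshold, nothing is pooled and the step cannot absorb novel levels, so the threshold should be checked against the training frequencies.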
For additional details, please read the full vignette on dummy variables in the recipes
package.