Random classification forest.
Calls ranger::ranger() from package ranger.
Custom mlr3 parameters
mtry: This hyperparameter can alternatively be set via our hyperparameter mtry.ratio as mtry = max(ceiling(mtry.ratio * n_features), 1). Note that mtry and mtry.ratio are mutually exclusive.
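The conversion above can be sketched in plain R; the values of n_features and mtry.ratio below are arbitrary example inputs, not defaults:

```r
# Sketch of the mtry.ratio -> mtry conversion described above.
n_features = 10   # number of features in the task (example value)
mtry.ratio = 0.5  # example ratio

# mtry = max(ceiling(mtry.ratio * n_features), 1)
mtry = max(ceiling(mtry.ratio * n_features), 1)
print(mtry)  # 5

# the max(..., 1) clamp guarantees at least one candidate feature per split
print(max(ceiling(0.01 * n_features), 1))  # 1
```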
Initial parameter values
num.threads: Actual default: 2, using two threads, while also respecting environment variable R_RANGER_NUM_THREADS, options(ranger.num.threads = N), or options(Ncpus = N), with precedence in that order.
Adjusted value: 1.
Reason for change: Conflicting with parallelization via future.
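If no future-based parallelization is in place, the adjusted single-thread default can be overridden through the learner's parameter set; a minimal sketch (the value 4 is an arbitrary example):

```r
library(mlr3)
library(mlr3learners)

learner = lrn("classif.ranger")
# raise the thread count from the adjusted default of 1 (example value: 4)
learner$param_set$set_values(num.threads = 4)
```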
Dictionary
This mlr3::Learner can be instantiated via the dictionary mlr3::mlr_learners or with the associated sugar function mlr3::lrn():
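```r
# via the dictionary
mlr_learners$get("classif.ranger")
# or with the sugar function
lrn("classif.ranger")
```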
Meta Information
Task type: “classif”
Predict Types: “response”, “prob”
Feature Types: “logical”, “integer”, “numeric”, “character”, “factor”, “ordered”
Required Packages: mlr3, mlr3learners, ranger
Parameters
| Id | Type | Default | Levels | Range |
| --- | --- | --- | --- | --- |
| always.split.variables | untyped | - | - | - |
| class.weights | untyped | NULL | - | - |
| holdout | logical | FALSE | TRUE, FALSE | - |
| importance | character | - | none, impurity, impurity_corrected, permutation | - |
| keep.inbag | logical | FALSE | TRUE, FALSE | - |
| local.importance | logical | FALSE | TRUE, FALSE | - |
| max.depth | integer | NULL | - | \([1, \infty)\) |
| min.bucket | untyped | 1L | - | - |
| min.node.size | untyped | NULL | - | - |
| mtry | integer | - | - | \([1, \infty)\) |
| mtry.ratio | numeric | - | - | \([0, 1]\) |
| na.action | character | na.learn | na.learn, na.omit, na.fail | - |
| node.stats | logical | FALSE | TRUE, FALSE | - |
| num.random.splits | integer | 1 | - | \([1, \infty)\) |
| num.threads | integer | 1 | - | \([1, \infty)\) |
| num.trees | integer | 500 | - | \([1, \infty)\) |
| oob.error | logical | TRUE | TRUE, FALSE | - |
| regularization.factor | untyped | 1 | - | - |
| regularization.usedepth | logical | FALSE | TRUE, FALSE | - |
| replace | logical | TRUE | TRUE, FALSE | - |
| respect.unordered.factors | character | - | ignore, order, partition | - |
| sample.fraction | numeric | - | - | \([0, 1]\) |
| save.memory | logical | FALSE | TRUE, FALSE | - |
| scale.permutation.importance | logical | FALSE | TRUE, FALSE | - |
| seed | integer | NULL | - | \((-\infty, \infty)\) |
| split.select.weights | untyped | NULL | - | - |
| splitrule | character | gini | gini, extratrees, hellinger | - |
| verbose | logical | TRUE | TRUE, FALSE | - |
| write.forest | logical | TRUE | TRUE, FALSE | - |
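Parameters can also be set directly at construction via lrn(); the values below are illustrative choices, not defaults:

```r
library(mlr3)
library(mlr3learners)

# construct the learner with several hyperparameters set up front
# (num.trees, mtry.ratio, and importance values here are example choices)
learner = lrn("classif.ranger",
  num.trees  = 1000,
  mtry.ratio = 0.5,
  importance = "impurity"
)
```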
References
Wright, Marvin N., Ziegler, Andreas (2017). “ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R.” Journal of Statistical Software, 77(1), 1–17. doi:10.18637/jss.v077.i01.
Breiman, Leo (2001). “Random Forests.” Machine Learning, 45(1), 5–32. ISSN 1573-0565, doi:10.1023/A:1010933404324.
See also
Chapter in the mlr3book: https://mlr3book.mlr-org.com/chapters/chapter2/data_and_basic_modeling.html#sec-learners
Package mlr3extralearners for more learners.
as.data.table(mlr_learners) for a table of available Learners in the running session (depending on the loaded packages).
mlr3pipelines to combine learners with pre- and postprocessing steps.
Extension packages for additional task types:
mlr3proba for probabilistic supervised regression and survival analysis.
mlr3cluster for unsupervised clustering.
mlr3tuning for tuning of hyperparameters, mlr3tuningspaces for established default tuning spaces.
Other Learner:
mlr_learners_classif.cv_glmnet,
mlr_learners_classif.glmnet,
mlr_learners_classif.kknn,
mlr_learners_classif.lda,
mlr_learners_classif.log_reg,
mlr_learners_classif.multinom,
mlr_learners_classif.naive_bayes,
mlr_learners_classif.nnet,
mlr_learners_classif.qda,
mlr_learners_classif.svm,
mlr_learners_classif.xgboost,
mlr_learners_regr.cv_glmnet,
mlr_learners_regr.glmnet,
mlr_learners_regr.kknn,
mlr_learners_regr.km,
mlr_learners_regr.lm,
mlr_learners_regr.nnet,
mlr_learners_regr.ranger,
mlr_learners_regr.svm,
mlr_learners_regr.xgboost
Super classes
mlr3::Learner -> mlr3::LearnerClassif -> LearnerClassifRanger
Methods
Inherited methods
Method importance()
The importance scores are extracted from the model slot variable.importance.
The parameter importance must be set to "impurity", "impurity_corrected", or "permutation".
Returns
Named numeric().
Examples
# Define the Learner and set parameter values
learner = lrn("classif.ranger")
learner$param_set$set_values(importance = "permutation")
print(learner)
#>
#> ── <LearnerClassifRanger> (classif.ranger): Random Forest ──────────────────────
#> • Model: -
#> • Parameters: importance=permutation, num.threads=1
#> • Packages: mlr3, mlr3learners, and ranger
#> • Predict Types: [response] and prob
#> • Feature Types: logical, integer, numeric, character, factor, and ordered
#> • Encapsulation: none (fallback: -)
#> • Properties: hotstart_backward, importance, missings, multiclass, oob_error,
#> selected_features, twoclass, and weights
#> • Other settings: use_weights = 'use', predict_raw = 'FALSE'
# Define a Task
task = tsk("sonar")
# Create train and test set
ids = partition(task)
# Train the learner on the training ids
learner$train(task, row_ids = ids$train)
# Print the model
print(learner$model)
#> Ranger result
#>
#> Call:
#> ranger::ranger(dependent.variable.name = task$target_names, data = task$data(), probability = self$predict_type == "prob", importance = "permutation", num.threads = 1L)
#>
#> Type: Classification
#> Number of trees: 500
#> Sample size: 139
#> Number of independent variables: 60
#> Mtry: 7
#> Target node size: 1
#> Variable importance mode: permutation
#> Splitrule: gini
#> OOB prediction error: 19.42 %
# Importance method
print(learner$importance())
#> V12 V11 V10 V9 V36
#> 3.337599e-02 2.616688e-02 1.624741e-02 1.267319e-02 1.266523e-02
#> V49 V35 V37 V48 V51
#> 9.591239e-03 8.398474e-03 8.011852e-03 6.791566e-03 6.429335e-03
#> V47 V46 V5 V45 V13
#> 5.677990e-03 5.196811e-03 5.063672e-03 5.059326e-03 4.602567e-03
#> V34 V21 V28 V16 V8
#> 4.510314e-03 4.251763e-03 4.227416e-03 4.196607e-03 3.784386e-03
#> V4 V26 V22 V23 V31
#> 3.073572e-03 2.800287e-03 2.762688e-03 2.617181e-03 2.560049e-03
#> V18 V15 V2 V27 V44
#> 2.298490e-03 2.274618e-03 2.217058e-03 2.123583e-03 2.066563e-03
#> V30 V1 V24 V52 V38
#> 1.738636e-03 1.713017e-03 1.664384e-03 1.490453e-03 1.475323e-03
#> V17 V29 V32 V33 V6
#> 1.353299e-03 1.321075e-03 1.313594e-03 1.173741e-03 1.161417e-03
#> V56 V54 V42 V43 V14
#> 1.094181e-03 9.343498e-04 8.589654e-04 7.394226e-04 6.846147e-04
#> V3 V57 V25 V59 V53
#> 5.795359e-04 5.719396e-04 5.257730e-04 3.131463e-04 2.842464e-04
#> V55 V40 V7 V39 V20
#> 2.524405e-04 2.439645e-04 2.408485e-04 2.336106e-04 1.872417e-04
#> V19 V60 V41 V58 V50
#> 2.669592e-05 2.124530e-05 1.185959e-05 -3.304456e-04 -4.521934e-04
# Make predictions for the test rows
predictions = learner$predict(task, row_ids = ids$test)
# Score the predictions
predictions$score()
#> classif.ce
#> 0.173913