Zephyrnet Logo

Exploiting Categorical Structure Using Tree-Based Methods. (arXiv:2004.07383v1 [stat.ML])

Date:

[Submitted on 15 Apr 2020]

Download PDF

Abstract: Standard methods of using categorical variables as predictors either endow
them with an ordinal structure or assume they have no structure at all.
However, categorical variables often possess structure that is more complicated
than a linear ordering can capture. We develop a mathematical framework for
representing the structure of categorical variables and show how to generalize
decision trees to make use of this structure. This approach is applicable to
methods such as Gradient Boosted Trees which use a decision tree as the
underlying learner. We show results on weather data to demonstrate the
improvement yielded by this approach.

Submission history

From: Brian Lucena [view email]
[v1]
Wed, 15 Apr 2020 22:58:27 UTC (801 KB)

Source: http://arxiv.org/abs/2004.07383

spot_img

Latest Intelligence

spot_img