An Extension of CART's Pruning Algorithm CART
- Author(s):
- Kim, Sung Ho
- Publication Year:
- 1991
- Report Number:
- RR-91-34
- Source:
- ETS Research Report
- Document Type:
- Report
- Page Count:
- 29
- Subject/Key Words:
- Algorithms, Classification and Regression Trees Software (CART), Data Analysis, Graphical Analysis, Mathematical Logic, Tree Structures (Graphs)
Abstract
Among the computer-based methods used for the construction of trees such as AID, THAID, CART and FACT, the only one that uses an algorithm that first grows a tree and then prunes the tree is CART. The pruning component of CART is analogous in spirit to the backward elimination approach in regression analysis. This idea provides a tool in controlling the tree sizes to some extent and thus estimating the prediction error by the tree within a certain range of tree size. In the CART pruning process, Breiman, Friedman, Olshen, and Stone (1984) use a linear combination of the expected loss of the decisions by the tree and the total number of the terminal nodes of the tree. In this paper, CART's pruning is extended by considering a function of all the nodes of the tree in addition to the factors involved in the linear combination. For example, if we consider the cost of observing a variable at each node as is the main concern of this paper, or the structural complexity of the tree, we can see such an extension. (29pp.)
Read More
- Request Copy (specify title and report number, if any)
- http://dx.doi.org/10.1002/j.2333-8504.1991.tb01401.x