Tree pruning useful in decision tree induction

Why is tree pruning useful in decision tree induction? What is a drawback of using a separate set of tuples to evaluate pruning?

The decision tree built may overfit the training data. There could be too many branches, some of which may reflect anomalies in the training data due to noise or outliers. Tree pruning addresses this issue of overfitting the data by removing the least reliable branches (using statistical measures). This generally results in a more compact and reliable decision tree that is faster and more accurate in its classification of data. The drawback of using a separate set of tuples to evaluate pruning is that it may not be representative of the training tuples used to create the original decision tree. If the separate set of tuples are skewed, then using them to evaluate the pruned tree would not be a good indicator of the pruned tree’s classification accuracy. Furthermore, using a separate set of tuples to evaluate pruning means there are less tuples to use for creation and testing of the tree. While this is considered a drawback in machine learning, it may not be so in data mining due to the availability of larger data sets.

Comments

  1. Hey mate your post is very informative for everyone. If anyone want to read about the tree pruning then mostly read this one article. tree pruning is very beneficial for trees health and our environment.

    ReplyDelete
  2. Branch Specialists Tree Service in Buffalo NY, stands as the premier choice for comprehensive tree care. Our team of expert arborists delivers top-notch services, including precise tree trimming, safe removals, and stump grinding, all tailored to your specific needs. With a commitment to quality and safety, we ensure customer satisfaction with every project. Contact Branch Specialists Tree Service for professional, reliable tree care that surpasses expectations in Buffalo, NY.

    ReplyDelete

Post a Comment

Popular posts from this blog

Machine Learning Syllabus