Ceiling Effects
In fact, 1R* uses only one feature (the best one)
C4 uses 6.6 features on average
The extra 5.6 features buy only about 2% improvement
Conclusion?
- Either real world learning problems are easy (use 1R*)
- Or we need more challenging datasets
- We need to be aware of ceiling effects in results
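
To make the "one feature" point concrete, here is a minimal sketch of a one-rule learner in the spirit of 1R. It is an illustration under stated assumptions, not the procedure used in the comparison above: it handles categorical features only, scores rules on the training data, and the names `one_r` and `predict` and the toy data are hypothetical.

```python
from collections import Counter, defaultdict

def one_r(rows, labels):
    """Pick the single feature whose value -> majority-class rule
    makes the fewest errors on the training data (illustrative sketch)."""
    # Fallback class for feature values never seen during training.
    default = Counter(labels).most_common(1)[0][0]
    best = None
    for f in range(len(rows[0])):
        # Count class frequencies for each value of feature f.
        counts = defaultdict(Counter)
        for row, label in zip(rows, labels):
            counts[row[f]][label] += 1
        # Rule: each feature value predicts its majority class.
        rule = {v: c.most_common(1)[0][0] for v, c in counts.items()}
        errors = sum(rule[row[f]] != label for row, label in zip(rows, labels))
        if best is None or errors < best[3]:
            best = (f, rule, default, errors)
    return best

def predict(model, row):
    f, rule, default, _ = model
    return rule.get(row[f], default)

# Hypothetical toy data: feature 0 separates the classes, feature 1 does not.
rows = [("sunny", "hot"), ("sunny", "mild"), ("rainy", "hot"), ("rainy", "mild")]
labels = ["no", "no", "yes", "yes"]
model = one_r(rows, labels)
print("chosen feature:", model[0], "training errors:", model[3])
```

If a learner this simple already scores near the top on a dataset, a more elaborate method has little headroom left to show an advantage, which is the ceiling effect described above.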