DM: FYI: prediction accuracy of classification trees

(Note: specially edited to preserve columns, despite scrolling. /Dorothy Firsching)

From: Tjen-Sien Lim
Date: Thu, 7 Jan 1999 22:50:52 -0500 (EST)

I've completed the prediction accuracy part of my review. The datasets used in the experiment can be downloaded from http://www.stat.wisc.edu/~limt/mv.html

Note that since there's a design flaw in the prediction part of SPSS AnswerTree, the results for AnswerTree QUEST don't reflect the true predictive performance of the QUEST algorithm.
-------------------------------------------------------------------------------------------------------------------
            Plurality           ARCed    Bagged   KnowledgeSEEKER  See5    Boosted    See5    Boosted    AnswerTree
Datasets    Rule       CART(r)  CART(r)  CART(r)  (EXHAUSTIVE)     Tree    See5 Tree  Rule    See5 Rule  (QUEST)
-------------------------------------------------------------------------------------------------------------------
adt         .236       .142     .155     .151     .174             .149    .150       .148    .152       .172
att         .496       .394     .397     .396     .377             .398    .396       .394    .383       .461
ban         .422       .322     .195     .217     .329             .259    .200       .254    .172       .290
bcw         .345       .0586    .0344    .0459    .0466            .0644   .0372      .0602   .0373      .0759
bio         .359       .164     .139     .139     .174             .144    .134       .144    .135       .176
bld         .419       .334     .293     .288     .390             .316    .278       .309    .269       .420
bos         .657       .256     .218     .212     .292             .229    .202       .223    .210       .267
bpr         .397       .313     .251     .218     .409             .339    .254       .342    .237       .414
cmc         .573       .447     .500     .479     .466             .488    .492       .481    .484       .476
crx         .445       .149     .135     .138     .166             .146    .139       .146    .129       .162
der         .694       .0460    .0354    .0319    .0354            .0680   .0192      .0674   .0217      .255
ech         .328       .358     .349     .351     .328             .378    .357       .378    .349       .348
edu         .461       .436     .429     .423     .445             .454    .456       .424    .445       .477
hab         .265       .261     .334     .308     .278             .288    .268       .288    .265       .304
hco         .369       .169     .185     .163     .137             .160    .163       .163    .161       .313
hea         .459       .221     .204     .211     .214             .281    .195       .256    .191       .254
hep         .206       .233     .174     .175     .246             .188    .155       .176    .161       .742
hin         .491       .279     .300     .290     .302             .293    .258       .281    .256       .437
hyp         .0477      .00727   .0126    .00980   .0136            .00759  .00885     .00759  .00917     .0177
imp         .672       .233     .142     .179     .369             .225    .139       .237    .164       .410
pid         .349       .245     .249     .238     .284             .256    .248       .249    .241       .262
usn         .660       .279     .232     .236     .341             .286    .235       .283    .243       .301
tae         .656       .365     .352     .352     .497             .503    .477       .503    .551       .558
Means:
Error Rate             .248     .231     .228     .275             .257    .229       .253    .229       .330
Rank                   4.85     4.37     3.76     6.26             6.13    3.37       5.35    2.96       7.96
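As a rough check on how the Means rows relate to the per-dataset figures, the following Python sketch recomputes the mean error rate and mean rank for each method from the error rates listed in the table. Some things here are assumptions rather than anything stated in the post: the column order (plain CART first, then ARCed and Bagged CART), the exclusion of the Plurality Rule column from the means, and the use of average ranks for ties. The helper name `average_ranks` is made up for illustration. Because the published error rates are rounded, recomputed ranks may differ slightly from the posted ones.

```python
# Error rates copied from the table (Plurality Rule column left out, as the
# Means rows appear to do). Column order is an assumption read from the header.
METHODS = ["CART", "ARCed CART", "Bagged CART", "KnowledgeSEEKER",
           "See5 Tree", "Boosted See5 Tree", "See5 Rule",
           "Boosted See5 Rule", "AnswerTree QUEST"]

ERRORS = {
    "adt": [.142, .155, .151, .174, .149, .150, .148, .152, .172],
    "att": [.394, .397, .396, .377, .398, .396, .394, .383, .461],
    "ban": [.322, .195, .217, .329, .259, .200, .254, .172, .290],
    "bcw": [.0586, .0344, .0459, .0466, .0644, .0372, .0602, .0373, .0759],
    "bio": [.164, .139, .139, .174, .144, .134, .144, .135, .176],
    "bld": [.334, .293, .288, .390, .316, .278, .309, .269, .420],
    "bos": [.256, .218, .212, .292, .229, .202, .223, .210, .267],
    "bpr": [.313, .251, .218, .409, .339, .254, .342, .237, .414],
    "cmc": [.447, .500, .479, .466, .488, .492, .481, .484, .476],
    "crx": [.149, .135, .138, .166, .146, .139, .146, .129, .162],
    "der": [.0460, .0354, .0319, .0354, .0680, .0192, .0674, .0217, .255],
    "ech": [.358, .349, .351, .328, .378, .357, .378, .349, .348],
    "edu": [.436, .429, .423, .445, .454, .456, .424, .445, .477],
    "hab": [.261, .334, .308, .278, .288, .268, .288, .265, .304],
    "hco": [.169, .185, .163, .137, .160, .163, .163, .161, .313],
    "hea": [.221, .204, .211, .214, .281, .195, .256, .191, .254],
    "hep": [.233, .174, .175, .246, .188, .155, .176, .161, .742],
    "hin": [.279, .300, .290, .302, .293, .258, .281, .256, .437],
    "hyp": [.00727, .0126, .00980, .0136, .00759, .00885, .00759, .00917, .0177],
    "imp": [.233, .142, .179, .369, .225, .139, .237, .164, .410],
    "pid": [.245, .249, .238, .284, .256, .248, .249, .241, .262],
    "usn": [.279, .232, .236, .341, .286, .235, .283, .243, .301],
    "tae": [.365, .352, .352, .497, .503, .477, .503, .551, .558],
}


def average_ranks(values):
    """Rank values ascending (1 = lowest error); tied values share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1  # mean of the 1-based positions pos+1..end+1
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


rows = list(ERRORS.values())
n = len(rows)
mean_error = [sum(row[j] for row in rows) / n for j in range(len(METHODS))]
rank_rows = [average_ranks(row) for row in rows]
mean_rank = [sum(r[j] for r in rank_rows) / n for j in range(len(METHODS))]

for name, err, rank in zip(METHODS, mean_error, mean_rank):
    print(f"{name:20s} mean error {err:.3f}   mean rank {rank:.2f}")
```

With nine methods ranked on every dataset, the mean ranks should add up to 45, which is a quick sanity check on any recomputation.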
Multiple comparisons:
1. Two methods are statistically significantly different at the 10% simultaneous level when their mean error rates differ by at least 0.0424.
2. Two methods are statistically significantly different at the 10% simultaneous level when their mean ranks differ by at least 2.33.
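The two rules above can be applied mechanically to the posted means. The Python sketch below lists every pair of methods whose means differ by at least the stated threshold. The method labels, their order (plain CART first), and the helper name `significant_pairs` are assumptions made for illustration. The means are the rounded values from the Means rows, so a pair sitting very close to a threshold could come out differently with unrounded figures.

```python
from itertools import combinations

# Rounded means copied from the Means rows; labels and order are assumptions.
MEAN_ERROR = {
    "CART": .248, "ARCed CART": .231, "Bagged CART": .228,
    "KnowledgeSEEKER": .275, "See5 Tree": .257, "Boosted See5 Tree": .229,
    "See5 Rule": .253, "Boosted See5 Rule": .229, "AnswerTree QUEST": .330,
}
MEAN_RANK = {
    "CART": 4.85, "ARCed CART": 4.37, "Bagged CART": 3.76,
    "KnowledgeSEEKER": 6.26, "See5 Tree": 6.13, "Boosted See5 Tree": 3.37,
    "See5 Rule": 5.35, "Boosted See5 Rule": 2.96, "AnswerTree QUEST": 7.96,
}


def significant_pairs(means, threshold):
    """Return the method pairs whose means differ by at least `threshold`."""
    return [(a, b) for a, b in combinations(means, 2)
            if abs(means[a] - means[b]) >= threshold]


err_pairs = significant_pairs(MEAN_ERROR, 0.0424)   # rule 1
rank_pairs = significant_pairs(MEAN_RANK, 2.33)     # rule 2

print("Differ on mean error rate:")
for a, b in err_pairs:
    print(f"  {a} vs {b}")
print("Differ on mean rank:")
for a, b in rank_pairs:
    print(f"  {a} vs {b}")
```

Any pair involving AnswerTree QUEST should be read with the caveat given earlier about the flaw in AnswerTree's prediction step.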
--
Tjen-Sien Lim (608) 262-8181
Dept. of Statistics limt@stat.wisc.edu
Univ. of Wisconsin-Madison http://www.stat.wisc.edu/~limt
1210 West Dayton Street
Madison, WI 53706