![]() |
|
![]() |
![]() |
|
![]() |
![]() |
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Subscribe]
Re: DM: Standard test suites/benchmarkingFrom: ragrawal Date: Wed, 5 Nov 1997 15:40:03 -0500 (EST)
Checkout the Quest website:
http://www.almaden.ibm.com/cs/quest
for some synthetic data generation programs.
Cheers
/rakesh
ejs2c@watt.seas.virginia.edu on 11/03/97 10:36:49 AM
Please respond to ejs2c@watt.seas.virginia.edu
To: datamine-l@nautilus-sys.com
cc: (bcc: Rakesh Agrawal/Almaden/IBM)
Subject: DM: Standard test suites/benchmarking
Hi folks... I'm looking for some advice,
I am in currently working on an undergraduate thesis project dealing
with
the evaluation of data mining tools. To evaluate these tools I would
be
interested to know if there is any collection of standard test suites
(data sets) which might be used. I have found the collection of data
sets
at www.kdnuggets.com. This listing is great, but overwhelming - It is
very hard to tell which of these data sets would be useful for
evaluating
software packages.
I would appreciate any advice on which data sets to use and where
to find these sets. The problems could be simple or complex, but I am
specifically interested in data sets which are representative of
certain "types" of statistical problems, or data sets which are
considered classic examples. Also, if I am looking in the wrong
direction
here, let me know as well.
Please respond with any hints or advice you might have on the data
sets I
might use for testing data mining tools.
Thank You in Advance,
Eric Schmidt
University of Virginia
Systems Engineering
Class of 1998
|
MHonArc 2.2.0