HOME Dataset, creation of artificial datasets Datasets Dataset commandsDataset download of real world datasets: prdatasets

Dataset download of real world datasets: prdatasets

On the PRTools website a number of example datasets has been made available. They can easily be downloaded by the command

    prdatasets

This command just checks whether a directory prdatasets is already in the path. If it is, the contents are displayed, if not the user is given the opportunity to download the prdatasets directory and the related m-files. Every dataset has its own command to load it. If it is not yet available in the prdatasets directory it is downloaded from the website.

Some of these datasets are made available by the Delft Pattern Recognition Laboratory, others are the PRTools versions of public domain datasets. As far as possible the origin, links and references are given in the help files of the commands that load them.

In the below table a number available datasets is listed. The set may be extended.

> Commands in prdatasets for loading datasets in PRTools format
name # objects # features # classes description
x80 45 8 3 radial distances of characters
arrhythmia 420 278 2 presence or absence of cardia arrhythmia
auto_mpg 398 6 2 Car/miles-per-gallon
malaysia 291 8 20 segment features in utility symbols
biomed 194 5 2
breast 683 9 2 Wisconsion breast cancer dataset
cbands 12000 30 24 chromosome banding patterns
chromo 1143 8 24 chromosome blob features
diabetes 768 8 2 Pima Indians Diabetes Database
ecoli 272 7 3 protein localisation sites
glass 214 9 4 glass types from chemical components
heart 297 13 2 heart disease dataset
imox 192 8 4 radial distances of characters
ionosphere 351 34 2
iris 150 4 3 Fisher's Iris dataset
liver 345 6 2 liver disorder
satellite 6435 36 6
sonar 208 60 2 rock / metal sonar features
soybean1 266 35 19 large Soybeans
soybean2 136 35 4 small Soybeans
twonorm 7400 20 2 Leo Breiman's two normal example.
ringnorm 7400 20 2 Leo Breiman's ringnorm example.
wine 178 13 3 wine recognition
mfeat_fac 2000 216 10 Face features in digits dataset
mfeat_fou 2000 76 10 Fourier features in digits dataset
mfeat_kar 2000 64 10 Karhunen Loeve features in digits dataset
mfeat_pix 2000 240 10 Pixel features in digits dataset
mfeat_zer 2000 53 10 Zernike moments in digits dataset
mfeat_mor 2000 6 10 Morphological features in digits dataset
Multi-band images (pixels are objects, bands are features)
emim31 128*128 8 1 8-band EM image
emim32 128*128 8 1 8-band EM image
emim33 128*128 8 1 8-band EM image
emim34 128*128 8 1 8-band EM image
emim37 256*256 8 1 8-band EM image
lena 256*256 3 1 full-color image
texturel 5*128*128 7 5 texture features for 5 different texture images
texturet 256*256 7 5 composite texture image
Image datasets (pixels are features, images are objects)
kimia 216 64*64 18 resampled (64*64) Kimia dataset of silhouettes
nist32 5000 32*32 10 Resampled Nist digits
nist16 2000 16*16 10 Normalized Nist digits


R.P.W. Duin, January 28, 2013


HOME Dataset, creation of artificial datasets Datasets Dataset commandsDataset download of real world datasets: prdatasets