Supporting data for "A novel ground truth multispectral image dataset with weight, anthocyanins and Brix index measures of grape berries tested for its utility in machine learning pipelines"
- Pedro J. Navarro
- Leanne Miller
- María Victoria Díaz-Galián
- Alberto Gila-Navarro
- Diego J. Aguila
- Marcos Egea-Cortines
Abstract
The combination of computer vision devices, such as multispectral cameras, with Artificial Intelligence has provided a major leap forward in image-based analysis of biological processes. Supervised Artificial Intelligence algorithms require large ground truth image datasets for model training, which make it possible to validate or refute research hypotheses and to compare models. However, public image datasets are scarce, and ground truth images are surprisingly few considering the numbers required for training algorithms.

We created a dataset of 1283 multidimensional arrays, using berries from five different grape varieties. Each array contains 37 images of a single berry, captured at wavelengths between 488.38 nm and 952.76 nm. Each multispectral image is paired with measurements of the weight, anthocyanin content, and Brix index of the same individual berry. The images therefore have paired measures, creating a ground truth dataset. We tested the dataset with two neural network algorithms: a multilayer perceptron (MLP) and a three-dimensional convolutional neural network (3D-CNN). Both the MLP and the 3D-CNN algorithms yielded a perfect classification model (100% accuracy).

This is the first public dataset of grape ground truth multispectral images. Each multispectral image is associated with measures of weight, anthocyanins, and Brix index. The dataset should be useful for developing deep learning algorithms for classification, dimensionality reduction, regression, and prediction analysis.
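
As a minimal sketch of how one of the arrays described above could be reduced to a feature vector for a model such as an MLP, the Python snippet below averages each of the 37 band images into a single value. The array layout `(bands, height, width)`, the 32x32 image size, and the function name `mean_spectrum` are assumptions made for illustration, and the data are synthetic; none of this is taken from the dataset files themselves.

```python
import numpy as np

N_BANDS = 37  # number of wavelength images per array, as stated in the abstract


def mean_spectrum(cube: np.ndarray) -> np.ndarray:
    """Collapse a (bands, height, width) cube to one mean intensity per band.

    ASSUMPTION: bands are stored along the first axis; the real files
    may use a different layout.
    """
    if cube.ndim != 3:
        raise ValueError("expected a 3-D array shaped (bands, height, width)")
    # Flatten the two spatial axes, then average the pixels of each band.
    return cube.reshape(cube.shape[0], -1).mean(axis=1)


# Synthetic stand-in for one berry; the 32x32 image size is hypothetical.
rng = np.random.default_rng(0)
cube = rng.random((N_BANDS, 32, 32))

features = mean_spectrum(cube)
print(features.shape)
```

A 37-value vector per berry is one simple input form for a dense network; a 3D-CNN would instead consume the full cube without this reduction.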