摘要 |
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting training images. One of the methods includes determining, for each of a plurality of labels that each designate a respective food class of a plurality of food classes, a respective measure of importance. A respective sample size is determined for the label based on the respective measure of importance of the label. A number of training images are selected for each respective label according to the determined sample size for the label. A predictive model is trained using the selected training images as training data. |
主权项 |
1. A computer-implemented method of classifying an image into a food class comprising:
for each of a plurality of labels that each designate a respective food class of a plurality of food classes, wherein each food class represents a different food item:
determining a respective measure of importance of the label of the plurality of labels, anddetermining a respective sample size for the label of the plurality of labels, wherein the sample size is based on the respective measure of importance of the label; determining, for each label of a subset of labels having smallest respective measures of importance, that a collection of labeled images includes fewer images having the label than a respective determined sample size for the label; selecting, from the collection of labeled images, for each respective label of the plurality of labels, a number of training images according to the determined sample size for the label, including selecting, for each label of the subset of labels having the smallest respective measures of importance, multiple instances of at least one image having the label; and training a predictive model using the selected training images as training data. |