Currently, the program works by reading in the log file which contains a list of coordinate-image lines to extract the pixels from a particular region of an image. For each line in the log file the program does the following:
- Compute the extraction region in the image.
- Convert the image from RGB to L*a*b*.
- Create 2 32-bin histograms, one for the 'a' channel and one for the 'b' channel (we discard the 'L' channel as it does not add much useful information in this case).
- Compute the histograms.
- Write out each histogram, bin by bin, to the resulting text file along with a 0 or a 1 to indicate if the region was a positive or negative training example.
My next step will be to train a support vector machine on this training data. Since I'm most familiar with SVMLight, I'm probably going to start there.
No comments:
Post a Comment