StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POClassifiying a set of Images into Classes
primarykey
Id
16302127
data
AcceptedAnswerId
16457320
AnswerCount
4
ClosedDate
CommentCount
2
CommunityOwnedDate
CreationDate
2013-04-30T14:16:50.607
FavoriteCount
0
LastActivityDate
2016-12-21T10:40:45.727
LastEditDate
2013-05-05T13:45:31.560
LastEditorUserId
419497
OwnerUserId
419497
ParentId
0
PostTypeId
1
Score
11
ViewCount
2082
LastEditorDisplayName
text
Body
I have the problem that I get a set of pictures and need to classify those. The thing is, i do not really have any knowledge of these images. So i plan on using as many descriptors as I can find and then do a PCA on those to identify only the descriptors that are of use to me. I can do supervised learning on a lot of datapoints, if that helps. However there is a chance that pictures are connected to each other. Meaning there could be a development from Image X to Image X+1, although I kinda hope this gets sorted out with the information in each Image. My question are: <ol> <li>How do i do this best when using Python? (I want to make a proof of concept first where speed is a non-issue). What libraries should i use? </li> <li>Are there examples already for an image Classification of such a kind? Example of using a bunch of descriptors and cooking them down via PCA? This part is kinda scary for me, to be honest. Although I think python should already do something like this for me.</li> </ol> Edit: I have found a neat kit that i am currently trying out for this: <a href="http://scikit-image.org/" rel="noreferrer">http://scikit-image.org/</a> There seem to be some descriptors in there. Is there a way to do automatic feature extraction and rank the features according to their descriptive power towards the target classification? PCA should be able to rank automatically. Edit 2: I have my framework for the storage of the data now a bit more refined. I will be using the Fat system as a database. I will have one folder for each instance of a combination of classes. So if an image belongs to class 1 and 2, there will be a folder img12 that contains those images. This way i can better control the amount of data i have for each class. Edit 3: I found an example of a libary (sklearn) for python that does some sort of what i want to do. it is about recognizing hand-written digits. I am trying to convert my dataset into something that i can use with this. here is the example i found using sklearn: <pre><code>import pylab as pl # Import datasets, classifiers and performance metrics from sklearn import datasets, svm, metrics # The digits dataset digits = datasets.load_digits() # The data that we are interested in is made of 8x8 images of digits, # let's have a look at the first 3 images, stored in the `images` # attribute of the dataset. If we were working from image files, we # could load them using pylab.imread. For these images know which # digit they represent: it is given in the 'target' of the dataset. for index, (image, label) in enumerate(zip(digits.images, digits.target)[:4]): pl.subplot(2, 4, index + 1) pl.axis('off') pl.imshow(image, cmap=pl.cm.gray_r, interpolation='nearest') pl.title('Training: %i' % label) # To apply an classifier on this data, we need to flatten the image, to # turn the data in a (samples, feature) matrix: n_samples = len(digits.images) data = digits.images.reshape((n_samples, -1)) # Create a classifier: a support vector classifier classifier = svm.SVC(gamma=0.001) # We learn the digits on the first half of the digits classifier.fit(data[:n_samples / 2], digits.target[:n_samples / 2]) # Now predict the value of the digit on the second half: expected = digits.target[n_samples / 2:] predicted = classifier.predict(data[n_samples / 2:]) print("Classification report for classifier %s:\n%s\n" % (classifier, metrics.classification_report(expected, predicted))) print("Confusion matrix:\n%s" % metrics.confusion_matrix(expected, predicted)) for index, (image, prediction) in enumerate( zip(digits.images[n_samples / 2:], predicted)[:4]): pl.subplot(2, 4, index + 5) pl.axis('off') pl.imshow(image, cmap=pl.cm.gray_r, interpolation='nearest') pl.title('Prediction: %i' % prediction) pl.show() </code></pre>
Tags
<python><image><classification><descriptor><pca>
Title
Classifiying a set of Images into Classes
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. UStarrasch
UserOwnerUserId
1. UStarrasch
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POClassifiying a set of Images into Classes
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTDownMod
2. VO
 singulars
 PostPostId
 POClassifiying a set of Images into Classes
 UserUserId
 UStarrasch
 VoteTypeVoteTypeId
 VTBountyStart
3. VO
 singulars
 PostPostId
 POClassifiying a set of Images into Classes
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. COAnd what did you try so far? Show some effort mate.
 singulars
 PostPostId
 POClassifiying a set of Images into Classes
 UserUserId
 USTymoteusz Paul
2. COi will edit the stuff in that i accomplish so far.
 singulars
 PostPostId
 POClassifiying a set of Images into Classes
 UserUserId
 UStarrasch

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.