StackOverflow2013

plurals

POFuzzy Template matching?
primarykey
Id
14792103
data
AcceptedAnswerId
14799188
AnswerCount
2
ClosedDate
CommentCount
5
CommunityOwnedDate
CreationDate
2013-02-09T22:03:14.960
FavoriteCount
17
LastActivityDate
2015-02-27T13:40:24.677
LastEditDate
2013-02-10T04:13:30.543
LastEditorUserId
550900
OwnerUserId
550900
ParentId
0
PostTypeId
1
Score
14
ViewCount
7978
LastEditorDisplayName
text
Body
I'm attempting to wrap my head around the basics of CV. The bit that initially got me interested was template matching (it was mentioned in a PyCon talk unrelated to CV), so I figured I'd start there.

I started with this image: <img src="https://i.stack.imgur.com/cn7PB.jpg" alt="Scene from SMB3"> I want to detect Mario in it, so I cut him out: <img src="https://i.stack.imgur.com/auXZU.png" alt="The Plumber">

I understand the concept of sliding the template around the image to find the best fit, and by following a tutorial I'm able to find Mario with the following code:
<pre><code>import time
import cv  # legacy OpenCV 2.x bindings (assumed; the snippet as posted omitted its imports)

def match_template(img, template):
    s = time.time()
    img_size = cv.GetSize(img)
    template_size = cv.GetSize(template)
    # The result map holds one score per valid template position.
    img_result = cv.CreateImage((img_size[0] - template_size[0] + 1,
                                 img_size[1] - template_size[1] + 1),
                                cv.IPL_DEPTH_32F, 1)
    cv.Zero(img_result)
    cv.MatchTemplate(img, template, img_result, cv.CV_TM_CCORR_NORMED)
    min_val, max_val, min_loc, max_loc = cv.MinMaxLoc(img_result)
    # inspect.getargspec(cv.MinMaxLoc)
    print min_val
    print max_val
    print min_loc
    print max_loc
    # Draw a box around the best-scoring location.
    cv.Rectangle(img, max_loc,
                 (max_loc[0] + template.width, max_loc[1] + template.height),
                 cv.Scalar(120.), 2)
    print time.time() - s
    cv.NamedWindow("Result")
    cv.ShowImage("Result", img)
    cv.WaitKey(0)
    cv.DestroyAllWindows()
</code></pre>
So far so good, but then I came to realize that this is incredibly fragile. It will only ever find Mario with that specific background, and with that specific animation frame being displayed.

So I'm curious: given that Mario will always have the same Mario-ish attributes (size, colors), is there a technique with which I could find him regardless of whether his current frame is standing still or one of the various run-cycle sprites? Kind of like the fuzzy matching that you can do on strings, but for images. Maybe, since he's the only red thing, there is a way of simply tracking the red pixels?

The whole other issue is removing the background from the template. 
Maybe that would help the MatchTemplate function find Mario even though he doesn't exactly match the template? As of now, I'm not entirely sure how that would work (I see that there is a mask param in MatchTemplate, but I'll have to investigate further).

My main question is whether template matching is the way to go about detecting an image that is mostly the same but varies (like when he's walking), or whether there is another technique I should look into.

<h2>Update:</h2> <h2>Attempts at matching other Marios</h2> <hr>
Going off of mmgp's suggestion that it should be workable for matching other things, I ran a couple of tests. I used this as the template to match: <img src="https://i.stack.imgur.com/EYs9B.png" alt="Super mario">

I then took a couple of screenshots to test the matching against. For the first, I successfully find Mario and get a max value of 1. <img src="https://i.stack.imgur.com/RyYor.png" alt="enter image description here">

However, trying to find jumping Mario results in a complete misfire. <img src="https://i.stack.imgur.com/zBL1Y.png" alt="Misfire"> Now granted, the Mario in the template and the Mario in the scene are facing opposite directions, as well as being different animation frames, but I would think they still match a lot more than anything else in the image -- if only for the colors alone. Instead, it targets the platform as being the closest match to the template. Note that the max value for this one was <code>0.728053808212</code>.

Next I tried a scene without Mario to see what would happen. <img src="https://i.stack.imgur.com/szqDb.png" alt="enter image description here"> Oddly enough, I get exactly the same result as for the image with jumping Mario -- right down to the similarity value: <code>0.728053808212</code>. The match with Mario in the picture is no better than the match without him. Really strange! 
I don't know the details of the underlying algorithm, but I'd imagine that, from a standard-deviation perspective, the boxes in the scene that at least match the red in template Mario's suit would be closer to the mean distance than a blue platform, right? So it's extra confusing that the match isn't even in the general area where I would expect it to be. I'm guessing this is user error on my end, or maybe just a misunderstanding. Why would a scene with a similar Mario match just as well as a scene with no Mario at all? 
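To make the scoring question concrete, here is a minimal NumPy sketch of the two normalized scores as I understand them from the OpenCV documentation: `CV_TM_CCORR_NORMED` compares raw pixel values, while `CV_TM_CCOEFF_NORMED` subtracts each window's mean first. The 4x4 arrays and the helper names `ccorr_normed` and `ccoeff_normed` are made up for illustration. This is not OpenCV's implementation and not the images above; it only hints at one possible reason a featureless platform can score highly.

```python
import numpy as np


def ccorr_normed(patch, template):
    """Normalized cross-correlation of raw pixel values (cosine similarity)."""
    p = patch.astype(np.float64).ravel()
    t = template.astype(np.float64).ravel()
    denom = np.sqrt(np.sum(p * p) * np.sum(t * t))
    return float(np.sum(p * t) / denom) if denom > 0 else 0.0


def ccoeff_normed(patch, template):
    """Same score after subtracting each window's own mean."""
    p = patch.astype(np.float64).ravel()
    t = template.astype(np.float64).ravel()
    p = p - p.mean()
    t = t - t.mean()
    denom = np.sqrt(np.sum(p * p) * np.sum(t * t))
    # A perfectly flat window has zero variance; report 0.0 instead of dividing by zero.
    return float(np.sum(p * t) / denom) if denom > 0 else 0.0


# Hypothetical grayscale "sprite": bright pixels (200) on a dark background (60).
template = np.array([[200,  60,  60,  60],
                     [200, 200,  60,  60],
                     [200, 200, 200,  60],
                     [200, 200, 200, 200]], dtype=np.uint8)

# A different "animation frame": the same sprite with two pixels changed.
other_frame = np.array([[200, 200,  60,  60],
                        [200, 200,  60,  60],
                        [200, 200, 200,  60],
                        [200, 200, 200,  60]], dtype=np.uint8)

# A featureless, uniformly bright region, standing in for a platform.
flat_patch = np.full((4, 4), 180, dtype=np.uint8)

scores = {
    "ccorr_other_frame": ccorr_normed(other_frame, template),
    "ccorr_flat": ccorr_normed(flat_patch, template),
    "ccoeff_other_frame": ccoeff_normed(other_frame, template),
    "ccoeff_flat": ccoeff_normed(flat_patch, template),
}

for name, value in scores.items():
    print(name, round(value, 4))
```

With these toy values, the raw-correlation score for the flat patch (about 0.91) is close to the score for the near-identical frame (about 0.95), whereas the mean-subtracted score separates them widely (about 0.73 versus 0.0). If something similar happens in the real scenes, the same platform window could be the maximum whether or not Mario is present, which would also produce identical max values.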
Tags
<opencv><computer-vision>
Title
Fuzzy Template matching?
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USZack
UserOwnerUserId
1. USZack
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POFuzzy Template matching?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POFuzzy Template matching?
 UserUserId
 USYXD
 VoteTypeVoteTypeId
 VTFavorite
3. VO
 singulars
 PostPostId
 POFuzzy Template matching?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
