StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POViola-Jones' face detection claims 180k features
primarykey
Id
1707620
data
AcceptedAnswerId
1711158
AnswerCount
5
ClosedDate
CommentCount
10
CommunityOwnedDate
CreationDate
2009-11-10T12:30:20.900
FavoriteCount
51
LastActivityDate
2018-04-21T00:28:33.017
LastEditDate
2014-04-18T00:55:29.217
LastEditorUserId
97160
OwnerUserId
154306
ParentId
0
PostTypeId
1
Score
71
ViewCount
20810
LastEditorDisplayName
text
Body
I've been implementing an adaptation of <a href="http://scholar.google.com/scholar?cluster=6119571473300502765" rel="noreferrer">Viola-Jones' face detection algorithm</a>. The technique relies upon placing a subframe of 24x24 pixels within an image, and subsequently placing rectangular features inside it in every position with every size possible. These features can consist of two, three or four rectangles. The following example is presented. <img src="https://i.stack.imgur.com/5MKl7.png" alt="Rectangle features"> They claim the exhaustive set is more than 180k (section 2): <blockquote> Given that the base resolution of the detector is 24x24, the exhaustive set of rectangle features is quite large, over 180,000 . Note that unlike the Haar basis, the set of rectangle features is overcomplete. </blockquote> The following statements are not explicitly stated in the paper, so they are assumptions on my part: <ol> <li>There are only 2 two-rectangle features, 2 three-rectangle features and 1 four-rectangle feature. The logic behind this is that we are observing the difference between the highlighted rectangles, not explicitly the color or luminance or anything of that sort.</li> <li>We cannot define feature type A as a 1x1 pixel block; it must at least be at least 1x2 pixels. Also, type D must be at least 2x2 pixels, and this rule holds accordingly to the other features.</li> <li>We cannot define feature type A as a 1x3 pixel block as the middle pixel cannot be partitioned, and subtracting it from itself is identical to a 1x2 pixel block; this feature type is only defined for even widths. Also, the width of feature type C must be divisible by 3, and this rule holds accordingly to the other features.</li> <li>We cannot define a feature with a width and/or height of 0. Therefore, we iterate x and y to 24 minus the size of the feature.</li> </ol> Based upon these assumptions, I've counted the exhaustive set: <pre><code>const int frameSize = 24; const int features = 5; // All five feature types: const int feature[features][2] = {{2,1}, {1,2}, {3,1}, {1,3}, {2,2}}; int count = 0; // Each feature: for (int i = 0; i < features; i++) { int sizeX = feature[i][0]; int sizeY = feature[i][1]; // Each position: for (int x = 0; x <= frameSize-sizeX; x++) { for (int y = 0; y <= frameSize-sizeY; y++) { // Each size fitting within the frameSize: for (int width = sizeX; width <= frameSize-x; width+=sizeX) { for (int height = sizeY; height <= frameSize-y; height+=sizeY) { count++; } } } } } </code></pre> The result is 162,336. The only way I found to approximate the "over 180,000" Viola & Jones speak of, is dropping assumption #4 and by introducing bugs in the code. This involves changing four lines respectively to: <pre><code>for (int width = 0; width < frameSize-x; width+=sizeX) for (int height = 0; height < frameSize-y; height+=sizeY) </code></pre> The result is then 180,625. (Note that this will effectively prevent the features from ever touching the right and/or bottom of the subframe.) Now of course the question: have they made a mistake in their implementation? Does it make any sense to consider features with a surface of zero? Or am I seeing it the wrong way?
Tags
<algorithm><image-processing><computer-vision><face-detection><viola-jones>
Title
Viola-Jones' face detection claims 180k features
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USAmro
UserOwnerUserId
1. USPaul Lammertsma
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
2. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
3. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
3. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POViola-Jones' face detection claims 180k features
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POViola-Jones' face detection claims 180k features
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POViola-Jones' face detection claims 180k features
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.