StackOverflow2013


plurals

PO
primarykey
Id
6181478
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
2
CommunityOwnedDate
CreationDate
2011-05-30T23:05:20.460
FavoriteCount
0
LastActivityDate
2011-05-30T23:05:20.460
LastEditDate
2017-05-23T10:00:04.083
LastEditorUserId
-1
OwnerUserId
389051
ParentId
6178083
PostTypeId
2
Score
0
ViewCount
0
LastEditorDisplayName
text
Body
<p>I wasn't able to get the FVH to handle phrase queries correctly, and wound up having to develop my own summarizer. The gist of my approach is discussed <a href="https://stackoverflow.com/questions/5838542/matching-token-sequences" title="Matching Token Sequences | StackOverflow">here</a>. I created an array of objects, one for each term that I pulled from the queries. Each object contains a word index, its position, and a flag recording whether it was already used in some match. These are the <code>TermAtPosition</code> instances in the sample below. Then, given a position span and an array of word identities (indexes) corresponding to a phrase query, I iterated through the array, looking to match all term indexes within the given span. If I found a match, I marked each matching term as consumed and added the matching span to a list of matches. I could then use these matches to score sentences. Here is the matching code:</p>
<pre><code>protected void scorePassage(TermPositionVector v, String[] words, int span,
        float score, SentenceScore[] scores, Scorer scorer) {
    TermAtPosition[] order = getTermsInOrder(v, words);
    if (order.length < words.length)
        return;
    int positions[] = new int[words.length];
    List<int[]> matches = new ArrayList<int[]>();
    for (int t = 0; t < order.length; t++) {
        TermAtPosition tap = order[t];
        if (tap.consumed)
            continue;
        int p = 0;
        positions[p++] = tap.position;
        // look for each of the other phrase terms within the span after this term
        for (int u = 0; u < words.length; u++) {
            if (u == tap.termIndex)
                continue;
            int nextTermPos = spanContains(order, u, tap.position, span);
            if (nextTermPos == -1)
                break;
            positions[p++] = nextTermPos;
        }
        // got all terms
        if (p == words.length)
            matches.add(recordMatch(order, positions.clone()));
    }
    if (matches.size() > 0)
        for (SentenceScore sentenceScore : scores) {
            for (int[] matchingPositions : matches)
                scorer.scorePassage(sentenceScore, matchingPositions, score);
        }
}

// returns the position of the first unconsumed occurrence of targetWord
// in the range (start, start + span], or -1 if there is none
protected int spanContains(TermAtPosition[] order, int targetWord, int start, int span) {
    for (int i = 0; i < order.length; i++) {
        TermAtPosition tap = order[i];
        if (tap.consumed || tap.position <= start || (tap.position > start + span))
            continue;
        if (tap.termIndex == targetWord)
            return tap.position;
    }
    return -1;
}
</code></pre>
<p>This approach seems to work, but it is greedy. Given a sequence "a a b c" it will match the first a (leaving the second a alone), and then match b and c. I think a bit of recursion or integer programming could be applied to make it less greedy, but I couldn't be bothered, and wanted a faster rather than a more accurate algorithm anyway.</p>
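The greedy behavior described in the answer can be reproduced with a small self-contained sketch. It is not the author's full code: the `GreedySpanDemo` class, the `fromTermIndexes` helper, and the body of `recordMatch` are assumptions made for illustration (the answer does not show `recordMatch`; here it simply marks the matched terms as consumed, as the prose describes). The Lucene-specific parts (`TermPositionVector`, `Scorer`, `SentenceScore`) are left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class GreedySpanDemo {
    // Hypothetical stand-in for the answer's TermAtPosition.
    static final class TermAtPosition {
        final int termIndex;  // index of the word within the phrase
        final int position;   // token position in the text
        boolean consumed;     // true once the term belongs to a recorded match

        TermAtPosition(int termIndex, int position) {
            this.termIndex = termIndex;
            this.position = position;
        }
    }

    // Hypothetical helper: the i-th token gets position i.
    static TermAtPosition[] fromTermIndexes(int... termIndexes) {
        TermAtPosition[] order = new TermAtPosition[termIndexes.length];
        for (int i = 0; i < termIndexes.length; i++)
            order[i] = new TermAtPosition(termIndexes[i], i);
        return order;
    }

    // Same search as in the answer: first unconsumed targetWord in (start, start + span].
    static int spanContains(TermAtPosition[] order, int targetWord, int start, int span) {
        for (TermAtPosition tap : order) {
            if (tap.consumed || tap.position <= start || tap.position > start + span)
                continue;
            if (tap.termIndex == targetWord)
                return tap.position;
        }
        return -1;
    }

    // Assumed behavior: mark every term at a matched position as consumed.
    static int[] recordMatch(TermAtPosition[] order, int[] positions) {
        for (TermAtPosition tap : order)
            for (int pos : positions)
                if (tap.position == pos)
                    tap.consumed = true;
        return positions;
    }

    // The matching loop from scorePassage, without the scoring step.
    static List<int[]> findMatches(TermAtPosition[] order, int wordCount, int span) {
        List<int[]> matches = new ArrayList<>();
        int[] positions = new int[wordCount];
        for (TermAtPosition tap : order) {
            if (tap.consumed)
                continue;
            int p = 0;
            positions[p++] = tap.position;
            for (int u = 0; u < wordCount; u++) {
                if (u == tap.termIndex)
                    continue;
                int next = spanContains(order, u, tap.position, span);
                if (next == -1)
                    break;
                positions[p++] = next;
            }
            if (p == wordCount)
                matches.add(recordMatch(order, positions.clone()));
        }
        return matches;
    }

    public static void main(String[] args) {
        // Tokens "a a b c" with phrase words a=0, b=1, c=2 and a span of 3.
        for (int[] m : findMatches(fromTermIndexes(0, 0, 1, 2), 3, 3))
            System.out.println(Arrays.toString(m)); // prints [0, 2, 3]
    }
}
```

With a span of 3 the first a (position 0) is matched and the second a is left alone, even though positions 1, 2, 3 would form a tighter match. With a span of 2 the first a cannot reach c, so the match starts at the second a instead.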
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POPosition offset for Phrase queries in Lucene
  singulars
  PostTypePostTypeId
  PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. USCommunity
UserOwnerUserId
1. USGene Golovchinsky
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. This table or related slice is empty.
CommentsPostId
1. CODoes this work for MultiPhraseQuery as well?
  singulars
  PostPostId
  PO
  UserUserId
  USJahangir
2. COYou have to know which terms are variants (implicitly ORed) and which are required for a match. I would process the required terms as above; to process variants (only one of which has to match), change the logic around the spanContains() call so that it is called once for each variant and keeps the return value closest to the required term.
  singulars
  PostPostId
  PO
  UserUserId
  USGene Golovchinsky
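The variant handling suggested in the second comment might look roughly like the sketch below. It is only one reading of that comment: the `VariantSpanDemo` class and the `closestVariant` method are hypothetical names, and `TermAtPosition` is a minimal stand-in for the class used in the answer. Because `spanContains` only returns positions after `start`, "closest to the required term" is taken here to mean the smallest position returned.

```java
class VariantSpanDemo {
    // Hypothetical stand-in for the answer's TermAtPosition.
    static final class TermAtPosition {
        final int termIndex;
        final int position;
        boolean consumed;

        TermAtPosition(int termIndex, int position) {
            this.termIndex = termIndex;
            this.position = position;
        }
    }

    // Same search as in the answer: first unconsumed targetWord in (start, start + span].
    static int spanContains(TermAtPosition[] order, int targetWord, int start, int span) {
        for (TermAtPosition tap : order) {
            if (tap.consumed || tap.position <= start || tap.position > start + span)
                continue;
            if (tap.termIndex == targetWord)
                return tap.position;
        }
        return -1;
    }

    // Calls spanContains once per variant and keeps the position nearest to start.
    // Returns -1 when none of the variants occurs within the span.
    static int closestVariant(TermAtPosition[] order, int[] variants, int start, int span) {
        int best = -1;
        for (int variant : variants) {
            int pos = spanContains(order, variant, start, span);
            if (pos != -1 && (best == -1 || pos < best))
                best = pos;
        }
        return best;
    }
}
```

In the matching loop, the call to `spanContains` for a slot that allows several variants would be replaced by a call to `closestVariant`; required terms would still go through `spanContains` directly.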
