StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POHierarchical clusterization heuristics
primarykey
Id
6644453
data
AcceptedAnswerId
6646184
AnswerCount
2
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2011-07-10T23:30:12.887
FavoriteCount
4
LastActivityDate
2011-07-11T12:29:39.813
LastEditDate
2011-07-11T12:29:39.813
LastEditorUserId
653511
OwnerUserId
653511
ParentId
0
PostTypeId
1
Score
4
ViewCount
1079
LastEditorDisplayName
text
Body
I want to explore relations between data items in large array. Every data item represented by multidimensional vector. First of all, I've decided to use clusterization. I'm interested in finding hierarchical relations between clusters (groups of data vectors). I'm able to calculate distance between my vectors. So at the first step I'm finding minimal spanning tree. After that I need to group data vectors according to links in my spanning tree. But at this step I'm disturbed - how to combine different vectors into hierarchical clusters? I'm using heuristics: if two vectors linked, and distance between them is very small - that means that they are in the same cluster, if two wectors are linked but distance between them is larger than threshold - that means that they are in different clusters with common root cluster. But maybe there is better solution? Thanks P.S. Thanks to all! In fact I've tried to use k-means and some variation of CLOPE, but didn't get good results. So, now I'm know that clusters of my dataset actually have complex structure (much more complex than n-spheres). Thats why I want to use hierarchical clusterisation. Also I'm guess that clusters are looks like n-dimension concatenations (like 3d or 2d chain). So I use single-link strategy. But I'm disturbed - how to combine different clusters with each other (in which situation I've to make common root cluster, and in which situations I've to combine all sub-clusters in one cluster?). I'm using such simple strategy: <blockquote> <ul> <li>If clusters (or vectors) are too close to each other - I'm combine their content into one cluster (regulated by threshold)</li> <li>If clusters (or vectors) are too far from each other - I'm creating root cluster and put them into it</li> </ul> </blockquote> But using this strategy I've got very large cluster trees. I'm trying to find satisfactory threshold. But maybe there might be better strategy to generate cluster-tree? Here is a simple picture, describes my question: <img src="https://i.stack.imgur.com/ytsE5.png" alt="enter image description here">
Tags
<algorithm><graph><cluster-analysis><data-mining><hierarchical-clustering>
Title
Hierarchical clusterization heuristics
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USstemm
UserOwnerUserId
1. USstemm
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POHierarchical clusterization heuristics
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POHierarchical clusterization heuristics
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POHierarchical clusterization heuristics
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.