StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
17719393
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
8
CommunityOwnedDate
CreationDate
2013-07-18T09:20:22.447
FavoriteCount
0
LastActivityDate
2013-07-18T09:20:22.447
LastEditDate
LastEditorUserId
0
OwnerUserId
2480047
ParentId
17717179
PostTypeId
2
Score
2
ViewCount
0
LastEditorDisplayName
text
Body
What you intend to accomplish is quite complex (mainly delivering an acceptable accuracy) and there is no simple solution. Basic ideas of the approach I would take in your position: <ol> <li>It can exclusively be used for a given language (e.g., English), the one you consider while performing the development. Relying on translations to bring other languages into account would affect its reliability a lot. A new "starting analysis" would be required for each new language you want to consider. </li> <li>The most important part of this development will be the database and thus you would have to put a big focus on its design, connectivity, data storage/retrieval, etc.</li> <li>Minimum content for the database: a list of words, their associated "value" (timid, angry, etc.), their importance within the given value (the f word getting a 8 out of 10 for "aggressive"), how likely is this word to have a different meaning (as suggested in the comment above, "sick" might get a warning flag to check the exact context), further considerations (e.g., how the number of repetitions affects its value), etc. Additionally, you would have to include different levels of storage (per post, per user, per time, etc.).</li> <li>You have to create an algorithm adaptable and scalable enough (lots of changes, improvements, additions, etc. are expected here) to deliver what you want. The basic idea I would come up from is: assessing each post on account of the values for each word as defined in the database (by considering each word alone, the number of repetitions of the given word, the context of the given word, etc.), that is, checking which words are analysis-worthy and which ones are not; a parsing system not only capable to extract individual words but to analyse the context (words before and after the target one in the same sentence/paragraph or in a different one); setting up some rules to avoid "misunderstandings" (e.g., minimum number of posts to consider that a behaviour is aggressive, otherwise just ignore (perhaps just humour); accounting for complex moods formed on account of different types of posts (e.g., angry in 3 posts + timid in 9 posts = *); etc.); in summary, it has to be capable of converting the "discrete posts reality" into the desired output (an assessment for the post/user) as accurately as possible.</li> </ol> As said, it will not be easy. But if you do things step by step and make sure that the structure is adaptable enough to allow any modification/extension, you might get a pretty reliable piece of software (by understanding what is the ideal result you can expect, that is, a ranking of the type of language used; extrapolating this to a real personality analysis sounds perhaps too ambitious) which might be of interest to quite a few people.
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POC# - Personality Analysis based on on Word/Language
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USvarocarbas
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.