Note that there are some explanatory texts on larger screens.

plurals
  1. PO
    primarykey
    data
    text
    <p>What you intend to accomplish is quite complex (mainly delivering an acceptable accuracy) and there is no simple solution.</p> <p>Basic ideas of the approach I would take in your position:</p> <ol> <li>It can exclusively be used for a given language (e.g., English), the one you consider while performing the development. Relying on translations to bring other languages into account would affect its reliability a lot. A new "starting analysis" would be required for each new language you want to consider. </li> <li>The most important part of this development will be the database and thus you would have to put a big focus on its design, connectivity, data storage/retrieval, etc.</li> <li>Minimum content for the database: a list of words, their associated "value" (timid, angry, etc.), their importance within the given value (the f word getting a 8 out of 10 for "aggressive"), how likely is this word to have a different meaning (as suggested in the comment above, "sick" might get a warning flag to check the exact context), further considerations (e.g., how the number of repetitions affects its value), etc. Additionally, you would have to include different levels of storage (per post, per user, per time, etc.).</li> <li>You have to create an algorithm adaptable and scalable enough (lots of changes, improvements, additions, etc. are expected here) to deliver what you want. The basic idea I would come up from is: assessing each post on account of the values for each word as defined in the database (by considering each word alone, the number of repetitions of the given word, the context of the given word, etc.), that is, checking which words are analysis-worthy and which ones are not; a parsing system not only capable to extract individual words but to analyse the context (words before and after the target one in the same sentence/paragraph or in a different one); setting up some rules to avoid "misunderstandings" (e.g., minimum number of posts to consider that a behaviour is aggressive, otherwise just ignore (perhaps just humour); accounting for complex moods formed on account of different types of posts (e.g., angry in 3 posts + timid in 9 posts = *); etc.); in summary, it has to be capable of converting the "discrete posts reality" into the desired output (an assessment for the post/user) as accurately as possible.</li> </ol> <p>As said, it will not be easy. But if you do things step by step and make sure that the structure is adaptable enough to allow any modification/extension, you might get a pretty reliable piece of software (by understanding what is the ideal result you can expect, that is, a ranking of the type of language used; extrapolating this to a real personality analysis sounds perhaps too ambitious) which might be of interest to quite a few people.</p>
    singulars
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. VO
      singulars
      1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload