    <p>Ahh... but "I really love dogs" and "I really hate dogs" are very similar ;), since both discuss one's feelings towards dogs. It seems that you're missing a step in there: </p> <ol> <li>Run your algorithm and get the general topic groups (e.g. "feelings towards dogs").</li> <li>Run your algorithm again, but this time on each previously "discovered" group, and let it further classify the members into subgroups (e.g. "I hate dogs"/"I love dogs").</li> </ol> <p>If your algorithm adjusts itself based on its experience (i.e. there is some learning involved), then make sure you run one instance of the algorithm for the first classification and a new, separate instance for each sub-classification. If you don't, you may end up finding some groups and then, any time you run your algorithm on those same groups, getting results that are nearly identical or completely unchanged.</p> <h2>Update</h2> <p><a href="http://mahout.apache.org/" rel="nofollow noreferrer">Apache Mahout</a> provides a lot of useful algorithms and <a href="https://cwiki.apache.org/confluence/display/MAHOUT/Quickstart" rel="nofollow noreferrer">examples</a> of Clustering, Classification, Genetic Programming, Decision Forest, and Recommendation Mining.
Here are some of the text classification examples from Mahout:</p> <ul> <li><a href="https://cwiki.apache.org/confluence/display/MAHOUT/Wikipedia+Bayes+Example" rel="nofollow noreferrer">Wikipedia classification</a> </li> <li><a href="https://cwiki.apache.org/confluence/display/MAHOUT/Twenty+Newsgroups" rel="nofollow noreferrer">Twenty Newsgroups classification</a></li> <li><a href="https://cwiki.apache.org/MAHOUT/creating-vectors-from-text.html" rel="nofollow noreferrer">Creating Vectors from Text</a></li> <li><a href="http://www.slideshare.net/gsingers/intro-to-apache-mahout" rel="nofollow noreferrer">Document Similarity with Mahout</a></li> <li><a href="http://mahout.apache.org/javadoc/core/org/apache/mahout/cf/taste/impl/recommender/GenericItemBasedRecommender.html" rel="nofollow noreferrer">Item Based Recommender</a> </li> </ul> <p>I'm not sure which one would best apply to your problem, but if you look them over you should be able to figure out which one is the most suitable for your specific application.</p>
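<p>The two-pass idea above can be sketched roughly as follows. This is only an illustration, not your algorithm: <code>LeaderClusterer</code>, <code>two_level</code>, the Jaccard word-overlap measure, the sample sentences, and both thresholds are assumptions made up for the example. The point is the structure: one instance for the coarse pass, then a fresh instance for every group it found.</p>

```python
import re

def tokens(text):
    """Return the set of lowercase words in a sentence."""
    return frozenset(re.findall(r"[a-z']+", text.lower()))

def jaccard(a, b):
    """Jaccard similarity between two token sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)

class LeaderClusterer:
    """Hypothetical stand-in for 'your algorithm': greedy leader clustering.

    The instance remembers its cluster leaders between calls, which mimics
    an algorithm that learns from experience.
    """

    def __init__(self, threshold):
        self.threshold = threshold
        self.leaders = []  # token set of the first document in each cluster

    def fit(self, docs):
        clusters = [[] for _ in self.leaders]
        for doc in docs:
            t = tokens(doc)
            for i, leader in enumerate(self.leaders):
                if jaccard(t, leader) >= self.threshold:
                    clusters[i].append(doc)
                    break
            else:
                # No leader is similar enough: this document starts a cluster.
                self.leaders.append(t)
                clusters.append([doc])
        return [c for c in clusters if c]

def two_level(docs, coarse=0.3, fine=0.7):
    """Pass 1 finds topic groups; pass 2 splits each group with a new instance."""
    topic_groups = LeaderClusterer(coarse).fit(docs)
    return [LeaderClusterer(fine).fit(group) for group in topic_groups]

# Illustrative input only.
DOCS = [
    "I really love dogs",
    "I really hate dogs",
    "I love dogs",
    "I hate dogs",
    "Stock prices fell today",
    "Stock prices rose today",
]

if __name__ == "__main__":
    for topic in two_level(DOCS):
        print(topic)
```

<p>With these made-up thresholds, the coarse pass puts all four dog sentences into one topic group, and the second pass separates the "love" sentences from the "hate" ones. Calling <code>fit</code> a second time on the same <code>LeaderClusterer</code> instance with the same input returns the same grouping, which is the "nothing has changed" situation described above.</p>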