
    <p>To my knowledge, there are no well-known classification methods that work directly on dynamically sized input collections. Dimensionality reduction maps high-dimensional data of fixed size to fewer dimensions, so it doesn't seem like what you need.</p> <p>One way this is handled in machine learning for binary classification (which seems to be the problem you're interested in) is by building histograms. For example, you can categorize texts of varying length by building a histogram of the words that appear in each text, which gives every text a vector of the same length (one entry per vocabulary word). Several extensions have been proposed, such as histograms of bi-grams or n-grams, but they build on the same idea.</p> <p>Another idea is structured prediction. A good example is part-of-speech tagging: given a sentence, you need to determine the part of speech of each word. In this setup each word has a label, and the interaction between labels is very important. Well-understood methods for this type of problem are <a href="http://www.cs.cornell.edu/~cnyu/latentssvm/" rel="nofollow">Latent Structural SVM</a>, <a href="http://en.wikipedia.org/wiki/Conditional_random_field" rel="nofollow">CRF</a>, and <a href="http://www.google.com/url?sa=t&amp;rct=j&amp;q=&amp;esrc=s&amp;source=web&amp;cd=1&amp;cad=rja&amp;ved=0CDgQFjAA&amp;url=http://books.nips.cc/papers/files/nips16/NIPS2003_AA04.pdf&amp;ei=t9vAUIH2NKm20QGV9YGACw&amp;usg=AFQjCNHzW9Wc68rib1oO9mV71CtHXm7AWg&amp;sig2=nyz53sKhTuQg7n5HFKwtuw" rel="nofollow">Max Margin Markov Network</a>.</p>
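<p>As a rough illustration of the histogram idea, here is a minimal sketch in plain Python. The function names (<code>build_vocabulary</code>, <code>word_histogram</code>) and the two sample texts are made up for this example, and splitting on whitespace is a simplifying assumption; a real pipeline would use a proper tokenizer.</p>

```python
from collections import Counter


def build_vocabulary(texts):
    """Collect every distinct word (sorted) so each word gets a fixed index."""
    return sorted({word for text in texts for word in text.lower().split()})


def word_histogram(text, vocabulary):
    """Map a text of any length to a fixed-length vector of word counts."""
    counts = Counter(text.lower().split())
    # Counter returns 0 for words that do not occur in this text.
    return [counts[word] for word in vocabulary]


# Hypothetical inputs of different lengths.
texts = ["the cat sat", "the dog sat on the mat by the door"]
vocabulary = build_vocabulary(texts)
vectors = [word_histogram(text, vocabulary) for text in texts]

# Both vectors have len(vocabulary) entries, regardless of text length.
for text, vector in zip(texts, vectors):
    print(len(vector), vector)
```

<p>Once every input is a vector of the same length, any standard fixed-input binary classifier can be trained on the vectors. Replacing single words with pairs of adjacent words would give the bi-gram variant.</p>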
 
