Note that there are some explanatory texts on larger screens.

plurals
  1. PO
    primarykey
    data
    text
    <p>First, i have no inside knowledge of S/U's Recommendation Engine. What i do know, i've learned from following this topic for the last few years and from studying the publicly available sources (including StumbleUpon's own posts on their company Site and on their Blog), and of course, as a user of StumbleUpon. </p> <p>I haven't found a single source, authoritative or otherwise, that comes anywhere close to saying "here's how the S/U Recommendation Engine works", still given that this is arguably the most successful Recommendation Engine ever--the statistics are insane, <a href="http://www.adweek.com/news/technology/stumblers-out-refer-facebookers-134287" rel="noreferrer">S/U accounts for <em>over half</em> of all referrals on the Internet</a>, and substantially more than facebook, despite having a fraction of the registered users that facebook has (800 million versus 15 million); what's more S/U is not really a site with a Recommendation Engine, like say, Amazon.com, instead the Site <em>itself</em> is a Recommendation Engine--there is a substantial volume of discussion and gossip among the fairly small group of people who build Recommendation Engines such that if you sift through this, i think it's possible to reliably discren the types of algorithms used, the data sources supplied to them, and how these are connected in a working data flow.</p> <p>The description below refers to my <em>Diagram</em> at bottom. Each step in the data flow is indicated by a roman numeral. My description proceeds backwards--beginning with the point at which the URL is delivered to the user, hence in actual use step I occurs last, and step V, first. </p> <p><strong>salmon-colored ovals</strong> => <em>data sources</em></p> <p><strong>light blue rectangles</strong> => <em>predictive algorithms</em></p> <hr> <p>I. A Web Page recommended to an S/U user is the last step in a multi-step flow</p> <p>II. The StumbleUpon Recommendation Engine is supplied with data (web pages) from three distinct sources:</p> <ul> <li><p>web <em>pages tagged with topic tags</em> matching your pre-determined <em>Interest</em>s (topics a user has indicated as interests, and which are available to view/revise by clicking the "Settings" Tab on the upper right-hand corner of the logged-in user page);</p></li> <li><p>socially Endorsed Pages (*pages <em>liked</em> by this user's Friends*); and</p></li> <li><p>peer-Endorsed Pages (*pages <em>liked</em> by similar users*);</p></li> </ul> <p> <p>III. Those sources in turn are results returned by StumbleUpon predictive algorithms (<em>Similar Users</em> refers to users in the same cluster as determined by a <em>Clustering Algorithm</em>, which is perhaps k-means).</p> <p>IV. The data used fed to the <em>Clustering Engine</em> to train it, is comprised of web pages annotated with user ratings</p> <p>V. This data set (web pages rated by StumbleUpon users) is also used to train a <em>Supervised Classifier</em> (<em>e.g</em>., multi-layer perceptron, support-vector machine) The output of this supervised classifier is a class label applied to a web page not yet rated by a user. </p> <p>The single best source i have found which discussed SU's Recommendation Engine in the context of other Recommender Systems is <a href="http://www.betabeat.com/2011/05/18/how-hunch-and-stumbleupon-are-different/" rel="noreferrer">this BetaBeat Post</a>.</p> <p><img src="https://i.stack.imgur.com/rqxpR.png" alt="enter image description here"></p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. VO
      singulars
      1. This table or related slice is empty.
    2. VO
      singulars
      1. This table or related slice is empty.
    3. VO
      singulars
      1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload