Note that there are some explanatory texts on larger screens.

plurals
  1. POCan you recommend a full-text search engine?
    text
    copied!<p>Can you recommend a full-text search engine? (Preferably open source)</p> <p>I have a database of many (though relatively short) HTML documents. I want users to be able to search this database by entering one or more search words in my C++ desktop application. Hence, I’m looking for a fast full-text search solution to integrate with my app. Ideally, it should:</p> <ul> <li>Skip common words, such as <code>the</code>, <code>of</code>, <code>and</code>, etc.</li> <li>Support stemming, i.e. search for <code>run</code> also finds documents containing <code>runner</code>, <code>running</code> and <code>ran</code>.</li> <li>Be able to update its index in the background as new documents are added to the database.</li> <li>Be able to provide search word suggestions (like Google Suggest)</li> <li>Have a well-documented API</li> </ul> <p>To illustrate, assume the database has just two documents:</p> <blockquote> <p><em>Document 1</em>: <code>This is a test of text search.</code></p> <p><em>Document 2</em>: <code>Testing is fun.</code></p> </blockquote> <p>The following words should be in the index: <code>fun</code>, <code>search</code>, <code>test</code>, <code>testing</code>, <code>text</code>. If the user types <code>t</code> in the search box, I want the application to be able to suggest <code>test</code>, <code>testing</code> and <code>text</code> (Ideally, the application should be able to query the search engine for the 10 most common search words starting with <code>t</code>). A search for <code>testing</code> should return both documents.</p> <p>Other points:</p> <ul> <li>I don't need multi-user support</li> <li>I don't need support for complex queries</li> <li>The database resides on the user's computer, so the indexing should be performed locally. </li> </ul> <p>Can you suggest a C or C++ based solution? (I’ve briefly reviewed <a href="http://clucene.sourceforge.net/" rel="nofollow noreferrer">CLucene</a> and <a href="http://xapian.org" rel="nofollow noreferrer">Xapian</a>, but I’m not sure if either will address my needs, especially querying the search word indexes for the suggest feature).</p>
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload