Note that there are some explanatory texts on larger screens.

plurals
  1. PO
    primarykey
    data
    text
    <p>You could analyse how much the strings overlap (and generate list of words/substring which appear in most of them) and then pick the most relevant words.</p> <p>For example, if the words appear in certain percentage of the strings, you can identify them as the most likely candidates for the product name. (So similar to what you have done but add thresholds - e.g. you can see that 5 words appear in 88% of the strings and the other ones in much lower percentage - then pick the top 5 as a product name. This is not something exact I am afraid and needs to be manually tweaked.) This should allow to gather majority of the information but will never be perfect.</p> <p>Additionally, you can have a pre-defined list of brands and filter those words out. I would also account for partial matching of the words as they can be a product of manual data entry and there can always be typos. You can see how relevant this is, if you get strong enough "signal" by simply discarding them, then no need to worry.</p> <p>Going even further, you can specify another filter to mark items for manual curation but this may be very time consuming.</p> <p>I am afraid there is no simple answer. What you are doing is essentially text mining. I have just thrown a few ideas and starting points that can help you start.</p> <p>The above would work assuming you are building some automatic crawler trying to put together date from multiple sources. If you would like to enable visitors to search your site and return the right product page for all of the queries, then I would suggest diving into some text-searching (principal data analysis anyone?). Or just use some ready-made solution.</p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. VO
      singulars
      1. This table or related slice is empty.
    2. VO
      singulars
      1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload