Note that there are some explanatory texts on larger screens.

plurals
  1. POMOSS 2007 Crawl
    primarykey
    data
    text
    <p>I'm trying to get crawl to work on two separate farms I have but can't get it to work on either one. They both have two WFE's with an additional WFE configured as an Index server. There is one more server dedicated for Query and two clustered SQL 2005 back end servers for the database. I have unsuccessfully tried at least 50 different websites that I found with solutions from a search engine. I have configured (extended) my Web App to use <a href="http://servername:12345" rel="nofollow noreferrer">http://servername:12345</a> as the default zone and <a href="http://abc.companyname.com" rel="nofollow noreferrer">http://abc.companyname.com</a> as the custom and intranet zones. When I enter each of those into the content source and then try to run a crawl, I get a couple of errors in the crawl log:</p> <p><a href="http://servername:12345" rel="nofollow noreferrer">http://servername:12345</a> returns:<br> "Could not connect to the server. Please make sure the site is accessible."</p> <p><a href="http://abc.companyname.com" rel="nofollow noreferrer">http://abc.companyname.com</a> returns:<br> "Deleted by the gatherer. (The start address or content source that contained this item was deleted and hence this item was deleted.)"</p> <p>However, I can click both URL's and the page is accessible.</p> <p>Any ideas?</p> <hr> <p>More info:</p> <p>I wiped the slate clean, so to speak, and ran another crawl to provide an updated sample.</p> <p>My content sources are as such:</p> <p><a href="http://servername:33333" rel="nofollow noreferrer">http://servername:33333</a><br> <a href="http://sharepoint.portal.fake.com" rel="nofollow noreferrer">http://sharepoint.portal.fake.com</a><br> sps3://servername:33333</p> <p>My current crawl log errors are:</p> <p>sps3://servername:33333<br> Error in PortalCrawl Web Service.</p> <p><a href="http://servername:33333/mysites" rel="nofollow noreferrer">http://servername:33333/mysites</a><br> Content for this URL is excluded by the server because a no-index attribute.</p> <p><a href="http://servername:33333/mysites" rel="nofollow noreferrer">http://servername:33333/mysites</a><br> Crawled</p> <p>sts3://servername:33333/contentdbid={62a647a...<br> Crawled</p> <p>sts3://servername:33333<br> Crawled</p> <p><a href="http://servername:33333" rel="nofollow noreferrer">http://servername:33333</a><br> Crawled</p> <p><a href="http://sharepoint.portal.fake.com" rel="nofollow noreferrer">http://sharepoint.portal.fake.com</a><br> The Crawler could not communicate with the server. Check that the server is available and that the firewall access is configured correctly.</p> <p>I double checked for typos above and I don't see any so this should be an accurate reflection.</p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. VO
      singulars
      1. This table or related slice is empty.
    2. VO
      singulars
      1. This table or related slice is empty.
    3. VO
      singulars
      1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload