External Site Search problem

I'm trying to set up search for an external web site in MOSS 2007, for example http://news.com.com/. I'm getting an error and the crawl is not happening.

The Error is the following:

Could not Connect to the server. Please make sure the server is accessible.

I'm doing the following:

1. Create new content source

2. Create crawl Rule

3. Then I do a full crawl (but it's not happening because of the above mentioned error)

I suspect that I'm not setting the authentication properly. I've tried using the default content access account, which is the SharePoint Administrator.

Does anybody have an idea which account we should use, or how to make this work?

Thanks in advance!

- Istvan

[871 byte] By [IstvanSimon] at [2008-2-6]
# 1

I suppose you are trying to crawl an internet site from your company's network. Please make the necessary infrastructure changes so that your MOSS 2007 server can talk to internet sites.
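One quick way to check this from the crawl (index) server itself is a plain TCP test against the site. The following is only a minimal sketch, assuming Python is available on that server; `can_reach` is a hypothetical helper written for this illustration and is not part of MOSS.

```python
import socket


def can_reach(host, port=80, timeout=5.0):
    """Return (ok, detail); ok is True if a TCP connection to host:port succeeds."""
    try:
        # Resolve the name first so DNS failures are reported separately.
        infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    except socket.gaierror as exc:
        return False, f"DNS lookup failed: {exc}"

    last_error = None
    for family, socktype, proto, _canonname, sockaddr in infos:
        try:
            with socket.socket(family, socktype, proto) as sock:
                sock.settimeout(timeout)
                sock.connect(sockaddr)
                return True, f"connected to {sockaddr[0]}:{sockaddr[1]}"
        except OSError as exc:
            # Remember the failure and try the next resolved address, if any.
            last_error = exc
    return False, f"TCP connect failed: {last_error}"
```

Calling `can_reach("news.com.com")` on the server would show whether the name resolves and whether port 80 can be opened directly. A failure here points to a network, firewall, or proxy issue rather than to the content access account.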

In order to crawl password-protected internet sites, you need to do a few things.

Please refer to the following links:

http://support.microsoft.com/kb/284022

http://blogs.msdn.com/sharepoint/archive/2006/01/12/511912.aspx

http://technet2.microsoft.com/WindowsServer/WSS/en/library/8208d71e-7c41-4845-bc06-95429de02cf11033.mspx

SundarNarasimman at 2007-9-6 > top of Msdn Tech,SharePoint Products and Technologies,SharePoint - Search...
# 2

Hi,

Did you find a solution to the problem? Please let me know when you do; I am running into the same problem too. Mail me at Honey008@gmail.com. Your help is greatly appreciated.

Thanks,

HaniNataraj at 2007-9-6
# 3

Hi,

Does MOSS need something like an external IP address that is reachable from the internet in order to crawl external sites? Kindly let me know if anyone has an answer.

HaniNataraj at 2007-9-6
# 4

If you are trying to crawl a site that allows anonymous access, which your example http://news.com.com does, then you should be able to crawl it. The problem is more likely to be proxy server settings than authentication. If you reach the internet from your intranet through proxy servers, the crawler may not be able to get out of your intranet without those settings. If you are using proxy servers and you haven't configured them, the link below shows how:

http://technet2.microsoft.com/Office/en-us/library/fe8aa9d7-96e6-4136-b699-9fddc7947fa11033.mspx?mfr=true
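To tell a proxy problem apart from an authentication problem, it can help to fetch the page from the crawl server both directly and through the proxy. This is only a rough sketch, assuming Python is available on that server; `try_fetch` is a hypothetical helper for this illustration, and the proxy address in the usage note is a placeholder, not a real setting.

```python
import urllib.error
import urllib.request


def try_fetch(url, proxy=None, timeout=10.0):
    """Fetch url directly (proxy=None) or through an HTTP proxy; return (ok, detail)."""
    # An empty mapping disables any proxy picked up from the environment.
    proxies = {"http": proxy, "https": proxy} if proxy else {}
    opener = urllib.request.build_opener(urllib.request.ProxyHandler(proxies))
    try:
        with opener.open(url, timeout=timeout) as response:
            return True, f"HTTP {response.status}"
    except urllib.error.HTTPError as exc:
        # The server (or the proxy) answered, but with an error status such as 407.
        return False, f"HTTP {exc.code} {exc.reason}"
    except (urllib.error.URLError, OSError) as exc:
        # No answer at all: DNS failure, refused connection, timeout, and so on.
        return False, f"connection failed: {exc}"
```

For example, compare `try_fetch("http://news.com.com/")` with `try_fetch("http://news.com.com/", proxy="http://proxy.example.local:8080")`. If only the proxied call succeeds, the crawler needs the proxy configured. An HTTP 407 result would suggest that the proxy itself is asking for credentials.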

-Puneet

PuneetNarula-MSFT at 2007-9-6
# 5

Hi,

The proxy server settings are all fine, but I guess there is some problem with authentication, as Istvan said. In my case, the ID we use in the browser to connect to the internet is not a domain account. My default credentials for crawling are a domain account that I use for all administrative purposes, so MOSS crawls SharePoint sites easily, but this ID is not valid for connecting to the internet. I am a little confused here.

Does MOSS allow different credentials for different content sources (external internet sites)? Or where exactly am I deviating from the correct settings? If anyone knows the "how to" steps, it would be of great use. I am still searching the net for a solution.

-Hani

HaniNataraj at 2007-9-6
