The nightly builds are all cataloged here:
http://lucene.zones.apache.org:8080/hudson/job/Nutch-Nightly/
The current nightly build is #153 from July 18.
For instance, you could do:
wget http://lucene.zones.apache.org:8080/hudson/job/Nutch-Nightly/153/artifact/trunk/build/nutch-2007-07-18_04-01-20.tar.gz
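A rough sketch of how the artifact URL above is put together, assuming other builds follow the same path layout as build #153 (the helper name `nightly_url` is made up for illustration, not something Hudson or Nutch provides). The file name carries the build timestamp, so it presumably has to match the build you are asking for:

```shell
#!/bin/sh
# Hypothetical helper: assemble the artifact URL for a given Hudson build.
# Assumption: every build uses the same path layout as build #153 above.
nightly_url() {
    build="$1"   # Hudson build number, e.g. 153
    stamp="$2"   # timestamp from the artifact file name, e.g. 2007-07-18_04-01-20
    base="http://lucene.zones.apache.org:8080/hudson/job/Nutch-Nightly"
    printf '%s/%s/artifact/trunk/build/nutch-%s.tar.gz\n' "$base" "$build" "$stamp"
}

# Print the URL for build #153.
nightly_url 153 2007-07-18_04-01-20

# To download it, hand the result to wget:
# wget "$(nightly_url 153 2007-07-18_04-01-20)"
```

The build number and timestamp both come from the catalog page for that build; nothing here looks them up for you.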
--Kai
----- Original Message ----
From: Tsengtan A Shuy <ttashuy sbcglobal.net>
To: nutch-dev lucene.apache.org
Sent: Wednesday, July 18, 2007 11:59:52 AM
Subject: RE: no nutch script file under bin directory
Where do you get the nightly build? I followed the web page you referred me to and used
"wget http://lucene.zones.apache.org:8080/hudson/job/Nutch-Nightly/lastStableBuild/artifact/trunk/build/nutch-2007-06-27_06-52-44.tar.gz"
to get it. Then I got a "file not found" error message.
Adam Shuy, President
ePacific Web Design & Hosting
Professional Web/Software developer
TEL: 408-272-6946
www.epacificweb.com
-----Original Message-----
From: Kai_testing Middleton [mailto:kai_testing yahoo.com]
Sent: Wednesday, July 18, 2007 11:35 AM
To: nutch-dev lucene.apache.org
Subject: Re: no nutch script file under bin directory
I'm not actually sure ... I think I downloaded and unzipped a nightly build
in my /usr/local directory, thus creating this directory:
/usr/local/nutch-2007-06-27_06-52-44
Then, from within that directory, I ran the svn command ... if I remember
correctly.
You can always try just making a 'nutch' directory or a 'nutch0.9'
directory, running svn, and seeing whether it creates another subdirectory
under that, then moving things to where you want.
----- Original Message ----
From: Tsengtan A Shuy <ttashuy sbcglobal.net>
To: nutch-dev lucene.apache.org
Sent: Tuesday, July 17, 2007 5:30:18 PM
Subject: RE: no nutch script file under bin directory
This may seem like a silly question, but I need to ask it anyway.
When I check out the trunk, should I put it in the nutch directory, which
should be the latest release directory, e.g. nutch-0.9?
Am I right?
-----Original Message-----
From: Tsengtan A Shuy [mailto:ttashuy sbcglobal.net]
Sent: Tuesday, July 17, 2007 12:33 PM
To: 'Tsengtan A Shuy'; nutch-dev lucene.apache.org
Subject: RE: no nutch script file under bin directory
BTW, I just found out there is only one web page reference in your last
email, so I do not understand what you meant by "two discussions".
-----Original Message-----
From: Tsengtan A Shuy [mailto:ttashuy sbcglobal.net]
Sent: Tuesday, July 17, 2007 12:23 PM
To: 'nutch-dev lucene.apache.org'
Subject: no nutch script file under bin directory
I followed msg06571.html to check out the trunk.
Then I found there is no nutch script file under the bin directory.
How do you crawl multiple websites without this nutch script file?
-----Original Message-----
From: Kai_testing Middleton [mailto:kai_testing yahoo.com]
Sent: Monday, July 16, 2007 8:43 AM
To: nutch-dev lucene.apache.org
Subject: Re: OOM error during parsing with nekohtml
You could try looking at these two discussions:
http://www.mail-archive.com/nutch-dev@lucene.apache.org/msg06571.html
http://www.mail-archive.com/nutch-dev@lucene.apache.org/msg06571.html
--Kai