linux - Can't run parallel jobs in Heritrix3 Web Crawler -
i created 2 jobs in heritrix 3.2.0 , launched both after building, both started running after 15 20 seconds, 1 job stopped , other continues , when job stopped, status in jobs log follows:
2015-05-12t06:40:33.715z info empty 20150512063923
so not multi-process jobs. how fix it?
no means job done (queue empty). if no pages downloaded, means decide rules strict , don't allow downloaded.
Comments
Post a Comment