java – crawler4j: recrawling a website does not work
Your crawl storage folder was created on the first run. It can't be deleted automatically (to allow a recrawl) because access to it is denied, so on the second run the crawler checks the folder and concludes that all URLs have already been crawled. You need to patch crawler4j so that it fully releases its access to the crawl storage folder. See: https://code.google.com/p/crawler4j/issues/detail?id=157
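If patching the library is not an option, one possible workaround is to clear the storage folder yourself before each crawl, or to point each run at a brand-new folder. The sketch below is only an illustration under those assumptions: the class name CrawlStorageCleaner and both helper methods are hypothetical and not part of crawler4j, and the delete will still fail with an exception if another process (or a previous crawler instance in the same JVM) keeps files in the folder open.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of crawler4j.
final class CrawlStorageCleaner {

    private CrawlStorageCleaner() {}

    // Deletes the folder and everything below it. Throws IOException if any
    // entry cannot be removed, e.g. because a file handle is still open.
    static void deleteRecursively(Path root) throws IOException {
        if (!Files.exists(root)) {
            return; // nothing left over from an earlier crawl
        }
        List<Path> paths;
        try (Stream<Path> walk = Files.walk(root)) {
            // Reverse order lists children before their parent directories.
            paths = walk.sorted(Comparator.reverseOrder())
                        .collect(Collectors.toList());
        }
        for (Path p : paths) {
            Files.delete(p);
        }
    }

    // Alternative: sidestep the locked folder by creating a unique,
    // empty storage folder for every run.
    static Path freshStorageFolder(Path parent) throws IOException {
        Files.createDirectories(parent);
        return Files.createTempDirectory(parent, "crawl-");
    }
}
```

Either call deleteRecursively on the old folder before starting the crawl, or pass the result of freshStorageFolder(...).toString() to CrawlConfig.setCrawlStorageFolder so that the second run starts with no record of previously visited URLs.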