For some reason nutch is crawling URLs with the anchor tag (#). I even updated the regex-filter to not include the # in URLs during the crawl, but that doesn't seem to be working.
Any ideas what may be going on?
For some reason nutch is crawling URLs with the anchor tag (#). I even updated the regex-filter to not include the # in URLs during the crawl, but that doesn't seem to be working.
Any ideas what may be going on?
Copyright © 2021 Jogjafile Inc.