56 | | !DataAccessor as this is the best place to do this detection. First, this prevents object creation when the resource has not been modified since the last scan (DataAccessor returns null). Furthermore, this allows the !HttpDataAccessor to use HTTP-specific functionality to let |
57 | | the webserver decide on whether the resource has changed since the last scan, |
58 | | preventing an unchanged file from being transported to the crawling side in |
59 | | the first place. |
| 56 | !DataAccessor as this is the best place to do this detection: |
| 57 | |
| 58 | * This prevents object creation when the resource has not been modified since the last scan (DataAccessor returns null). |
| 59 | |
| 60 | * This allows for more sophisticated optimizations, e.g. the !HttpDataAccessor uses HTTP-specific functionality so that the webserver can decide whether the resource has changed since the last scan. This prevents an unchanged web page from being transported to the crawling side in the first place. |