Changes between Version 3 and Version 4 of ApertureSimpleDataCrawler
- Timestamp:
- 10/17/05 11:28:51 (19 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
ApertureSimpleDataCrawler
v3 v4 68 68 "Crawler" is probably not a good term to use in the name of this interface as there is some other code using an instance of this interface that retrieves the folder and object uris and decides to retrieve them (i.e. the actual crawling). 69 69 70 The Javadoc comments suggest that Extractors are applied internally in the crawler implementations. I think they should be kept outside it: it is not up to the crawlers to decide how the encountered objects are processed. For example, a wget-like utility using this framework will need all metadata obtained from the data source but would have no interest in the extracted text. 71 70 72 What I do like is that folders become more prominent. This may seem to contradict what I said before. What I mean is that it is possible to retrieve information about folders itself. Using a simple extension (e.g., a getDataOfFolder method) it becomes possible to retrieve all metadata of a folder. This is something we have not considered before in our own architecture. 71 73