Changes between Version 3 and Version 4 of ApertureSimpleDataCrawler


Ignore:
Timestamp:
10/17/05 11:28:51 (19 years ago)
Author:
anonymous
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • ApertureSimpleDataCrawler

    v3 v4  
    6868"Crawler" is probably not a good term to use in the name of this interface as there is some other code using an instance of this interface that retrieves the folder and object uris and decides to retrieve them (i.e. the actual crawling). 
    6969 
     70The Javadoc comments suggest that Extractors are applied internally in the crawler implementations. I think they should be kept outside it: it is not up to the crawlers to decide how the encountered objects are processed. For example, a wget-like utility using this framework will need all metadata obtained from the data source but would have no interest in the extracted text. 
     71 
    7072What I do like is that folders become more prominent. This may seem to contradict what I said before. What I mean is that it is possible to retrieve information about folders itself. Using a simple extension (e.g., a getDataOfFolder method) it becomes possible to retrieve all metadata of a folder. This is something we have not considered before in our own architecture. 
    7173