Changes between Version 6 and Version 7 of ApertureDataAccessor
- Timestamp:
- 10/18/05 20:52:54 (20 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
ApertureDataAccessor
v6 v7 27 27 * The !CrawlData interface allows us in the future to get rid of the !CrawlDataBase implementation class, which has its own storage format, and create an adapter that works on top of the Sesame Repository that also contains all extracted metadata. This way all known metadata of a resource is stored in a single place, ensuring consistency, lower resource comsumption, improved caching behaviour, etc. 28 28 29 * Leo: to simplify and seperate the '''get (=get it now!)''' and '''getCrawl (=check if changed, get if changed)''' I would suggest to define two methods, one for really getting a resource and one in the crawling scenario. The getCrawl method would be the existing one, the get method a simpler one. 30 29 31 30 32 == Java Interface == … … 36 38 * A DataAccessor provides access to physical resources by creating DataObjects 37 39 * representing the resource, based on a url and optionally data about a previous access 38 * and other parameters. 40 * and other parameters. 41 * The main task of a DataAccessor is to find the resource identified by the URL String 42 * and create a DataObject that represents the resource. When crawling, the DataAccessor 43 * additionally uses the passed CrawlData interface to check and update information about 44 * the last crawl. 45 * About the returned DataObject: i n most cases, the DataObject is just a passive container 46 * of information, the DataAccessor will have filled it with information. However, it may 47 * also have returned a dedicated DataObject implementation that determines some things 48 * dynamically, that is up to the DataAccessor to decide. 39 49 */ 40 50 public interface DataAccessor { … … 57 67 * @param uri The uri used to address the resource. 58 68 * @param dataSource The source that will be registered as the source of the DataObject. 59 * @param accessDataOptional database containing information about previous accesses.69 * @param crawlData Optional database containing information about previous accesses. 60 70 * @param params Optional additional parameters needed to access the physical resource. 71 * also, parameters may be passed that determine how the metadata should be 72 extracted or which detail 73 * of metadata is needed. Applications may pass params through the whole chain. 61 74 * @return A DataObject for the specified URI, or null when an AccessData instance has been 62 75 * specified and the binary resource has not been modified since the last access. … … 65 78 */ 66 79 public DataObject get(URI uri, DataSource source, 67 AccessData accessData, Map<?,?> params) throws UriNotFoundException, IOException;80 CrawlData crawlData , Map<?,?> params) throws UriNotFoundException, IOException; 68 81 } 69 82 }}}