wiki:ApertureHierachicalAccess

An interface that allows hierarchical access to the data inside a datasource, provided the datasource supports hierarchical access. Most desktop datasources, such as Outlook, Lotus Notes, Files, and EmailFolders, are strongly hierarchical. DFKI frameworks often have to read this hierarchy directly from the datasource in order to extract its implicit semantics.

Therefore, this interface is a convenience add-on that exists in parallel to, and independently of, ApertureDataCrawler. Whereas ApertureDataCrawler is meant only for incremental crawling, ApertureHierachicalAccess is meant for one-time crawling when someone (the user?) wants to see the hierarchy inside the datasource. Note that if a DataSource supports ApertureHierachicalAccess, then the extracted data stored in a database must also expose the hierarchical structure in some way. All data that make up the hierarchy expressed by ApertureHierachicalAccess should therefore also be returned by ApertureDataCrawler.

/**
 * Convenience access to the data inside a DataSource.
 * Not all DataSources will support HierachicalAccess. If a DataSource does, this interface
 * gives the user a way to see the structure inside the DataSource without having to
 * crawl it completely. A preview of the extracted data can be given using the HierachicalAccess,
 * or the hierarchical structure can be extracted without having to extract all data.
 */
public interface HierachicalAccess {

 /**
  * Returns the DataSource on which this HierachicalAccess works.
  */
 public DataSource getDataSource();

 /**
  * Gets the uri of the root folder of this datasource. This is the first folder to crawl;
  * its subfolders can be retrieved using listSubFolders, and theirs in turn, level by level.
  */
 public String getRootFolder();

 /**
  * Gets the detailed data of one object.
  * This is costly.
  * If the DataObject is a file, the InputStream of the file has not yet been converted to
  * metadata; use the Extractors assigned to the mime-type for that.
  * Internally, this uses a suitable DataAccessor to access the DataObject from inside
  * the datasource.
  */
 public DataObject getDataObject(String uri);

 /**
  * Lists the sub-folders of this folder. The Iterator contains folder uris as Strings.
  * This may also return the uris of objects, if the objects can contain sub-objects
  * (e.g. IMAP attachments), but this is discouraged because detecting sub-objects of emails is costly.
  * The first call of this method would be made with the result of getRootFolder().
  */
 public Iterator listSubFolders(String uri);

 /**
  * Lists the objects inside the passed folder. The Iterator contains uris of objects as Strings.
  * To get the metadata and data of an object, use getDataObject with the returned uri.
  */
 public Iterator listSubObjects(String uri);

}
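
To illustrate how a caller might use such an interface to preview the structure of a datasource without fetching any DataObject, here is a minimal sketch. It is not part of Aperture: `FolderAccess` is a simplified stand-in that keeps only the three uri-based methods (so the sketch compiles without the DataSource and DataObject types), `InMemoryAccess` is a hypothetical in-memory implementation, and the `file:/docs/...` uris are made-up sample data.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class HierarchyWalkSketch {

    /** Simplified stand-in for HierachicalAccess (assumption: uri-based methods only). */
    interface FolderAccess {
        String getRootFolder();
        Iterator<String> listSubFolders(String uri);
        Iterator<String> listSubObjects(String uri);
    }

    /** Hypothetical in-memory datasource, used only for this demonstration. */
    static final class InMemoryAccess implements FolderAccess {
        private final String root;
        private final Map<String, List<String>> folders = new LinkedHashMap<>();
        private final Map<String, List<String>> objects = new LinkedHashMap<>();

        InMemoryAccess(String root) {
            this.root = root;
        }

        InMemoryAccess addFolder(String parent, String child) {
            folders.computeIfAbsent(parent, k -> new ArrayList<>()).add(child);
            return this;
        }

        InMemoryAccess addObject(String folder, String object) {
            objects.computeIfAbsent(folder, k -> new ArrayList<>()).add(object);
            return this;
        }

        @Override
        public String getRootFolder() {
            return root;
        }

        @Override
        public Iterator<String> listSubFolders(String uri) {
            return folders.getOrDefault(uri, Collections.emptyList()).iterator();
        }

        @Override
        public Iterator<String> listSubObjects(String uri) {
            return objects.getOrDefault(uri, Collections.emptyList()).iterator();
        }
    }

    /** Walks the hierarchy depth-first and returns one indented line per uri. */
    static List<String> walk(FolderAccess access) {
        List<String> lines = new ArrayList<>();
        walk(access, access.getRootFolder(), "", lines);
        return lines;
    }

    private static void walk(FolderAccess access, String folder, String indent, List<String> lines) {
        lines.add(indent + folder);
        // Only uris are listed here; the costly per-object fetch is never triggered.
        for (Iterator<String> it = access.listSubObjects(folder); it.hasNext();) {
            lines.add(indent + "  " + it.next());
        }
        for (Iterator<String> it = access.listSubFolders(folder); it.hasNext();) {
            walk(access, it.next(), indent + "  ", lines);
        }
    }

    /** Builds the made-up sample hierarchy. */
    static FolderAccess demo() {
        return new InMemoryAccess("file:/docs/")
                .addObject("file:/docs/", "file:/docs/readme.txt")
                .addFolder("file:/docs/", "file:/docs/reports/")
                .addObject("file:/docs/reports/", "file:/docs/reports/q1.pdf");
    }

    public static void main(String[] args) {
        walk(demo()).forEach(System.out::println);
    }
}
```

The walk starts at getRootFolder() and descends via listSubFolders, as described in the comments of the interface. A real caller would invoke getDataObject only for the uris the user actually selects, since that call is the costly one.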

Last modified on 10/20/05 11:18:53