11 | | * easy to use: easy to learn, easy to code, easy to deploy in industrial projects |
12 | | * Extract fulltext from many common file formats and information systems like IMAP email servers |
13 | | * Extract metadata like author, date, subject and more from the data sources |
14 | | * open the data objects for viewing |
15 | | * Fully configurable framework, storing and editing config files is done through a SWING gui. |
16 | | * Pluggable architecture: can be easily extended, can be easily integrated to other projects. |
17 | | * Architecture based on industry standard OSGI |
18 | | * Compatible with RDF, but not solely based on it |
| 11 | * easy to use: easy to learn, easy to code, easy to deploy in industrial projects |
| 12 | * Extract fulltext from many common file formats and information systems like IMAP email servers |
| 13 | * Extract metadata like author, date, subject and more from the data sources |
| 14 | * open the data objects for viewing |
| 15 | * Fully configurable framework, storing and editing config files is done through a SWING gui. |
| 16 | * Pluggable architecture: can be easily extended, can be easily integrated to other projects. |
| 17 | * Architecture based on industry standard OSGI |
| 18 | * Compatible with RDF, but not solely based on it |
22 | | * DataSource Interface |
23 | | * TextExtractor Interface |
24 | | * DataSource implementation for Filesystem |
25 | | * DataSource implementation for IMAP mail servers |
26 | | * TextExtractor implementation for everything we know: PDF, Word, Fulltext, excel |
27 | | * OSGI bindings and connector code |
28 | | * Configuration gui |
29 | | * Sample appication showing how to use it, with gui (=either Autofocus or Sesame or Gnowsis) |
30 | | * Metadata format description (RDFS schema) and example file for the metadata |
| 22 | * DataSource Interface |
| 23 | * TextExtractor Interface |
| 24 | * DataSource implementation for Filesystem |
| 25 | * DataSource implementation for IMAP mail servers |
| 26 | * TextExtractor implementation for everything we know: PDF, Word, Fulltext, excel |
| 27 | * OSGI bindings and connector code |
| 28 | * Configuration gui |
| 29 | * Sample appication showing how to use it, with gui (=either Autofocus or Sesame or Gnowsis) |
| 30 | * Metadata format description (RDFS schema) and example file for the metadata |
34 | | * Plain text |
35 | | * HTML |
36 | | * XML |
37 | | * PDF (Portable Document Format) |
38 | | * RTF (Rich Text Format) |
39 | | * Microsoft Word 97+ |
40 | | * Microsoft Excel 97+ |
41 | | * Microsoft Powerpoint 97+ |
42 | | * Microsoft Works |
43 | | * OpenOffice 1.0+: Writer, Calc, Impress, Draw |
44 | | * StarOffice 6.0+: Writer, Calc, Impress, Draw |
45 | | * WordPerfect 5.x |
46 | | * Emails |
47 | | * IMAP Servers |
| 34 | * Plain text |
| 35 | * HTML |
| 36 | * XML |
| 37 | * PDF (Portable Document Format) |
| 38 | * RTF (Rich Text Format) |
| 39 | * Microsoft Word 97+ |
| 40 | * Microsoft Excel 97+ |
| 41 | * Microsoft Powerpoint 97+ |
| 42 | * Microsoft Works |
| 43 | * OpenOffice 1.0+: Writer, Calc, Impress, Draw |
| 44 | * StarOffice 6.0+: Writer, Calc, Impress, Draw |
| 45 | * WordPerfect 5.x |
| 46 | * Emails |
| 47 | * IMAP Servers |