This package provides panFMP's harvesters and their corresponding classes.
Class Summary Class Description DirectoryHarvesterHarvester for traversing file system directories. ElasticsearchHarvesterTODO HarvesterHarvester interface to panFMP. NoOpHarvesterThis harvester can be used to 'disable' harvesting from a source, e.g. OAIHarvesterHarvester for OAI-PMH repositories. OAIHarvesterBaseAbstract base class for OAI harvesting support in panFMP. OAIMetadataDocumentSpecial implementation of
MetadataDocumentthat adds OAI set support to internal fields
OAIMetadataSaxRuleThis class is used as a rule for the "metadata" element of the OAI response. OAIStaticRepositoryHarvesterHarvester for OAI static repositories. PanFMP1IndexHarvesterThis harvester supports replication XML contents from a legacy panFMP 1.x installation. RebuilderIndex rebuilder implemented as harvester that reads all documents from an index and pushes them back into the index. SingleFileEntitiesHarvesterAbstract harvester class for single file entities (like files from web page or from a local directory). UpdateAliasesTODO WebCrawlingHarvesterHarvester for traversing websites and harvesting XML documents. ZipFileHarvesterHarvester for unzipping ZIP files and reading their contents.
Exception Summary Exception Description OAIExceptionThis exception is thrown on an OAI protocol error, which is triggered by an
<error>response from the repository.
RetryAfterIOExceptionThrown when HTTP server responds with