Package de.pangaea.metadataportal.harvester

This package provides panFMP's harvesters and their corresponding classes.

See:
          Description

Interface Summary
HarvesterCommitEvent This interface must be implemented to receive harvester commit events.
 

Class Summary
Checker Index checker.
DirectoryHarvester Harvester for traversing file system directories.
ExternalIndexHarvester This harvester supports replication XML contents from an foreign panFMP installation.
Harvester Harvester interface to panFMP.
IndexBuilder Component of panFMP that analyzes and indexes harvested documents in different threads.
MetadataDocument This class holds all information harvested and provides methods for IndexBuilder to create a Lucene Document instance from it.
OAIHarvester Harvester for OAI-PMH repositories.
OAIHarvesterBase Abstract base class for OAI harvesting support in panFMP.
OAIMetadataDocument Special implementation of MetadataDocument that adds OAI set support to internal fields
OAIMetadataSaxRule This class is used as a rule for the "metadata" element of the OAI response.
OAIStaticRepositoryHarvester Harvester for OAI static repositories.
Optimizer Index optimizer.
Rebuilder Index rebuilder implemented as harvester that reads all documents from an index and pushes them back into the index.
RemoveDuplicates Maintenance component for removing duplicate documents in index.
SingleFileEntitiesHarvester Abstract harvester class for single file entities (like files from web page or from a local directory).
WebCrawlingHarvester Harvester for traversing websites and harvesting XML documents.
XPathResolverImpl Helper class that implements several XPath interfaces to supply variables and functions to XPath expressions.
ZipFileHarvester Harvester for unzipping ZIP files and reading their contents.
 

Enum Summary
DocumentErrorAction Enumeration that specifies what action should be taken on a document error.
 

Exception Summary
IndexBuilderBackgroundFailure IndexBuilder throws this exception, if an error occurs in the indexer/converter threads.
OAIException This exception is thrown on an OAI protocol error, which is triggered by an <error> response from the repository.
RetryAfterIOException Thrown when HTTP server responds with HttpURLConnection.HTTP_UNAVAILABLE.
 

Package de.pangaea.metadataportal.harvester Description

This package provides panFMP's harvesters and their corresponding classes.



Copyright ©2007-2009 panFMP Developers c/o Uwe Schindler