- datatype - Variable in class de.pangaea.metadataportal.config.FieldConfig
-
- datestamp - Variable in class de.pangaea.metadataportal.processor.MetadataDocument
-
- dbf - Static variable in class de.pangaea.metadataportal.utils.StaticFactories
-
- de.pangaea.metadataportal - package de.pangaea.metadataportal
-
This is the root package of panFMP.
- de.pangaea.metadataportal.config - package de.pangaea.metadataportal.config
-
This package provides classes and methods for panFMP's configuration (config.xml
).
- de.pangaea.metadataportal.harvester - package de.pangaea.metadataportal.harvester
-
This package provides panFMP's harvesters and their corresponding classes.
- de.pangaea.metadataportal.processor - package de.pangaea.metadataportal.processor
-
This package provides panFMP's document processor (used to parse and transform documents for indexing with Elasticsearch) and corresponding classes.
- de.pangaea.metadataportal.push - package de.pangaea.metadataportal.push
-
This package contains a simple HTTP server to push XML documents to index panFMP.
- de.pangaea.metadataportal.utils - package de.pangaea.metadataportal.utils
-
This package provides utility classes for panFMP.
- DEFAULT_BULK_MEMORY - Static variable in class de.pangaea.metadataportal.processor.DocumentProcessor
-
- DEFAULT_BULK_SIZE - Static variable in class de.pangaea.metadataportal.processor.DocumentProcessor
-
- DEFAULT_CONCURRENT_BULK_REQUESTS - Static variable in class de.pangaea.metadataportal.processor.DocumentProcessor
-
- DEFAULT_CONTENT_TYPE - Static variable in class de.pangaea.metadataportal.processor.DocumentProcessor
-
- DEFAULT_DELETE_UNSEEN_BULK_SIZE - Static variable in class de.pangaea.metadataportal.processor.DocumentProcessor
-
- DEFAULT_MAX_QUEUE - Static variable in class de.pangaea.metadataportal.processor.DocumentProcessor
-
- DEFAULT_NUM_THREADS - Static variable in class de.pangaea.metadataportal.processor.DocumentProcessor
-
- DEFAULT_RETRY_COUNT - Static variable in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
- DEFAULT_RETRY_COUNT - Static variable in class de.pangaea.metadataportal.harvester.WebCrawlingHarvester
-
- DEFAULT_RETRY_COUNT - Static variable in class de.pangaea.metadataportal.harvester.ZipFileHarvester
-
- DEFAULT_RETRY_TIME - Static variable in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
- DEFAULT_RETRY_TIME - Static variable in class de.pangaea.metadataportal.harvester.WebCrawlingHarvester
-
- DEFAULT_RETRY_TIME - Static variable in class de.pangaea.metadataportal.harvester.ZipFileHarvester
-
- DEFAULT_TIMEOUT - Static variable in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
- DEFAULT_TIMEOUT - Static variable in class de.pangaea.metadataportal.harvester.WebCrawlingHarvester
-
- DEFAULT_TIMEOUT - Static variable in class de.pangaea.metadataportal.harvester.ZipFileHarvester
-
- defaultValue - Variable in class de.pangaea.metadataportal.config.FieldConfig
-
- deleted - Variable in class de.pangaea.metadataportal.processor.MetadataDocument
-
- deleteDocument(String) - Method in class de.pangaea.metadataportal.harvester.Harvester
-
Queues the given ID for deletion.
- deleteDocument(String) - Method in class de.pangaea.metadataportal.push.PushWrapperHarvester
-
- deleteMissingDocuments - Variable in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
If enabled, on any kind of full harvesting it will track all valid identifiers and delete all of them not seen in index.
- destContentHandler - Variable in class de.pangaea.metadataportal.utils.SaxRule
-
- DirectoryHarvester - Class in de.pangaea.metadataportal.harvester
-
Harvester for traversing file system directories.
- DirectoryHarvester(HarvesterConfig) - Constructor for class de.pangaea.metadataportal.harvester.DirectoryHarvester
-
- disable() - Method in class de.pangaea.metadataportal.utils.SimpleCookieHandler
-
Cleans up the cookie list and disables the handler.
- DOCUMENT_PROCESSOR_NAMESPACE - Static variable in class de.pangaea.metadataportal.processor.XPathResolverImpl
-
- DocumentErrorAction - Enum in de.pangaea.metadataportal.processor
-
Enumeration that specifies what action should be taken on a document error.
- DocumentProcessor - Class in de.pangaea.metadataportal.processor
-
Component of panFMP that analyzes and indexes harvested documents in a thread pool.
- doError(String, String) - Method in class de.pangaea.metadataportal.harvester.OAIHarvester
-
Deprecated.
- dombuilder - Static variable in class de.pangaea.metadataportal.utils.StaticFactories
-
- DoNothingRule() - Constructor for class de.pangaea.metadataportal.utils.ExtendedDigester.DoNothingRule
-
- doParse(Supplier<ExtendedDigester>, String, AtomicReference<Instant>) - Method in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
Harvests a URL using the suplied digester.
- get() - Static method in class de.pangaea.metadataportal.Package
-
Gets package object from classloader.
- get(URI, Map<String, List<String>>) - Method in class de.pangaea.metadataportal.utils.SimpleCookieHandler
-
- getCode() - Method in exception de.pangaea.metadataportal.harvester.OAIException
-
- getContentHandler() - Method in class de.pangaea.metadataportal.utils.SaxRule
-
return the current ContentHandler
.
- getConverter() - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Returns a converter instance that does transformation and validation
according to index config.
- getCurrentAssignedPrefixes() - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
Returns all current namespace prefix that are assigned.
- getCurrentNamespaceContext(boolean, boolean) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
- getCurrentNamespaceForPrefix(String) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
Returns the current namespace URI for the given prefix.
- getCurrentNamespaceMap() - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
Returns the current namespace prefix mappings as modifiable
Map
containing the prefix and the current namespace assignment (it is just a
copy of the internal representation's current mapping).
- getCurrentVariableMap() - Method in class de.pangaea.metadataportal.processor.XPathResolverImpl
-
- getCustomContentHandler() - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
Gets the custom event handler.
- getDatestamp() - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
- getDebugStream(Log) - Static method in class de.pangaea.metadataportal.utils.LogUtil
-
Returns a PrintStream that logs all output with DEBUG method to the given
Commons Logging logger.
- getDefault() - Method in class de.pangaea.metadataportal.config.FieldConfig
-
- getDocumentProcessor(HarvesterConfig, String) - Method in class de.pangaea.metadataportal.processor.ElasticsearchConnection
-
- getEntityResolver(EntityResolver) - Method in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
- getErrorHandler() - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
Not supported, always returns null
- getErrorStream(Log) - Static method in class de.pangaea.metadataportal.utils.LogUtil
-
Returns a PrintStream that logs all output with ERROR method to the given
Commons Logging logger.
- getFatalStream(Log) - Static method in class de.pangaea.metadataportal.utils.LogUtil
-
Returns a PrintStream that logs all output with FATAL method to the given
Commons Logging logger.
- getFinalDOM() - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Returns XML contents as DOM tree.
- getFullPackageDescription() - Static method in class de.pangaea.metadataportal.Package
-
Gets a version string to print out.
- getIdentifier() - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
- getInfoStream(Log) - Static method in class de.pangaea.metadataportal.utils.LogUtil
-
Returns a PrintStream that logs all output with INFO method to the given
Commons Logging logger.
- getInputSource(URL, AtomicReference<Instant>) - Method in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
Returns a SAX InputSource
for retrieving stream data of an
URL.
- getInstance() - Static method in class de.pangaea.metadataportal.processor.XPathResolverImpl
-
- getKeyValuePairs() - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Converts this instance to a Elasticsearch JSON node
- getMessage() - Method in exception de.pangaea.metadataportal.harvester.OAIException
-
- getMetadataDocumentFactory() - Method in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
Returns a factory for creating the
MetadataDocument
s in Digester
code (using
FactoryCreateRule
).
- getName() - Method in class de.pangaea.metadataportal.config.FieldConfig
-
- getProductName() - Static method in class de.pangaea.metadataportal.Package
-
Gets product name ("panFMP").
- getProductVendor() - Static method in class de.pangaea.metadataportal.Package
-
Gets product vendor (the developer team).
- getRawIndexName(boolean) - Method in class de.pangaea.metadataportal.config.TargetIndexConfig
-
- getRetryAfter() - Method in exception de.pangaea.metadataportal.harvester.RetryAfterIOException
-
- getSets() - Method in class de.pangaea.metadataportal.harvester.OAIMetadataDocument
-
Returns all OAI sets as
Set
.
- getTraceStream(Log) - Static method in class de.pangaea.metadataportal.utils.LogUtil
-
Returns a PrintStream that logs all output with TRACE method to the given
Commons Logging logger.
- getTransformContentHandler() - Method in class de.pangaea.metadataportal.processor.MetadataDocument.XMLConverter
-
- getValidHarvesterPropertyNames() - Method in class de.pangaea.metadataportal.harvester.Harvester
-
Return the Set
of harvester property names that this harvester
supports.
- getVersion() - Static method in class de.pangaea.metadataportal.Package
-
Gets version of panFMP.
- getWarnStream(Log) - Static method in class de.pangaea.metadataportal.utils.LogUtil
-
Returns a PrintStream that logs all output with WARN method to the given
Commons Logging logger.
- getXML() - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Returns XML contents as String (a cache is used).
- globalHarvesterProperties - Variable in class de.pangaea.metadataportal.config.TargetIndexConfig
-
- saxFactory - Static variable in class de.pangaea.metadataportal.utils.StaticFactories
-
- SaxRule - Class in de.pangaea.metadataportal.utils
-
This class is used as a rule for included documents during
Digester
parsing.
- SaxRule() - Constructor for class de.pangaea.metadataportal.utils.SaxRule
-
Default constructor
- schema - Variable in class de.pangaea.metadataportal.config.Config
-
- serializeToContentBuilder(XContentBuilder) - Method in class de.pangaea.metadataportal.utils.KeyValuePairs
-
Serializes the object to a sequence of fields.
- setAugmentation(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setContentHandler(ContentHandler) - Method in class de.pangaea.metadataportal.utils.SaxRule
-
Sets the SAX ContentHandler
that gets all SAX Events after the
startElement
event.
- setCustomContentHandler(ContentHandler) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
Sets a custom
ContentHandler
, that receives all SAX events until
disabled (
null
).
- setDataType(String) - Method in class de.pangaea.metadataportal.config.FieldConfig
-
Deprecated.
- setDatestamp(Instant) - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Set the datestamp (last modification time of document file).
- setDatestampField(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setDefault(String) - Method in class de.pangaea.metadataportal.config.FieldConfig
-
- setDeleted(boolean) - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Marks a harvested document as deleted.
- setDigester(Digester) - Method in class de.pangaea.metadataportal.utils.SaxRule
-
Set the Digester
with which this Rule
is
associated.
- setEnabled(boolean) - Method in class de.pangaea.metadataportal.harvester.OAIMetadataSaxRule
-
If enabled, a DOM tree is build from metadata.
- setErrorHandler(ErrorHandler) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
- setEsMappingFile(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setEsSettings(Settings.Builder) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setExcludeNamespaces(Set<String>) - Method in class de.pangaea.metadataportal.utils.SaxRule
-
Sets a Set<String>
containing all Namespace URIs that should not be
feed to the target ContentHandler
on match.
- setFilterDefault(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setFinalDOM(Document) - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Sets XML final (transformed) xml contents as DOM tree.
- setGranularity(String) - Method in class de.pangaea.metadataportal.harvester.OAIHarvester
-
Deprecated.
- setHaltOnSchemaError(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setHarvesterClass(String) - Method in class de.pangaea.metadataportal.config.HarvesterConfig
-
Deprecated.
- setHarvestingDateReference(Instant) - Method in class de.pangaea.metadataportal.harvester.Harvester
-
Reference date of this harvesting event (in time reference of the original
server).
- setHeaderInfo(String, String, String) - Method in class de.pangaea.metadataportal.harvester.OAIMetadataDocument
-
Deprecated.
- setIdentifier(String) - Method in class de.pangaea.metadataportal.processor.MetadataDocument
-
Set the document identifier.
- setIndexSettings(Settings.Builder) - Method in class de.pangaea.metadataportal.config.TargetIndexConfig
-
Deprecated.
- setName(String) - Method in class de.pangaea.metadataportal.config.FieldConfig
-
- setName(ExtendedDigester, String) - Method in class de.pangaea.metadataportal.config.VariableConfig
-
Deprecated.
- setNameSuffix1(String) - Method in class de.pangaea.metadataportal.config.TargetIndexConfig
-
- setNameSuffix2(String) - Method in class de.pangaea.metadataportal.config.TargetIndexConfig
-
- setRequest(Map<String, String>) - Method in class de.pangaea.metadataportal.harvester.OAIHarvester
-
Deprecated.
- setResponseDate(String) - Method in class de.pangaea.metadataportal.harvester.OAIHarvester
-
Deprecated.
- setResumptionToken(String, String, String, String) - Method in class de.pangaea.metadataportal.harvester.OAIHarvester
-
Deprecated.
- setRulesWithInvalidElementCheck(Rules) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
Adds a default Rule for not allowing invalid (not registered) event paths.
- sets - Variable in class de.pangaea.metadataportal.harvester.OAIHarvesterBase
-
the sets to harvest from the configuration, null
to harvest all
- sets - Variable in class de.pangaea.metadataportal.harvester.OAIMetadataDocument
-
- setSchema(String, String) - Method in class de.pangaea.metadataportal.config.Config
-
- setSourceField(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setTemplate(Templates) - Method in class de.pangaea.metadataportal.config.ExpressionConfig
-
- setTemplate(Templates) - Method in class de.pangaea.metadataportal.config.FilterConfig
-
- setType(String) - Method in class de.pangaea.metadataportal.config.FilterConfig
-
Deprecated.
- setTypeName(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setValidIdentifiers(Set<String>) - Method in class de.pangaea.metadataportal.harvester.Harvester
-
Set a set of all "seen" valid identifiers.
- setXMLField(String) - Method in class de.pangaea.metadataportal.config.Config
-
Deprecated.
- setXPath(ExtendedDigester, String) - Method in class de.pangaea.metadataportal.config.ExpressionConfig
-
Deprecated.
- SimpleCookieHandler - Class in de.pangaea.metadataportal.utils
-
A CookieHandler that can be enabled and used per thread.
- SingleFileEntitiesHarvester - Class in de.pangaea.metadataportal.harvester
-
Abstract harvester class for single file entities (like files from web page
or from a local directory).
- SingleFileEntitiesHarvester(HarvesterConfig) - Constructor for class de.pangaea.metadataportal.harvester.SingleFileEntitiesHarvester
-
- SingleFileEntitiesHarvester(HarvesterConfig, DocumentErrorAction) - Constructor for class de.pangaea.metadataportal.harvester.SingleFileEntitiesHarvester
-
- size() - Method in class de.pangaea.metadataportal.utils.KeyValuePairs
-
- skippedEntity(String) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
- startDocument() - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
- startElement(String, String, String, Attributes) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
- startPrefixMapping(String, String) - Method in class de.pangaea.metadataportal.utils.ExtendedDigester
-
- StaticFactories - Class in de.pangaea.metadataportal.utils
-
Some pre-allocated XML factories.