public class HdtBasedSink extends FileBasedSink
| Modifier and Type | Class and Description |
|---|---|
| protected class | HdtBasedSink.HDTParser |

Nested classes inherited from class FileBasedSink: FileBasedSink.StreamStatus

| Modifier and Type | Field and Description |
|---|---|
| private ExecutorService | EXECUTION_SERVICE |
| protected String | inputType: Input type for parsing the file. |
| private static org.slf4j.Logger | LOGGER |
| protected FileOutputStream | out: Output stream for metadata. |
| protected File | outputDirectory |

Fields inherited from class FileBasedSink: DEFAULT_OUTPUT_LANG, outputLang, streamMapping, useCompression

| Constructor and Description |
|---|
| HdtBasedSink(File outputDirectory): Creates a temporary file for the FileBasedSink storage. |
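
The constructor description mentions creating a temporary file under the given output directory. The following is a minimal sketch of how such a file could be allocated with the standard `java.nio.file` API. The class name `TempFileSketch`, the method `createSinkTempFile`, and the `sink-`/`.tmp` naming are assumptions made for illustration; this is not the actual HdtBasedSink implementation.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper for illustration only; not part of Squirrel.
class TempFileSketch {

    // Ensures the output directory exists, then creates an empty,
    // uniquely named temporary file inside it.
    static Path createSinkTempFile(Path outputDirectory) throws IOException {
        Files.createDirectories(outputDirectory);
        return Files.createTempFile(outputDirectory, "sink-", ".tmp");
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("hdt-sink-demo");
        Path tmp = createSinkTempFile(dir);
        System.out.println("Created " + tmp.getFileName() + " in " + dir);
        // Clean up the demo files.
        Files.delete(tmp);
        Files.delete(dir);
    }
}
```

Like the documented constructor, the helper propagates `IOException` to the caller rather than handling it.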
| Modifier and Type | Method and Description |
|---|---|
| void | addData(org.dice_research.squirrel.data.uri.CrawleableUri uri, InputStream stream) |
| void | addTriple(org.dice_research.squirrel.data.uri.CrawleableUri uri, org.apache.jena.graph.Triple triple) |
| void | closeSinkForUri(org.dice_research.squirrel.data.uri.CrawleableUri uri): Recovers the generated temporary file and parses it to HDT. |
| void | openSinkForUri(org.dice_research.squirrel.data.uri.CrawleableUri uri) |
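
The method names above suggest a per-URI lifecycle: open the sink, add triples, then close it. The sketch below walks through that call order with a self-contained stand-in. Everything in it is hypothetical: `SinkLifecycleSketch` is not a Squirrel class, URIs and triples are plain strings rather than `CrawleableUri` and Jena `Triple` objects, `closeSinkForUri` returns the buffered file instead of `void`, and the HDT conversion step is omitted.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a per-URI file sink; NOT the Squirrel implementation.
class SinkLifecycleSketch {
    private final Path outputDirectory;
    private final Map<String, Path> files = new HashMap<>();
    private final Map<String, BufferedWriter> writers = new HashMap<>();

    SinkLifecycleSketch(Path outputDirectory) {
        this.outputDirectory = outputDirectory;
    }

    // Mirrors openSinkForUri: allocate a temporary file and a writer for this URI.
    void openSinkForUri(String uri) throws IOException {
        Path file = Files.createTempFile(outputDirectory, "sink-", ".nt");
        files.put(uri, file);
        writers.put(uri, Files.newBufferedWriter(file, StandardCharsets.UTF_8));
    }

    // Mirrors addTriple: append one statement as a line of text.
    void addTriple(String uri, String s, String p, String o) throws IOException {
        BufferedWriter writer = writers.get(uri);
        if (writer == null) {
            throw new IllegalStateException("Sink not open for " + uri);
        }
        writer.write(s + " " + p + " " + o + " .");
        writer.newLine();
    }

    // Mirrors closeSinkForUri: close the writer and hand back the temporary file.
    // A real HDT sink would convert that file to HDT at this point (omitted here).
    Path closeSinkForUri(String uri) throws IOException {
        BufferedWriter writer = writers.remove(uri);
        if (writer == null) {
            throw new IllegalStateException("Sink not open for " + uri);
        }
        writer.close();
        return files.remove(uri);
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("sink-demo");
        SinkLifecycleSketch sink = new SinkLifecycleSketch(dir);
        String uri = "http://example.org/resource";
        sink.openSinkForUri(uri);
        sink.addTriple(uri, "<http://example.org/s>", "<http://example.org/p>", "\"o\"");
        Path result = sink.closeSinkForUri(uri);
        int count = Files.readAllLines(result, StandardCharsets.UTF_8).size();
        System.out.println(count + " statement(s) buffered in " + result);
    }
}
```

Keeping one writer per URI lets several URIs be open at the same time, which is the reason the sketch keys its maps by URI.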

Methods inherited from class FileBasedSink: addQuad, flushMetadata, generateFileName

private static final org.slf4j.Logger LOGGER
private final ExecutorService EXECUTION_SERVICE
protected File outputDirectory
protected String inputType
protected FileOutputStream out
public HdtBasedSink(File outputDirectory) throws IOException
Parameters: outputDirectory
Throws: IOException

public void addTriple(org.dice_research.squirrel.data.uri.CrawleableUri uri, org.apache.jena.graph.Triple triple)
Specified by: addTriple in interface org.dice_research.squirrel.sink.triplebased.TripleBasedSink
Overrides: addTriple in class FileBasedSink

public void openSinkForUri(org.dice_research.squirrel.data.uri.CrawleableUri uri)
Specified by: openSinkForUri in interface org.dice_research.squirrel.sink.SinkBase
Overrides: openSinkForUri in class FileBasedSink

public void closeSinkForUri(org.dice_research.squirrel.data.uri.CrawleableUri uri)
Specified by: closeSinkForUri in interface org.dice_research.squirrel.sink.SinkBase
Overrides: closeSinkForUri in class FileBasedSink

public void addData(org.dice_research.squirrel.data.uri.CrawleableUri uri, InputStream stream)
Specified by: addData in interface org.dice_research.squirrel.sink.UnstructuredDataSink
Overrides: addData in class FileBasedSink

Copyright © 2017–2020. All rights reserved.