public interface UriCollector
extends org.dice_research.squirrel.sink.SinkBase
getUris(CrawleableUri)
method.Modifier and Type | Method and Description |
---|---|
void |
addNewUri(org.dice_research.squirrel.data.uri.CrawleableUri uri,
org.dice_research.squirrel.data.uri.CrawleableUri newUri)
Adds the given new URI to the list of URIs collected for the given URI.
|
default void |
addNewUri(org.dice_research.squirrel.data.uri.CrawleableUri uri,
org.apache.jena.graph.Node newUri)
Adds the given new URI to the list of URIs collected for the given URI.
|
default void |
addNewUri(org.dice_research.squirrel.data.uri.CrawleableUri uri,
String newUri)
Adds the given new URI to the list of URIs collected for the given URI.
|
default void |
addTriple(org.dice_research.squirrel.data.uri.CrawleableUri uri,
org.apache.jena.graph.Triple triple)
Adds the given triple to the list of URIs collected from the given URI.
|
long |
getSize(org.dice_research.squirrel.data.uri.CrawleableUri uri)
Returns the total of uris that have been collected
|
Iterator<byte[]> |
getUris(org.dice_research.squirrel.data.uri.CrawleableUri uri)
Returns a list of serialized
CrawleableUri instances that have been
collected for the given URI. |
default void addTriple(org.dice_research.squirrel.data.uri.CrawleableUri uri, org.apache.jena.graph.Triple triple)
addNewUri(CrawleableUri, CrawleableUri)
method instead since this
enables the addition of meta data to the collected URI.uri
- The URI from which the given triple has been collected.triple
- The triple that has been collected.void addNewUri(org.dice_research.squirrel.data.uri.CrawleableUri uri, org.dice_research.squirrel.data.uri.CrawleableUri newUri)
uri
- The URI from which the given new URI has been collected.newUri
- The new URI that has been collected.default void addNewUri(org.dice_research.squirrel.data.uri.CrawleableUri uri, org.apache.jena.graph.Node newUri)
uri
- The URI from which the given new URI has been collected.newUri
- The new URI that has been collected.default void addNewUri(org.dice_research.squirrel.data.uri.CrawleableUri uri, String newUri)
uri
- The URI from which the given new URI has been collected.newUri
- The new URI that has been collected.Iterator<byte[]> getUris(org.dice_research.squirrel.data.uri.CrawleableUri uri)
CrawleableUri
instances that have been
collected for the given URI.uri
- The URI from which the returned serialized URIs have been
collected.Iterator
that iterates over the already serialized URIs
that have been collected for the given URI.long getSize(org.dice_research.squirrel.data.uri.CrawleableUri uri)
uri
- The URI from which the returned serialized URIs have been
collected.Copyright © 2017–2020. All rights reserved.