The Spark + Kinesis integration example does not look like a Kinesis Client Library (KCL) application
I am going through the code at https://github.com/apache/spark/blob/master/extras/kinesis-asl/src/main/java/org/apache/spark/examples/streaming/javakinesiswordcountasl.java
It shows how Kinesis can emit stream data to a SparkContext, which Spark can then process further.
In the given example I am trying to understand how the code is a KCL application. The page http://docs.aws.amazon.com/kinesis/latest/dev/kinesis-record-processor-implementation-app-java.html says you must complete the following tasks when implementing an Amazon Kinesis application in Java:
Tasks:
- Implement the IRecordProcessor methods
- Implement a class factory for the IRecordProcessor interface
- Modify the configuration properties
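To make the three tasks concrete, here is a minimal sketch of their shape. It does not use the real KCL classes: `SketchRecordProcessor`, `SketchRecordProcessorFactory`, `SketchConfig` and `SketchWorker` are hypothetical stand-ins, only loosely modeled on the KCL (the real IRecordProcessor also deals with checkpointing and has different signatures), and the shard data is made up for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the KCL's IRecordProcessor (simplified, not the real signature).
interface SketchRecordProcessor {
    void initialize(String shardId);
    void processRecords(List<String> records);
    void shutdown();
}

// Task 2: a factory the worker calls to get one processor per shard.
interface SketchRecordProcessorFactory {
    SketchRecordProcessor createProcessor();
}

// Task 1: implement the processor methods. This one just tags each record with its shard.
final class CollectingProcessor implements SketchRecordProcessor {
    private final List<String> sink;
    private String shardId;

    CollectingProcessor(List<String> sink) { this.sink = sink; }

    @Override public void initialize(String shardId) { this.shardId = shardId; }

    @Override public void processRecords(List<String> records) {
        for (String record : records) {
            sink.add(shardId + ":" + record);
        }
    }

    @Override public void shutdown() { /* nothing to release in this sketch */ }
}

// Task 3: configuration properties (names here are assumptions, not KCL property names).
final class SketchConfig {
    final String applicationName;
    final String streamName;

    SketchConfig(String applicationName, String streamName) {
        this.applicationName = applicationName;
        this.streamName = streamName;
    }
}

// Hypothetical worker: a real one would use the config to find the stream and its shards.
final class SketchWorker {
    private final SketchConfig config;
    private final SketchRecordProcessorFactory factory;

    SketchWorker(SketchConfig config, SketchRecordProcessorFactory factory) {
        this.config = config;
        this.factory = factory;
    }

    // Drives one processor per shard; the shard contents are passed in directly here.
    void run(Map<String, List<String>> shards) {
        for (Map.Entry<String, List<String>> shard : shards.entrySet()) {
            SketchRecordProcessor processor = factory.createProcessor();
            processor.initialize(shard.getKey());
            processor.processRecords(shard.getValue());
            processor.shutdown();
        }
    }
}

final class KclTasksSketch {
    static List<String> demo() {
        List<String> out = new ArrayList<>();
        SketchConfig config = new SketchConfig("word-count-app", "my-stream");
        SketchWorker worker = new SketchWorker(config, () -> new CollectingProcessor(out));

        Map<String, List<String>> shards = new LinkedHashMap<>();
        shards.put("shard-0", Arrays.asList("a", "b"));
        shards.put("shard-1", Arrays.asList("c"));
        worker.run(shards);
        return out;
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints [shard-0:a, shard-0:b, shard-1:c]
    }
}
```

The point of the sketch is only that a plain KCL application owns all three pieces itself: the processor, the factory, and the configuration handed to a worker.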
But the Spark example code at https://github.com/apache/spark/blob/master/extras/kinesis-asl/src/main/java/org/apache/spark/examples/streaming/javakinesiswordcountasl.java has no reference to IRecordProcessor, Worker, etc.
Note: https://spark.apache.org/docs/1.2.0/streaming-kinesis-integration.html, under the Deploying section, says a single Kinesis input DStream can read from multiple shards of a Kinesis stream by creating multiple KinesisRecordProcessor threads. There is no implementation of KinesisRecordProcessor in the example. Is it missing, or am I missing something obvious?
Could you please explain how this is a KCL application?
The Spark Kinesis streaming implementation takes care of the KCL interactions and abstracts them away from the app. See https://github.com/apache/spark/tree/master/extras/kinesis-asl/src/main/scala/org/apache/spark/streaming/kinesis
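As a rough illustration of what "abstracts away" means here, the sketch below shows a receiver that builds its own record processor and worker loop internally, so the application code never mentions them. Everything in it is a hypothetical stand-in written for this answer (`SketchReceiver`, `ShardProcessor`, `KinesisLikeReceiver`), not Spark's or the KCL's actual classes. As far as I can tell, the linked directory contains the real counterparts, but check the source rather than trusting this sketch for details.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for a streaming receiver: it hands records to the framework via store().
abstract class SketchReceiver {
    private final List<String> stored = new ArrayList<>();

    protected void store(String record) { stored.add(record); }

    List<String> storedRecords() { return stored; }

    abstract void onStart();
}

// Hypothetical stand-in for a KCL-style record processor.
interface ShardProcessor {
    void processRecords(String shardId, List<String> records);
}

// The receiver owns the processor and the "worker" loop; callers never see either.
final class KinesisLikeReceiver extends SketchReceiver {
    private final List<List<String>> shards; // made-up shard contents for the sketch

    KinesisLikeReceiver(List<List<String>> shards) { this.shards = shards; }

    @Override void onStart() {
        // The processor simply forwards every record into the receiver's store().
        ShardProcessor processor = (shardId, records) -> {
            for (String record : records) {
                store(record);
            }
        };
        // Stand-in for the worker: feed each shard's records to the processor.
        for (int i = 0; i < shards.size(); i++) {
            processor.processRecords("shard-" + i, shards.get(i));
        }
    }
}

final class ReceiverSketch {
    // "Application" code: a word count that only knows about the receiver's output.
    static Map<String, Integer> wordCount(SketchReceiver receiver) {
        receiver.onStart();
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : receiver.storedRecords()) {
            for (String word : line.split(" ")) {
                counts.merge(word, 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        SketchReceiver receiver = new KinesisLikeReceiver(Arrays.asList(
                Arrays.asList("spark kinesis", "spark"),
                Arrays.asList("kinesis kcl")));
        System.out.println(wordCount(receiver)); // prints {kcl=1, kinesis=2, spark=2}
    }
}
```

Note that `wordCount` never references a processor, factory, or worker, which is the same reason the word count example does not: those pieces live inside the receiver side of the integration.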