Datastax Cassandra Spark Connector node awareness

Time:02-11

When DataStax Cassandra community edition is integrated with Spark community edition through the community edition of spark-cassandra-connector, is the integration node-aware, or is that feature reserved for the Enterprise editions?

By node awareness I mean whether Spark sends job execution to the nodes that own the data.

CodePudding user response:

Yes, the Spark connector is node-aware and will function in that manner with both DSE and (open source) Apache Cassandra.

In fact, on a SELECT it knows how to hash the partition keys to a token and sends queries for specific token ranges only to the nodes responsible for that data. It can do this because, like the Cassandra Java driver, it can see cluster metadata that originates from node-to-node gossip, such as node status (up/down) and token range assignment.
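To make the token idea concrete, here is a minimal sketch in plain Python. It is not the connector's code. It assumes a toy ring with one token per node (real clusters usually use many virtual nodes), ignores replication, and uses an MD5-based stand-in for the hash, whereas Cassandra's default partitioner is Murmur3. The node addresses, token values, and function names are all made up for illustration.

```python
import bisect
import hashlib

# Hypothetical ring: each node owns the range that ends at its token.
RING = {
    -6_000_000_000_000_000_000: "10.0.0.1",
    0: "10.0.0.2",
    6_000_000_000_000_000_000: "10.0.0.3",
}


def token_for(partition_key: str) -> int:
    """Map a partition key to a signed 64-bit token (stand-in hash, not Murmur3)."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big", signed=True)


def owner_of(token: int, ring=RING) -> str:
    """Return the node whose token is the first one >= the given token.

    Tokens above the highest node token wrap around to the first node.
    """
    tokens = sorted(ring)
    idx = bisect.bisect_left(tokens, token)
    return ring[tokens[idx % len(tokens)]]


def group_by_owner(keys, ring=RING):
    """Group partition keys by owning node, so each group can be sent to one node."""
    groups = {}
    for key in keys:
        groups.setdefault(owner_of(token_for(key), ring), []).append(key)
    return groups


if __name__ == "__main__":
    for node, keys in group_by_owner(["alice", "bob", "carol", "dave"]).items():
        print(node, keys)
```

A scheduler that knows which node owns each group can then prefer to run the work for that group on (or near) that node, which is the behaviour the question calls node awareness.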
