I recently started working with spark and was eager to know if I have to perform queries which would be better spark sql or databricks sql and why?
CodePudding user response:
We need to distinguish two things here:
- Spark SQL as a dialect of the SQL language. Originally started as Shark & Hive on Spark projects (
Now Databricks is platform where you have to pay, it contains Apache SPARK Delta Lake many built in extras.
As expected, performance and SQL dialect between Hadoop and Delta Lake are different since they are different databases.
You can install Delta Lake in Apache Spark so you compare Hadoop vs Delta Lake