What Is Spark SQL?
The Spark SQL is used for real-time, in-memory and parallelized SQL-on-Hadoop engine.
The Spark SQL is not a general purpose SQL layer and it’s used to allow us to do several advanced analytics with data.
The Spark SQL supports only a subset of SQL functionality and users have to write code in Java, Python and so on to execute a query.
Great Features of Spark SQL -
ü Spark SQL provides security through encryption using SSL for HTTP protocols.
ü The Spark SQL supports lots of features to analysis the large scale of data.
ü The Spark SQL supports lots of data types for machine learning.
ü In the Spark SQL, you can easily to write data pipelines.
ü In the Spark SQL, easy to add optimization rules, data types and data source by using the Scala programming language
When To Use Spark SQL?
Spark SQL is the best SQL-on-Hadoop tool and best used of Spark SQL is fetch data for diverse machine learning tasks.
Disadvantage of Spark SQL -
The Spark SQL is lacks advanced security features.
I hope you are enjoying with this post! Please share with you friends!! Thank you!!!