i) Spark architecture ii) Difference between RDDs, DataFrames and Datasets iii) What is fault tolerance and how does spark handle it iv) Memory management and garbage collection in Spark v) One SparkSQL based question vi) One PySpark based question vii) ETL implementation in AWS Glue
Big Data Consultant Interview Questions
1,784 big data consultant interview questions shared by candidates
kafka, zookeper, spark, kerberos, CDP, hive
what is RDBM? what is query ?
More on java spark and hive if you able to make the concept clear to them there is a chance and the two person took my interview both with good knowledge of same. Overall simple process and they are very co-operate.
how much CTC you are currently having how much percent of hike you are getting
Tell me about yourself? What you want to do in future?
some SQL problem that examined knowledge about the JOIN function
what is implicits? what is accumulator? what is broadcast variable? partition and bucketing colease and redistribution. rdd and dataframe what is mapping what is val and var
When you will not use docker
Difference between Structural databases and unstructured databases.
Viewing 1151 - 1160 interview questions