1st round: Codility exam, a simple Java DS/algo-based question. 2nd round: F2F (virtual) technical round. 3rd round: managerial/technical.
Sr Data Engineer Interview Questions
2,563 sr data engineer interview questions shared by candidates
Kafka out-of-sync replicas
1) Storm and Kafka
2) Spark Streaming and Structured Streaming
3) Spark RDD, DataFrame and Dataset
4) Partitioning and bucketing scenarios
5) SQL functions practice
6) Test cases
7) Difference between Spark HiveContext, SQLContext and SparkSession
8) Data modeling for big data
9) Dimension tables and fact tables
10) Data crunching
11) Web services and microservices applications
12) 1000 files with a particular set of id and store id columns; the final outcome should be like
13) count of stores for each repeating id
14) So ids and stores are repeating
15) How to deal with 1000 files
16) How the steps of a SQL query work
17) Algorithm with a Spark program
18) How will it check job dependency, so that the second job runs only after the first one completes
19) Oozie scheduler
20) Look into titan glm code and Spark functions
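One possible reading of the 1000-files scenario (items 12 to 15) is: every file has an id column and a store id column, and the goal is the number of distinct stores for each id that occurs more than once. The sketch below uses plain Python under that assumption. The column names `id` and `store_id`, the CSV format and the function name are assumptions for illustration, not something the question specifies.

```python
import csv
from collections import defaultdict


def count_stores_per_repeating_id(paths):
    """Return {id: number of distinct store ids} for ids seen in more than one row."""
    stores_by_id = defaultdict(set)  # id -> distinct store ids (assumed column: store_id)
    rows_by_id = defaultdict(int)    # id -> number of rows seen (assumed column: id)
    for path in paths:
        # Stream one file at a time; only the per-id sets and counters stay in memory.
        with open(path, newline="") as handle:
            for row in csv.DictReader(handle):
                rows_by_id[row["id"]] += 1
                stores_by_id[row["id"]].add(row["store_id"])
    # Keep only ids that repeat, i.e. appear in more than one row across all files.
    return {
        key: len(stores)
        for key, stores in stores_by_id.items()
        if rows_by_id[key] > 1
    }
```

It could be called as `count_stores_per_repeating_id(glob.glob("data/*.csv"))`, where `data/` is a hypothetical directory. In Spark the same shape would typically be a group-by on the id column with a distinct count of stores, reading all files in one pass.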
What are Spark optimisation techniques?
Need to be very good at fundamentals: DWH, RDBMS, distributed processing.
SQL, Python, AWS, Database, Performance Tuning
Mostly covered GCP BigQuery, Spark and Big Data
About past experience and tech stack
Relevant projects and best practices
What specific tools or libraries do you use to implement data quality standards?