Describe a project that you have done. Discuss the overall flow of how it works.
Problem: Given time-series clickstream data of user activity stored in Hive, enrich the data with a session ID using Spark.
Session definition: A session expires after 1 hour of inactivity.
Data:
click_time,user_id
2018-01-01 11:00:00,u1, u1s1
2018-01-01 12:10:00,u1, u1s2
2018-01-01 13:00:00,u1, u1s2
2018-01-01 13:50:00,u1
2018-01-01 14:40:00,u1
2018-01-01 15:30:00,u1
2018-01-01 16:20:00,u1
2018-01-01 16:50:00,u1
2018-01-01 11:00:00,u2
2018-01-02 11:05:00,u2
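One possible way to reason about the sessionization question above is sketched here in plain Python rather than Spark, so that it runs without a cluster. It is not the interviewer's expected answer. The function name `assign_sessions` and the `<user_id>s<n>` ID format (inferred from the `u1s1`/`u1s2` values in the sample) are assumptions, as is treating a gap of exactly one hour as still belonging to the same session. In Spark, the same idea is usually expressed with a window partitioned by `user_id` and ordered by `click_time`, comparing each row to the previous one with `lag` and taking a running sum of the "new session" flags.

```python
from datetime import datetime, timedelta
from itertools import groupby

# Assumption: a new session starts only when the gap is strictly greater than 1 hour.
SESSION_TIMEOUT = timedelta(hours=1)
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def assign_sessions(clicks):
    """Enrich (click_time, user_id) pairs with a per-user session ID.

    Returns a list of (click_time, user_id, session_id) tuples,
    ordered by user and then by click time.
    """
    # Sort by user, then by timestamp, so each user's clicks are contiguous.
    parsed = sorted(
        (user_id, datetime.strptime(click_time, TIME_FORMAT))
        for click_time, user_id in clicks
    )

    enriched = []
    for user_id, rows in groupby(parsed, key=lambda row: row[0]):
        session_no = 0
        previous = None
        for _, click_time in rows:
            # First click of a user, or a gap longer than the timeout,
            # opens a new session.
            if previous is None or click_time - previous > SESSION_TIMEOUT:
                session_no += 1
            previous = click_time
            enriched.append(
                (click_time.strftime(TIME_FORMAT), user_id, f"{user_id}s{session_no}")
            )
    return enriched


if __name__ == "__main__":
    sample = [
        ("2018-01-01 11:00:00", "u1"),
        ("2018-01-01 12:10:00", "u1"),
        ("2018-01-01 13:00:00", "u1"),
        ("2018-01-01 11:00:00", "u2"),
        ("2018-01-02 11:05:00", "u2"),
    ]
    for row in assign_sessions(sample):
        print(",".join(row))
```

On the first three `u1` rows of the sample, this yields `u1s1`, `u1s2`, `u1s2`, matching the session IDs shown in the question: the 70-minute gap before 12:10 opens a new session, while the 50-minute gap before 13:00 does not.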
Sr Data Engineer Interview Questions
2,565 sr data engineer interview questions shared by candidates
Search words in a huge file of characters.
Types of shuffling in Apache Spark
Tell me about a time when most of the other team members disagreed with you and you managed to convince them of your opinion.
Primarily SQL questions around basic joins, aggregates, and filtering. Subsequent questions involved computing a median function from scratch, which is non-trivial, and using the PIVOT function in SQL.
Current Project Experience
Describe fields in the dimension schema that you used in data warehouse.
What do you know about the company?
Basic problem-solving and coding skills; scenario-based questions
Why do you want to work at Imperfect Foods?