Data Engineer Interview Questions

Data Engineer Interview Questions

I Data engineer sono professionisti informatici richiesti pressoché in tutti i settori. Si occupano di monitorare i trend dei dati per pianificare le azioni più adeguate che un'azienda deve intraprendere. Uno degli aspetti più critici del lavoro di un Data engineer è l'elaborazione dei dati grezzi e la loro trasformazione in dati utilizzabili per creare pipeline e sistemi di dati.

Domande tipiche dei colloqui per Data engineer e come rispondere

Question 1

Domanda 1: Puoi descrivere in dettaglio il tuo livello di competenza nell'ambito dei linguaggi di programmazione?

How to answer
Come rispondere: Prima del colloquio, ripassa il tuo CV e/o portfolio e stila un elenco dei programmi che conosci meglio. Se scopri di non avere una buona conoscenza del programma usato in prevalenza nell'azienda, descrivi te stesso come una persona intraprendente e altamente motivata, che si impegnerà senza sosta per imparare a usare il programma.
Question 2

Domanda 2: Spiega a parole tue che cos’è il data engineering.

How to answer
Come rispondere: Analizza il tuo ruolo in relazione all'azienda e ad altri ruoli quali il data scientist, così da definire in modo chiaro il tuo contributo al sistema aziendale nel suo complesso. Spiega la differenza tra il ruolo di un ingegnere che lavora ai database e quello di un ingegnere che si occupa di pipeline.
Question 3

Domanda 3: Puoi descrivere un'esperienza lavorativa con Apache Hadoop e in ambienti di gestione dei dati nel cloud?

How to answer
Come rispondere: Per prepararti a questa domanda, fai le dovute ricerche sul software utilizzato dall'azienda, sui prodotti cloud per i dati e sull'uso di Apache Hadoop. I Data engineer devono avere un'ottima padronanza dei linguaggi di programmazione e dei sistemi di gestione dei dati utilizzati nel settore, quali Apache Hadoop.

20,133 data engineer interview questions shared by candidates

want you to write me a simple spell checking engine. The query language is a very simple regular expression-like language, with one special character: . (the dot character), which means EXACTLY ONE character (it can be any character). So, for example, 'c.t' would match 'cat' as the dot matches any character. There may be any number of dot characters in the query (or none). Your spell checker will have to be optimized for speed, so you will have to write it in the required way. There would be a one-time setUp() function that does any pre-processing you require, and then there will be an isMatch() function that should run as fast as possible, utilizing that pre-processing. There are some examples below, feel free to ask for clarification. Word List: [cat, bat, rat, drat, dart, drab] Queries: cat -> true c.t -> true .at -> true ..t -> true d..t -> true dr.. -> true ... -> true .... -> true ..... -> false h.t -> false c. -> false */ // write a function // Struct setup(List<String> list_of_words) // Do whatever processing you want here // with reasonable efficiency. // Return whatever data structures you want. // This function will only run once // write a function // bool isMatch(Struct struct, String query) // Returns whether the query is a match in the // dictionary (True/False) // Should be optimized for speed
avatar

Data Engineer

Interviewed at Meta

3.6
May 22, 2020

want you to write me a simple spell checking engine. The query language is a very simple regular expression-like language, with one special character: . (the dot character), which means EXACTLY ONE character (it can be any character). So, for example, 'c.t' would match 'cat' as the dot matches any character. There may be any number of dot characters in the query (or none). Your spell checker will have to be optimized for speed, so you will have to write it in the required way. There would be a one-time setUp() function that does any pre-processing you require, and then there will be an isMatch() function that should run as fast as possible, utilizing that pre-processing. There are some examples below, feel free to ask for clarification. Word List: [cat, bat, rat, drat, dart, drab] Queries: cat -> true c.t -> true .at -> true ..t -> true d..t -> true dr.. -> true ... -> true .... -> true ..... -> false h.t -> false c. -> false */ // write a function // Struct setup(List<String> list_of_words) // Do whatever processing you want here // with reasonable efficiency. // Return whatever data structures you want. // This function will only run once // write a function // bool isMatch(Struct struct, String query) // Returns whether the query is a match in the // dictionary (True/False) // Should be optimized for speed

""" # Question 2: # Fill in the blanks # # Given an array containing None values fill in the None values # with most recent non None value in the array # # For example: # - input array: [1,None,2,3,None,None,5,None] # # - output array: [1,1,2,3,3,3,5,5] #
avatar

Data Engineer

Interviewed at Meta

3.6
Jun 8, 2020

""" # Question 2: # Fill in the blanks # # Given an array containing None values fill in the None values # with most recent non None value in the array # # For example: # - input array: [1,None,2,3,None,None,5,None] # # - output array: [1,1,2,3,3,3,5,5] #

Python questions: 1. Replace None value with previous value present in a list. 2. Given a ´dictionary, print the key for nth highest value present in the dict. If there are more than 1 record present for nth highest value then sort the key and print the first one. 3. Given two sentences, you have to print the words those are not present in either of the sentences.(If one word is present twice in 1st sentence but not present in 2nd sentence then you have to print that word too) 4. Forgot another question you have to pass all the cases, specially on edge cases. SQL questions: 1. Mostly % calculation, also refer the questions available here in Glassdoor. Next will be my Onsite Interview of 3.30hr. If anyone can help me on that or else I will update later on that interview as well.
avatar

Data Engineer

Interviewed at Meta

3.6
Mar 9, 2021

Python questions: 1. Replace None value with previous value present in a list. 2. Given a ´dictionary, print the key for nth highest value present in the dict. If there are more than 1 record present for nth highest value then sort the key and print the first one. 3. Given two sentences, you have to print the words those are not present in either of the sentences.(If one word is present twice in 1st sentence but not present in 2nd sentence then you have to print that word too) 4. Forgot another question you have to pass all the cases, specially on edge cases. SQL questions: 1. Mostly % calculation, also refer the questions available here in Glassdoor. Next will be my Onsite Interview of 3.30hr. If anyone can help me on that or else I will update later on that interview as well.

You have a 2-D array of friends like [[A,B],[A,C],[B,D],[B,C],[R,M], [S],[P], [A]] Write a function that creates a dictionary of how many friends each person has. People can have 0 to many friends. However, there won't be repeat relationships like [A,B] and [B,A] and neither will there be more than 2 people in a relationship
avatar

Data Engineer

Interviewed at Meta

3.6
Oct 31, 2018

You have a 2-D array of friends like [[A,B],[A,C],[B,D],[B,C],[R,M], [S],[P], [A]] Write a function that creates a dictionary of how many friends each person has. People can have 0 to many friends. However, there won't be repeat relationships like [A,B] and [B,A] and neither will there be more than 2 people in a relationship

SQL questions - A table schema with tables like employee, department, employee_to_projects, projects 1) Select employee from departments where max salary of the department is 40k 2) Select employee assigned to projects 3) Select employee which have the max salary in a given department 4) Select employee with second highest salary 5) Table has two data entries every day for # of apples and oranges sold. write a query to get the difference between the apples and oranges sold on a given day
avatar

Data Engineer

Interviewed at Meta

3.6
May 24, 2016

SQL questions - A table schema with tables like employee, department, employee_to_projects, projects 1) Select employee from departments where max salary of the department is 40k 2) Select employee assigned to projects 3) Select employee which have the max salary in a given department 4) Select employee with second highest salary 5) Table has two data entries every day for # of apples and oranges sold. write a query to get the difference between the apples and oranges sold on a given day

Viewing 1 - 10 interview questions

Glassdoor has 20,133 interview questions and reports from Data engineer interviews. Prepare for your interview. Get hired. Love your job.