Case study - statistical analysis, and build a classifier for predicting high cost patients. This is timed to be performed in 3 hours. Totally ridiculous dataset - you'll get 18 files to analyze with each file containing from 120 - 20k rows, and each file has about 10-35+ columns. And you won't get any data schema, so you'll have no idea on the PK, FK. And PK are different for each file. You'll run out of memory, so analyze each file separately. And also look into the from_date from claims transactions file to include only latest transactions, as some of it is from 1929.
Sr Data Scientist Interview Questions
3,387 sr data scientist interview questions shared by candidates
Leetcode medium and some ML depth related questions.
How would you join and summarize these tables? If you had an app that recommends restaurants based on customer reviews (think yelp), how would your app make recommendations for new restaurants (for which there would be no reviews)?
1. Questions about RAG, chunking techniques, Retrieval Optimization, Prompting techniques 2. BERT and embeddings 3. Cloud Managed AI questions
896. Monotonic Array An array is monotonic if it is either monotone increasing or monotone decreasing. An array nums is monotone increasing if for all i <= j, nums[i] <= nums[j]. An array nums is monotone decreasing if for all i <= j, nums[i] >= nums[j]. Given an integer array nums, return true if the given array is monotonic, or false otherwise.
Create a ML model to detect fraud in transactional data.
Almost no Data Science questions.
NLP concepts. fundamental data science questions .
Viewing 671 - 680 interview questions