Data Engineering Questions – 1

if your #dataengineering experience grows more than 5 years you expect these questions in your interviews…..

1. Explain me the architecture of spark?
2. How does internals job execution happens?
3. what will happen when you fire the Spark Job?
4. How did you tune your jobs?
5. Explain optimizations you have used in your project?
6. How did you connected with external sources? explain me the process?
7. Where did see the logs if job fails?
8. Explain the actions you have taken when your job fails?
9. Explain the project you have handled End to End alone?
10. Explain the pipeline building process you have followed?
11. I have 1 TB of data with me, i want to move to cloud, explain me end to end pipeline for this? explain the sources you use and why?
12. have you worked on testing and maintenance of your code?
13. Tell me the CI/CD process you followed in your project?
14. have you worked on any unix environment?
15. how did you mentor junior when they need help in building project?

Leave a comment

Create a website or blog at WordPress.com

Up ↑

Design a site like this with WordPress.com
Get started