Senior Data Engineer Role – Application Form
Thank you for considering us in your career journey. Before we proceed with evaluating your fit for the role, we kindly ask you to complete our self-assessment form. This will give us a clearer picture of your skills, experience, and professional goals, helping us align you with the best opportunities. By doing so, we can use the initial call to focus on your questions rather than HR screening or managerial queries.Rest assured, your information will be kept confidential and used solely to assess your candidacy. We appreciate your time and look forward to reviewing your application. Thank you!
Take a look at the full job details here! 🚀
If you're looking for an awesome new challenge and a place to grow, don’t miss out - check out the role and see if it's a great fit for you: http://itsavvystaffing.tilda.ws/
Name
*
First Name
Last Name
Email
*
example@example.com
Your current location (Country, City)
*
Please indicate your legal status in the current location (Citizen, Permanent Resident, Work Visa Holder, Student Visa Holder, Tourist Visa Holder, Other (Please specify) )
Do you currently own a legal entity (such as a business or company) in your location that can enter into a B2B contract? If so, please specify the type of entity (e.g. LLC, Sole Proprietorship, etc.).
*
Please upload your resume
Browse Files
Drag and drop files here
Choose a file
Cancel
of
If you don’t have a CV or would prefer to share your LinkedIn profile instead, please use this folder to provide the link.
I hereby consent to the processing of my personal data, as provided in this document, for the purpose of recruitment process in accordance with the Personal Data Protection Act of 10 May 2018 (Journal of Laws 2018, item 1000) and Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, repealing Directive 95/46/EC (General Data Protection Regulation).
*
Yes
No
Hard skills - Assess Your Hard Skills and Experience
Overall Experience
0 - No Experience
1 - Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for many years (5y+)
4 - I am an expert (10y+), speaker at conferences, etc
Tech Lead experience
Team Lead experience
Architecture design
Task estimation experience
Refactoring legacy code experience
Code review experience
Technical docs creation
PySpark Expertise
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Proficiency in PySpark for data processing, optimization, and scaling.
Ability to write efficient PySpark code for large datasets and ensure best practices for performance (e.g., partitioning, memory tuning).
Big Data Technologies
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Deep understanding of distributed computing frameworks (e.g., Apache Spark, Apache Hadoop).
Experience with cloud-based big data tools like AWS EMR, Databricks.
ETL/ELT Pipelines
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Ability to design, build, and optimize ETL/ELT pipelines that handle real-time and batch data processing.
Knowledge of data integration, transformation, and loading techniques using Python, SQL, or other relevant tools.
Cloud Infrastructure (AWS)
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Hands-on experience with AWS services like EMR, S3, Lambda, Athena, and Glue
Ability to leverage cloud architecture for data storage, compute, and analytics
SQL & Data Modeling
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Advanced SQL skills for querying large datasets, optimizing queries, and data transformations.
Knowledge of data modeling techniques and relational/non-relational database design
Data Lake & Data Warehouse Design
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Understanding of data lake architecture for scalable storage and analytics.
Experience in setting up and managing data warehouses (e.g., Redshift, Snowflake)
Real-Time Data Processing
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Familiarity with tools like Kafka or Spark Streaming for real-time data pipelines.
Ability to build and maintain event-driven architectures for real-time analytics.
Performance Tuning & Cost Optimization
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Experience in optimizing Spark jobs for scalability and efficiency.
Security & Compliance
0 - No experience
1- Little experience, use it from time to time
2 - Work with it every day (up to 5y)
3 - Have been working with it for 5y+
4 - I am an expert (10y+), speaker at conferences, etc
Knowledge of data governance, security best practices, and compliance standards.
Familiarity with data encryption, IAM roles, and data access control in a cloud environment.
Other valuable skills and comments
Certification (list of valid and expired)
Language: English - speaking
*
Zero
Elementary
Pre- intermediate
Intermediate
Upper-intermediate
Advanced / Fluent as native
Language: English - writing
*
Zero
Elementary
Pre- intermediate
Intermediate
Upper-intermediate
Advanced / Fluent as native
Career Details
This section is important to help us understand your expectations and availability for the role you are applying for, and to ensure that we can offer you a competitive compensation package and align on a mutually suitable start date and other details.
How soon would you be available to start working if you are offered a full-time position with a weekly commitment of 40 hours?
*
Notice period (in weeks)
*
What are your compensation expectations for your next role? You’re welcome to share a specific number or a range in USD, either per hour or per month (before taxes)
Work experience
Please provide a brief summary of your overall professional experience
Could you walk us through your current role and past job experience, focusing on the technologies you've worked with? We’d love to hear about a project you are particularly proud of or found most challenging (what made it complex, your specific contributions, and the impact it had on the business or team)
*
PySpark Optimization: Describe a scenario where you had to optimize a PySpark job for performance and scalability. What specific techniques or strategies did you use, and how did it impact the overall system performance?
*
AWS Experience: Can you walk us through a project where you utilized AWS services, specifically AWS EMR, to process large datasets? What challenges did you encounter, and how did you resolve them?
*
ETL/ELT Processes: How do you approach designing and building high-performance ETL/ELT pipelines? What tools and techniques do you typically use to ensure data is processed efficiently in real-time or batch mode?
Data Lake Architecture: Explain your experience working with a data lake architecture. How do you ensure data is effectively stored and optimized for large-scale analytics?
Real-Time Data Processing: What experience do you have in implementing real-time data processing pipelines? Can you provide an example where you successfully supported real-time analytics in a distributed computing environment?
Handling Big Data: How do you approach ingesting, transforming, and loading massive datasets from various sources in a distributed environment?
Performance Optimization: Can you share a specific example where you significantly improved the performance of a distributed data processing job?
Please let me know how you’d like to proceed. I’d be happy to arrange an initial call (ETA: up to 1h) to discuss the position, the company, and answer any questions you may have. Alternatively, if you prefer to receive this information in writing, I can provide that and move forward with setting up the next round of interviews.
*
I would appreciate the opportunity to discuss the position and company details over a call. Could we schedule a time that works for both of us?
I prefer to receive the information in writing. Once I have that, I’d be happy to move forward with the next round of interviews.
If you have any questions or run into any blockers during the next steps, please let me know. This way, I can either prepare better for our call (if necessary) or send a detailed message to clear up any doubts before we move forward.
Thank you for being awesome and submitting the form!
Our team has received your information and we will carefully analyze it. You can expect to hear back from us within 2 business days. If you have any questions or concerns in the meantime, please don't hesitate to reach out.
Submit
Should be Empty: