B
BSX
BackendSenior
Senior Java Engineer (Data Pipelines, PySpark)
JavaPythonPysparkApache SparkDatabricksAws EmrHadoopKafkaMicroservices
About the Position
We are looking for a Senior Java Engineer with strong experience in building data pipelines and distributed data processing systems.
Responsibilities
- Develop and support Java backend services and data pipelines
- Build and optimize PySpark data processing jobs
- Collaborate with ML engineers on data preparation and processing
- Work with distributed systems and real-time services
- Improve scalability and performance of data infrastructure
Requirements
- 5+ years of commercial experience with Java
- Strong backend development experience (microservices, APIs, distributed systems)
- Commercial experience with Python and PySpark (2+ years preferred)
- Experience building and maintaining data pipelines
- Hands-on experience with Spark ecosystem (Databricks / EMR / Hadoop / Apache Spark)
- Experience with real-time data processing systems
- Good understanding of data transformation and ETL processes
- Experience working with cloud environments (AWS is a plus)
- English — UPPER Intermediate+ or higher
Benefits
- Long-term project with competitive compensation.
- Opportunities for team collaboration and growth.
- Participation in both new software development and existing projects (no old code).
Prepare for this role on ITLead
Take the matching roadmap and practice real interview questions asked for this level.
Senior Java Engineer (Data Pipelines, PySpark)