We are looking for an experienced Data Engineer to join our Analytics & Visualization team. We are interested in engineers with strong knowledge of and experience in building Data Intensive Applications (DIAs) in an industrial/enterprise environment. You are expected to handle data from various sources and determine how best to structure it, so that it reaches data analysts in a ready-to-use form for running queries and algorithms for predictive and prescriptive analytics through machine learning. You will design and implement robust, scalable data pipelines on big data ecosystems. You are expected to write high-quality, maintainable, and robust code, and to provide solutions that ensure the long-term quality and integrity of the data.
This position supports the development of novel data analytics, visualization and control solutions using cutting-edge techniques from Machine Learning and Data Mining. Successful methodologies are intended to be included in customer applications.
Will you be our next talented data engineer, joining the team responsible for creating applications that help our customers keep their machines running optimally during production? This could be your next ultimate job, so if you would like to join us, please apply!
Our multinational teams are composed of five to seven developers, a Scrum Master and a Product Owner. We are committed to following the Agile way of working, with sprints and demos every two weeks, aiming for frequent releases of working software. In all teams we cooperate with internal and external experts from different knowledge domains to discover and build the best solutions possible. We use Continuous Integration with Git, Jira and Bamboo. We move fast to help our customers reach their goals, and we strive to create reliable and well-tested DIAs, as a failure in our software stack can severely impact customers' operations.
-Design and implement Data Intensive Applications, realizing the product backlog defined by the Product Owner
-Ensure the quality of your own deliverables, including designing and implementing automated tests (among others at unit and integration level)
-Cooperate with other teams to ensure consistent implementation of the architecture, agree on interfaces and timing of cross-team deliveries
-Troubleshoot, analyze and solve integration issues, both those found in internal alpha and beta tests and those reported by our customers
-Write or update product documentation in accordance with company processes
-Suggest improvements to our technical solutions and way of working, and implement them in alignment with your team and their stakeholders
-M.Sc. or Ph.D. in Data Science, Computer Science, Electrical Engineering, Mathematics or Physics
-Experience building DIAs in an industrial/enterprise environment
-Experienced in software development (strong coding and testing skills)
-IT-related knowledge is considered a plus
-Familiarity with programming languages such as Python and Java
-Familiarity with relational and non-relational (document, columnar, graph) database architectures
-Experience with frameworks from the big data ecosystem (Spark, Kafka, HBase, etc.)
-Experience with orchestration & containerization is a plus (Kubernetes, Docker, Mesos DC/OS)
-Strong ability to communicate and negotiate designs of data pipelines with Data Architects and Data Scientists
Highly valued qualifications & experiences
-Experience working on practical applications using real-world datasets
-Hands-on experience with different data formats (CSV, XML, Avro, TXT, JSON, etc.)
-Familiarity with statistical languages such as R and/or MATLAB
-Ability to handle, analyze and visualize complex, high-volume, high-dimensional data from varying sources
-Enthusiastic and intrinsically motivated, creative thinker
-Good & proactive communication in an international and multidisciplinary environment
-Takes responsibility and is a self-starter
-Goal-oriented and flexible mindset, willing to acquire lithography and other semiconductor manufacturing knowledge