As a data science engineer, you will work with other team members to build a dashboard for reliability data analysis, which is critical for feeding back past field behavior from our installed base into future improvements. You will start by understanding the data structure used for reliability calculation, then work closely with data input engineers and reliability engineers to collect data inputs. Next, you will write Python scripts to analyze the data, share insights, and publish the results in the dashboard. After that, you will update the scripts with new features requested by internal customers. As more data is collected, you will use machine learning models (such as decision trees, XGBoost, and neural networks) to make predictions.
Responsibilities
Provide general guidance on data platform selection, database setup (including data input and output formats), and data sharing
Write Python scripts to retrieve raw data or join data from different sources
Write Python scripts for data processing and data analysis
Build dashboards to share data analysis results
Perform data mining and data benchmarking on actual performance data, machine data, field KPIs, dashboards, SOs, and other sources.
Provide insights from data mining analysis to higher management and other team members
Education and experience
Bachelor's or master's degree in math, statistics, data science, computer science, or a related field
1-3 years of work experience as a data analyst or data scientist, or in a related role
Required skills
Python coding, with proficiency in pandas and in making interactive plots in Jupyter Notebook
Able to build dashboards from raw data inputs.
Fast learner with clear logical thinking who can execute tasks using new software, applications, and Python libraries.
Delivers fact-based and well-structured messages in a range of different formats
Strong analytical skills, with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy
Preferred skills
Reliability knowledge, such as the Weibull distribution and the Crow-AMSAA reliability growth model.
Good presentation skills for both management and cross-functional teams
Strong ability to use a range of IT tools (Python, VBA, SQL, Power BI, Spotfire, SAP)
Able to deploy machine learning models for data insights
Task-oriented; acts as a digital problem solver for the organization