PriceSenz
Irving, Texas
10/01/2020
Full time
Emerging Technologies Data Engineer Description: Work with an innovative team of data scientists, data engineers, product managers, and technical leaders to better understand Verizons customers. In this role youll be interfacing directly with the data to build reproducible pipelines to power advanced machine learning models for both training and production use cases. Must Have Skills: 2+ years of coding experience, Python or Java preferred 4+ years of experience manipulating large data sets using SQL, Spark, or other related technologies 2+ years working with cloud scale data movement technologies (Nifi, Spark, etc) Experience with data pipeline and workflow management tools: Luigi, Airflow, Argo, etc AWS or other scalable cloud Linux system administration Docker Strong analytical skills related to pulling data multiple production data stores Ability to communicate complex ideas in data science to relevant stakeholders Desired Skills: Production experience building, deploying, and supporting machine learning models Understanding of common machine learning algorithms (regression, xgboost, neural networks, etc) Experience with streaming data movement tools (Kafka, Pulsar, etc) Lean Startup or Other Similar Experience JOB DUTIES: Work with Verizon systems to source data suitable for building a time series model of customer behaviors. Collaborate with data scientists to gain an understanding of the data and build reproducible pipelines from multiple source data sets to feed a real-time model. EDUCATION/CERTIFICATIONS: BS in computer science or related field or 4 years of experience - provided by Dice