Analytics, Lead Data Engineer/Solution Architect | RI - Woonsocket
Our Data Engineering team is helping lead the transformation of our program into a world-class personalization and loyalty program. With over 10,000 retail locations and 70M+ active members, this is a top initiative within the company, and we have a team dedicated to recruiting the best talent in the world to help propel us to this goal. The company has invested in state-of-the-art cloud infrastructure and in scaling our loyalty program; now we are focused on optimizing how we communicate with our customers. We are looking for the best and brightest to add to our existing Analytics team and help deliver on this initiative.
Specifically, you will collaborate with the business and lead a team of advanced Data Engineers to design, build, test, productionize, and support the Front Store Personalization Engine. As CVS continues on its journey to deliver 1:1 personalized experiences to our customers through multiple channels, you will support this effort by working across multiple teams to rapidly build, test, and scale high-priority use cases that drive increased reach, relevance, and rewards for our customers.
- Make analytics faster, more insightful, and more efficient by building, architecting, and maintaining a next-generation Big Data Machine Learning framework
- Rapidly develop prototypes and proofs of concept for the selected solutions, and implement complex big data projects
- Bring a DevOps mindset to enable big data and batch/real-time analytical solutions that leverage emerging technologies and best practices in release management
- Interpret methodology to design and develop ML pipeline application architecture incorporating modeling inputs, business logic/rules, and constraints to deliver personalized recommendations
- Manage production processes responsible for delivering weekly, daily, and near real-time campaigns, with accountability for production support
- Partner with IT to maintain development and production environments and platform/tool enablement to meet solution requirements, including cluster and cloud cost management
- Run agile scrum sessions defining engineering requirements, development, and release management through partnership and coordination across business, IT, digital product, and data-science modeling teams
- Bring strong project, process, and stakeholder management and communication skills to a highly matrixed, cross-functional organization
- Although the desired candidate will have leadership experience in an organization, the expectation in this role is that you will combine this ability to lead with the ability to roll up your sleeves and get into the weeds when necessary
Required Qualifications
- 3+ years of management experience leading Data Engineers and/or analytics-focused teams to deliver complex analytics projects on aggressive timelines
- 7+ years of professional IT or Business Analytics experience, including the following:
- Hands-on experience with “big data” platforms including Hadoop (preferably Azure or AWS) and Spark, as well as experience with traditional RDBMS (e.g., Teradata, Oracle)
- Proficiency in “big data” technologies including Spark, Airflow, Kafka, HBase, Pig, NoSQL databases, etc.
- Proficiency in the following programming languages: PySpark, Python, shell scripting, SQL (preferably Teradata and PL/SQL syntax), and Hive, Pig, C++, Java, or Scala
- Ability to design and build a framework to orchestrate data pipelines and Machine Learning models
- Proficiency with tools to automate CI/CD pipelines (e.g., Jenkins, Git, Control-M)
- Ability to design and implement end-to-end solutions using Machine Learning, Optimization, and other advanced technologies, and to own live deployments
- Experience with frameworks for either Machine Learning or NLP (scikit-learn, spaCy, PyTorch, Spark NLP)
- Strong coursework/experience in programming and working with complex datasets is required
Preferred Qualifications
- Experience leading a team and coaching others to improve the overall team’s skill sets
- Experience working in an agile, sprint-based style
- Experience working side-by-side with business owners and translating business needs into analytics solutions
- Proven ability to successfully balance near-term results (e.g., ability to design and execute on an ‘MVP’ model) with long-term goals
- Comfortable balancing quality of output with the short timelines required to enable downstream functions
- Retail / Healthcare data and domain knowledge
- Experience with a cloud computing environment (ideally Microsoft Azure)
- Experience working on a large-scale Spark implementation