Christopher is a Data Engineer with 8 years of experience with Cloud Computing. He has strong skills with Python, Airflow, and Google Cloud Platform with diverse experience in the Healthcare, Financial and Retail industries.
Hire ChristopherCreated an automated DAG creator to efficiently onboard new healthcare partners and create new data pipelines without having to rely on boilerplate code for all DAGs. This decreased the complexity of creating data pipelines and allowed newer hires to work on data pipelines.
Created robust data pipelines using PySpark to clean, aggregate, and enrich data. Worked with epidemiologists to enhance analysis and convert legacy SAS code into PySpark jobs. Automated a previously manual process by setting up a data connection that calls CSV data from an API, parses/processes the data, and then transforms it into a PySpark data frame.
Evaluated Google Cloud Composer as a potential tool to replace Talend as the main data pipeline tool for the company. Created a QA data pipeline that increased reliability and ease of use over existing Talend jobs. Received approval from engineering management to rewrite 100+ Talend jobs into Composer data pipelines.
Created analytics pipeline that also incorporated 3rd party sales data to produce a Tableau report that broke down sales by the mentioned categories. Categorized 85% of the available data using this method, which led to an uptick of 3% in sales for the specific product.
Led an India-based engineering team as a project manager for a Fortune 500 client. Gathered requirements from multiple stakeholders and coordinated the team to deliver multiple analytics projects.