Data Engineer (Los Angeles, CA)

Share this job

Data Engineer

Los Angeles, CA

About the Role

This role focuses on designing, building, and maintaining scalable Azure Databricks-based data pipelines and architectures that enable analytics, AI/ML, and reporting across commercial functions. You'll lead data engineering initiatives, ensure governance and compliance, optimize performance and costs, and collaborate across teams to advance the company's data and AI strategy.

Responsibilities

Design, build, and maintain scalable, reliable, and cost-efficient data pipelines using Azure Databricks in support of analytics, machine learning, data science, and operational use cases
Lead data engineering initiatives that enable AI/ML model development, LLM integrations, and AI-driven applications
Architect and implement data integration frameworks across Adtech and Martech ecosystems, incorporating Google Analytics, media campaign data, and third-party marketing APIs like Salesforce
Manage and optimize data ingestion, transformation, and storage processes using SQL, Python, and PySpark
Design and maintain API integrations with internal and external systems, including AI/LLM-powered APIs for advanced analytics and automation
Administer and maintain Azure-based data tools and platforms (Databricks, ADF)
Ensure data quality, integrity, and consistency through robust validation, monitoring, and alerting within Azure Databricks
Implement and enforce data governance, security, and compliance standards in collaboration with IT and InfoSec
Partner with analytics, marketing, commercial operations, and technology teams to deliver commercial data products
Monitor and optimize data infrastructure costs, performance, and scalability across Azure cloud environments
Participate in architecture discussions, design reviews, and CI/CD workflows as part of an Agile engineering team

Required Qualifications

Bachelor's degree required; 5+ years in Data Engineering, Data Architecture, or Cloud Platforms
Pharma or biotech industry experience strongly preferred, specifically within commercial data (sales, market access/payer, marketing)
Hands-on experience with Azure Cloud (Databricks, DevOps, DataFactory)
Strong Python, PySpark, and SQL skills
Experience building and leveraging API integrations in ETL pipeline development
Experience integrating AI models and APIs (OpenAI API, Databricks AI models, etc.)
Self-driven; able to independently design end-to-end data pipelines while ensuring architectural best practices

Preferred Qualifications

Databricks Unity Catalog or Databricks One experience
Familiarity with Martech and Adtech tools and practices
Knowledge of Consumer, Patient, or HCP data ecosystems
Experience with identity providers such as LiveRamp, Acxiom, or Experian
Experience with Customer Data Platforms (CDP)
Experience with marketing automation tools

Working Conditions

General office environment. Travel up to 20%.

Apply for this job