Role: Principal Architect, Databricks
Work Location: Houston, TX (4 days in office with 1 hybrid day)
Number of Positions: 1
Position Type: Full-time
Company Description: Automotive
US CITIZENS AND GREEN CARD HOLDERS ARE ENCOURAGED TO APPLY. WE ARE UNABLE TO PROVIDE SPONSORSHIP AT THIS TIME.
JOB SUMMARY
The Principal Architect, Databricks will act as our top technical expert and lead practitioner for the Databricks Platform. This role combines architectural leadership with practical implementation. You will be responsible not only for defining the architectural vision but also for developing core, reusable patterns and reference architectures that will accelerate our teams. The ideal candidate is a master of the Databricks ecosystem who leads by example, demonstrates what's possible through hands-on development, and empowers teams to build robust, scalable data solutions.
QUALIFICATIONS
· Bachelor's Degree in Computer Science or related field. Required
· 10+ years of experience in software engineering, including significant experience architecting systems. Required
· Proven experience acting as a hands-on technical architect and advisor on large-scale data projects. Required
· Databricks Mastery: Deep, expert-level knowledge of the Databricks Platform, including:
    · Unity Catalog: Designing and implementing data governance and security.
    · Delta Lake & Delta Live Tables: Architecting and building reliable, scalable data pipelines.
    · Performance & Cost Optimization: Expertise in tuning Spark jobs, optimizing cluster usage, and managing platform costs.
    · MLOps: Strong, practical understanding of the machine learning lifecycle on Databricks using tools like MLflow.
    · Databricks SQL: Knowledge of designing and optimizing analytical workloads.
    · Mosaic AI: Knowledge of designing and optimizing AI Agents.
· Cloud & Infrastructure: Deep knowledge of cloud architecture and services on AWS. Strong command of Infrastructure as Code (Terraform, YAML).
· Software Engineering & Programming: Strong background in software engineering and building large fault-tolerant systems.
· CI/CD & Automation: Experience with designing and implementing CI/CD pipelines (preferably with GitHub Actions) for data and ML workloads.
· Observability: Familiarity with implementing monitoring, logging, and alerting for data platforms.
· Automation: The platform is ephemeral, and all changes are implemented using Terraform and Python. Expertise in Terraform and Python is a must.
· Excellent communication and interpersonal skills, with the ability to influence and guide technical teams and stakeholders effectively.
· A strategic mindset with a passion for solving complex data challenges and driving business outcomes through technology.
· The ability to think critically, challenge assumptions, and make clear, well-reasoned architectural decisions.
TRAVEL REQUIRED
Minimal travel is required for this position (up to 10% of the time and on a domestic basis).
RESPONSIBILITIES
STANDARD BENEFITS
· Medical, Dental & Vision: eligible after 30 days of employment.
· 401(k): 4% company match (1:1), starting day one; vesting after 2 years.
· 27 days of PTO in a full year. 10 paid holidays.
· Eligible to participate in the vehicle program and performance bonuses.