Applied AI / LLM Engineer – Production Systems
New York City, NY

Position: Applied AI / LLM Engineer – Production Systems

Location: New York City, NY (Hybrid – 3 days onsite, Tue–Thu)

Travel: None

Type: Full-time, Permanent

Salary: Approx. $250,000 – $800,000 base + bonus (role & level dependent)


Overview:


We’re hiring exceptional Applied AI / LLM Engineers to join elite research and engineering teams building production-grade AI systems at the intersection of machine learning, large language models, and real-world scientific and enterprise applications.


These roles go well beyond experimentation: you will design, train, scale, and deploy ML and LLM systems used daily by researchers, engineers, and operational teams. The work includes foundational model development, large-scale training, and applied GenAI systems integrated directly into core workflows.


You’ll work in highly collaborative, technically rigorous environments where AI is treated as core infrastructure, not a side project.


What You’ll Do:

▪️ Design, train, and deploy ML and LLM systems used in real production environments

▪️ Build and scale large-model training and inference pipelines

▪️ Develop and optimize deep learning systems for NLP, multimodal, and scientific applications

▪️ Implement post-training techniques (instruction tuning, contrastive learning, RL-based methods)

▪️ Build robust evaluation, monitoring, and experimentation frameworks

▪️ Partner closely with researchers, scientists, and engineers to productionize models

▪️ Own projects end-to-end from concept through deployment and continuous improvement


What You Bring:


▪️ Strong Python engineering background

▪️ Proven experience shipping ML or LLM systems into production

▪️ Hands-on experience with deep learning, NLP, or large-scale ML systems

▪️ Experience with model training, fine-tuning, or large-scale inference

▪️ Exposure to distributed systems, ML infrastructure, or HPC environments

▪️ Ability to work end-to-end across research, engineering, and deployment


Why It’s a Great Opportunity:


▪️ Work onsite with world-class researchers and engineers in New York City

▪️ Build AI systems that directly support scientific discovery and enterprise platforms

▪️ Access to cutting-edge compute infrastructure and real model ownership

▪️ Highly competitive compensation and long-term technical growth

▪️ Deep technical problems with visible, real-world impact


Agile Staffing is a leading recruitment firm supporting healthcare organizations across North America. We specialize in delivering experienced, hard-to-find talent to hospitals and healthcare systems through strong partnerships, high-integrity service, and modern recruitment practices. Our team is passionate about connecting great people with great organizations and building long-term success for both.
