Staff Software Engineer (MLOps)
About the Role
<p><span style="font-size: 10pt;">Toast creates technology to help restaurants and local businesses succeed in a digital world, helping business owners operate, increase sales, engage customers, and keep employees happy.</span></p> <p><span style="font-size: 10pt;">Now, more than ever, the Toast team is committed to our customers. We’re taking steps to help restaurants navigate these unprecedented times with technology, resources, and community. Our focus is on building the restaurant platform that helps restaurants adapt, take control, and get back to what they do best: building the businesses they love. And because our technology is purpose-built for restaurants, by restaurant people, restaurants can trust that we’ll deliver on their needs for today while investing in experiences that will power their restaurant of the future. </span></p> <p><span style="font-size: 10pt;"><strong><em>Bready*</em></strong><strong> to make a change?</strong></span></p> <p><span style="font-size: 10pt;">Toast is looking for a <strong>Staff Machine Learning Engineer</strong> to serve as a technical linchpin for our AI Platform team. At the P4 level, you aren't just deploying models; you are designing the fundamental infrastructure that enables dozens of teams to build, deploy, and monitor AI at scale. 
You will act as a force multiplier, mentoring senior engineers and setting the architectural standards for our MLOps lifecycle—from feature stores and automated retraining to high-performance inference at the edge.</span></p> <p><span style="font-size: 10pt;"><strong>About this </strong><strong><em>Roll*</em></strong><strong>:</strong></span></p> <ul> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Architectural Leadership:</strong> Design and lead the evolution of a unified MLOps platform that supports diverse needs across Toast, ensuring high availability, scalability, and security of ML services.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Engineering Excellence:</strong> Champion and institutionalize best practices for CI/CD for ML (MLOps), automated testing, and infrastructure-as-code (Terraform).</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Cross-Functional Synergy:</strong> Lead collaborative efforts across Data Engineering, DevOps, and Product teams to bridge the gap between model prototyping and production-grade reliability.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Strategic Roadmapping:</strong> Partner with leadership and Product Managers to define the 1-2 year technical vision for AI infrastructure, prioritizing long-term stability over short-term fixes.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Operational Ownership:</strong> Set the standard for observability and incident response for ML systems, driving root-cause analysis for complex system failures.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Mentorship:</strong> Actively mentor P2 and P3 engineers, fostering a culture of technical rigor and continuous learning.</span></li> </ul> <p><span style="font-size: 10pt;"><strong>Do you have the right 
</strong><strong><em>ingredients*</em></strong><strong>?</strong></span></p> <ul> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Education:</strong> Bachelor’s or Master’s degree in Computer Science, AI, or a related technical field.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Experience:</strong> <strong>10-12+ years</strong> of professional software engineering experience, with at least <strong>6-7 years</strong> specifically focused on productionizing and scaling ML systems at the enterprise level.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Core Tech Stack:</strong> Expert-level proficiency in <strong>Python, Scala, or Java/Kotlin</strong>. Extensive experience with <strong>PySpark</strong> and high-performance computing.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Generative AI &amp; LLMOps:</strong> Proven track record of taking LLM applications from research to production, including experience with <strong>Vector Databases, LangChain/LangGraph, and A2A protocols</strong>.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>System Design:</strong> Superior ability to design distributed systems that handle millions of requests with sub-second latency.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;">Experience with microservice-based architecture, preferably with AWS tooling (SageMaker, DynamoDB, Athena, Glue, etc.).</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;">Experience with software engineering best practices and tools, including object-oriented programming, test-driven development, CI/CD, Git, shell scripting, task orchestration, and MLflow.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;">Deep knowledge of model deployment, orchestration (Apache Airflow, <strong><em>Prefect</em></strong>), scaling, and managing CPU/GPU
resources efficiently.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;">Exceptional problem-solving and analytical skills, and the ability to tackle complex problems with a critical-thinking approach.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;">Outstanding communication and interpersonal skills, coupled with a demonstrated ability to work collaboratively within a team environment.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;">Foundational knowledge of statistical concepts (e.g., classification, regression) and deep learning algorithms (e.g., CNNs, RNNs) is desirable.</span></li> </ul> <h3><span style="font-size: 10pt;"><strong>Bonus Ingredients (The "Staff" Edge)</strong></span></h3> <ul> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Data Strategy:</strong> Experience implementing enterprise-grade <strong>Feature Stores</strong> (e.g., Tecton, Feast) and real-time streaming or distributed computing frameworks like <strong>Apache Flink or Ray</strong>.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Full-Stack Visibility:</strong> Ability to dive into the UI/UX layer (React) or deep into the kernel/networking layer to debug performance bottlenecks.</span></li> <li style="font-size: 10pt;"><span style="font-size: 10pt;"><strong>Open Source/Community:</strong> Contributions to relevant open-source projects (MLflow, Kubeflow, etc.) or a history of speaking at industry conferences.</span></li> </ul> <p data-pm-slice="1 1 []"><span style="font-size: 10pt;"><strong>AI at Toast</strong></span></p> <p><span style="font-size: 10pt;">At Toast, one of our company values is that we're hungry to build and learn. We believe learning new AI tools empowers us to build for our customers faster, more independently, and with higher quality. 
We provide these tools across all disciplines, from Engineering and Product to Sales and Support, and are inspired by how our Toasters are already driving real value with them. The people who thrive here are those who embrace changes that let us build more for our customers; it’s a core part of our culture.</span></p> <p><span style="font-size: 10pt;"><strong>Our Total Rewards Philosophy </strong></span><br><span style="font-size: 10pt;">We strive to provide competitive compensation and benefits programs that help to attract, retain, and motivate the best and brightest people in our industry. Our total rewards package goes beyond great earnings potential and provides the means to a healthy lifestyle with the flexibility to meet Toasters’ changing needs. Learn more about our benefits at <a class="c-link" href="https://careers.toasttab.com/toast-benefits" target="blank" data-stringify-link="https://careers.toasttab.com/toast-benefits" data-sk="tooltip_parent">https://careers.toasttab.com/toast-benefits</a>.</span></p>