CS

Applied AI/ML Scientist

Verified
Cerebras Systems
Posted 2 weeks ago
Posted 15 April 2026
2 views
full-time

About the Role

<div class="content-intro"><p><span data-contrast="none">Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.&nbsp;</span><span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;201341983&quot;:0,&quot;335559685&quot;:0,&quot;335559737&quot;:240,&quot;335559738&quot;:240,&quot;335559739&quot;:240,&quot;335559740&quot;:279}">&nbsp;</span></p> <p>Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups.&nbsp;<a href="https://openai.com/index/cerebras-partnership/&quot;&gt;OpenAI recently announced a multi-year partnership with Cerebras</a>, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.&nbsp;</p> <p>Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.</p></div><h4>About The Role</h4> <p><span data-contrast="auto">As an Applied AI Scientist in the&nbsp;FieldML&nbsp;team, you will&nbsp;be responsible for&nbsp;developing and customizing large language models and more broadly large-scale deep learning models to solve specific customer problems. You&nbsp;won't&nbsp;just advise; you will build.&nbsp;You will bridge the gap between&nbsp;state-of-the-art&nbsp;research and real-world&nbsp;applications&nbsp;by helping customers harness the power of the Cerebras Wafer-Scale Engine (WSE) for their AI initiatives.&nbsp;</span><span data-ccp-props="{}">&nbsp;</span></p> <p><span data-contrast="auto">We are looking for&nbsp;experienced&nbsp;AI Scientists who are passionate about the "applied" side of machine learning&nbsp;-&nbsp;those who enjoy not just reading papers, but implementing, training, and scaling models to solve complex business and scientific problems. You will work on a diverse range of projects, from training bespoke models from scratch to fine-tuning and&nbsp;optimizing&nbsp;the latest Large Language Models (LLMs) for specific industry verticals, to designing and building components for custom agentic systems.</span><span data-ccp-props="{}">&nbsp;</span></p> <p><span data-contrast="auto">The ideal candidate has experience in large model training and/or post-training, a deep understanding of training dynamics and model convergence, and&nbsp;expertise&nbsp;in data curation, combined with&nbsp;strong communication&nbsp;skills.&nbsp;&nbsp;</span><span data-ccp-props="{}">&nbsp;</span></p> <h4><span data-ccp-props="{}">Key Responsibilities&nbsp;</span></h4> <ul> <li><strong><span data-contrast="auto">Customer Use Case Discovery &amp; Project Scoping</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Collaborate with customer stakeholders to&nbsp;identify&nbsp;the best approaches&nbsp;to their&nbsp;business problem with AI.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Contribute to the technical scoping of engagements, including feasibility analysis, data quality/availability/readiness assessments, and the selection of&nbsp;optimal&nbsp;model architectures.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Define project milestones, success metrics, and rigorous evaluation benchmarks to ensure the solution delivers measurable value to the customer’s business.</span><span data-ccp-props="{}">&nbsp;</span></li> </ul> </li> <li><strong><span data-contrast="auto">Custom SOTA Models and AI Systems&nbsp;Development</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Architect and execute&nbsp;end-to-end training recipes for custom models,&nbsp;tailoring&nbsp;model architecture and training recipes&nbsp;to meet customer-specific performance and accuracy requirements.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Design and implement sophisticated adaptation strategies, including continuous pre-training on private datasets, supervised fine-tuning (SFT), and post-training alignment via RLHF or DPO.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Take full ownership of the training pipeline, from high-performance data preprocessing and tokenization to hyperparameter tuning and loss-curve analysis.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Navigate the nuances of model convergence on specialized hardware, performing deep-dive analysis into loss dynamics and gradient stability.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Scale training workloads across Cerebras clusters, ensuring efficient&nbsp;utilization&nbsp;of the hardware for multi-billion parameter models.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Build and&nbsp;optimize&nbsp;the core components of agentic systems, focusing on tool-use capabilities, long-context reasoning, and multi-step planning.</span><span data-ccp-props="{}">&nbsp;</span></li> </ul> </li> <li><strong><span data-contrast="auto">Technical Customer Leadership</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Serve as an AI/ML subject matter expert during technical&nbsp;deep-dives, translating customer requirements into precise training recipes.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Build and&nbsp;maintain&nbsp;strong customer relationships to become their go-to&nbsp;AI/ML&nbsp;expert.</span><span data-ccp-props="{}">&nbsp;</span></li> </ul> </li> <li><strong><span data-contrast="auto">Internal Research and Engineering Collaboration</span></strong><span data-ccp-props="{}">&nbsp;</span> <ul> <li><span data-contrast="auto">Act as the "voice of the customer" for internal R&amp;D and engineering teams to drive improvements in our software stack and hardware&nbsp;utilization.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Partner with internal ML teams and product teams on prioritization of novel model architectures with Cerebras software stack, development of training recipes and internal case studies.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Distill customer-facing&nbsp;successful projects&nbsp;into internal playbooks, helping scale the&nbsp;FieldML&nbsp;team’s ability to deliver specialized models.</span></li> </ul> </li> </ul> <h4><span data-contrast="none">Skills And Qualifications&nbsp;</span> <span data-ccp-props="{&quot;134233117&quot;:false,&quot;134233118&quot;:false,&quot;201341983&quot;:0,&quot;335559685&quot;:0,&quot;335559737&quot;:240,&quot;335559738&quot;:240,&quot;335559739&quot;:240,&quot;335559740&quot;:279}">&nbsp;</span></h4> <ul> <li><span data-contrast="auto">Education:</span><span data-contrast="auto">&nbsp;Master’s or PhD in Computer Science, Machine Learning, or&nbsp;related&nbsp;fields.</span><span data-ccp-props="{}">&nbsp;</span></li> <li><span data-contrast="auto">Broad Deep Learning Expertise:</span><span data-contrast="auto">&nbsp;Expert-level understanding of modern model architectures, including dense transformers,&nbsp;MoEs, multimodal and sequence mod

Related Searches

Explore more opportunities matching this role's title, location, and skills.

Job Title PagesLocation PagesCompany PagesSkill Pages

Ready to apply?

Click below to apply directly on Cerebras Systems's careers page.

Get the top 10 hyper-growth roles delivered to your inbox every Tuesday.