S

Scaleai

San Francisco, CA; New York, NY

Learn more about Scaleai, the company behind this role.

Open Roles

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Negotiable

AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state of the art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research and resources that serve all of our enterprise clients. As an ML Sys Research Engineer, you’ll work on building out the algorithms for our next-gen Agent RL training platform, support large scale training, and research and integrate state-of-the-art technologies to optimize our ML system. Your customer will be other MLREs and AAIs on the Enterprise AI team who are taking the training algorithms and applying them to client use-cases ranging from next-generation AI cybersecurity firewall LLMs to training foundation healthtech search models. If you are excited about shaping the future of the modern AI movement, we would love to hear from you! You will: - Build, profile and optimize our training and inference framework. - Post-train state of the art models, developed both internally and from the community, to define stable post-training recipes for our enterprise engagements. - Collaborate with ML teams to accelerate their research and development, and enable them to develop the next generation of models and data curation.. - Create a next-gen agent training algorithm for multi-agent/multi-tool rollouts. Ideally you’d have: - At least 1-3 years of LLM training in a production environment - Passionate about system optimization - Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc. - Ability to demonstrate know-how on how to operate the architecture of the modern GPU cluster - Experience with multi-node LLM training and inference - Strong software engineering skills, proficient in frameworks and tools such as CUDA, Pytorch, transformers, flash attention, etc. - Strong written and verbal communication skills to operate in a cross functional team environment. - PhD or Masters in Computer Science or a related field Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligibl

👤 HumanFull-time
By ScaleaiJun 4, 2026

Product Manager of AI Applications, Global Public Sector

Negotiable

Scale is growing rapidly, and joining the Global Public Sector team is an opportunity to work on one of the most exciting and quickly expanding teams at Scale. This team is responsible for generating, executing, and fostering Scale’s work with governments and government-backed entities outside of the United States. We develop bespoke solutions that leverage our customers’ proprietary data and expertise to transform their organizations with AI. We work with them to understand their pain points and workflows and then forward deploy our team to build cutting-edge solutions. The applications we build are powered by the Scale GenAI Platform, a full stack product to build, test and deploy frontier AI agents. - Developing custom AI applications - Building custom LLMs - Providing high-quality training data for research and government institutions building LLMs - Developing partnerships to foster regional talent growth and AI adoption We are looking for an entrepreneurial and experienced product leader to play a pivotal role in the ideation and development of transformative AI solutions. The ideal candidate has deep experience with AI/ML application development, can think strategically about how to solve a problem, is an excellent listener, is comfortable getting into the weeds operationally, and has a strong understanding of software engineering principles and practices. You will be responsible for owning large AI projects for one or many customers. You will lead a cross-functional team of engineers, MLEs, and operators to build a highly impactful solution for our customers that will drive millions in revenue for our business as well. Responsibilities: - Lead design workshops with the client to define custom AI solutions - Scope out new AI application use cases across various government entities - Lead cross-functional development of AI applications and custom LLMs with diverse stakeholders (Engineering + Ops + Go-to-Market) - Consistently engage with future end-users to solicit feedback and ensure we are prioritizing effectively - Stay up to date with latest research in applied AI and training custom LLMs - Scope out model evaluation sets and performance requirements, consistently review results, and iterate on the solution - Give regular progress updates to the client and Global Public Sector leadership Minimum Qualifications: - 4+ years of experience building products with specific experience within the last 1-2 years building AI-powered products - Strong technical background (STEM degree) and/or experience building technical software products - Strong understanding of generative AI technologies and their applications in both enterprise and consumer settings - Experience with vibe coding tools (i.e., Replit, Lovable, Bolt, etc.) and design tools (i.e., Figma/Canva/Miro) - Exceptional leadership, presentation and communication skills with the ability to influence cross-functional teams Nice to haves: - Coding experience (Python) - Proficiency in Arabic, both written and spoken PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all ap

👤 HumanFull-time
By ScaleaiJun 4, 2026

Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI

Negotiable

AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state of the art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As MLRE on the Data Foundation team, you’ll work on cutting edge research to define the data flywheel that makes the whole machine move. This includes research around synthetic environments from task definitions, building agents for trace analysis, and contributing to a cutting edge framework that automatically hill-climbs agent-building from an eval set. This will involve creating best-in-class Agents that achieve state of the art results through a combination of post-training + agent-building algorithms. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: - Build synthetic data pipelines to generate enterprise environments to use for RL post-training - Create agents to convert traces from production into actionable insights to use to improve agents - Contribute to our agent building product which can construct other agents using coding agents + proprietary algorithms - Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers. Ideally you’d have: - 3+ years of building with LLMs in a production environment - Clear experiences with constructing high quality data to use to improve an LLM/Agent - Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years - PhD or Masters in Computer Science or a related field Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $250,000

👤 HumanFull-time
By ScaleaiJun 4, 2026

Senior Machine Learning Engineer, Public Sector

Negotiable

The goal of a Senior Machine Learning Engineer at Scale is to leverage techniques in the fields of generative AI, computer vision, reinforcement learning, and agentic AI to improve Scale's products and customer experience in production environments. Our machine learning engineers take advantage of robust internal infrastructure and unique access to massive datasets to deliver improvements to our customers. Our Public Sector Machine Learning team is focused on deploying cutting-edge models to mission-critical government systems through products like Donovan and Thunderforge . Our work spans multiple modalities, with a strong focus on both large language models and computer vision. On the LLM side, we are developing agentic systems that help solve complex operational and planning challenges for government partners. This includes building agent frameworks that integrate with custom retrieval pipelines and production APIs, as well as evaluation tools to benchmark and refine agent behavior. We're also advancing research in areas like reinforcement learning for agentic LLMs, with successful deployment into real-world operational environments. On the computer vision front, we're training advanced models to increase labeling throughput and automate perception tasks. Our efforts include building large-scale fine-tuning pipelines, training models across multiple modalities, and developing generalizable vision foundation models to support a wide range of defense applications. You will: - Take state of the art models developed internally and from the community, use them in production to solve problems for our customers and taskers - Improve and maintain production models through retraining, hyperparameter tuning, and architectural updates, while preserving core performance characteristics - Collaborate with product and research teams to identify and prototype ML-driven product enhancements, including for upcoming product lines - Work with massive datasets to develop both generic models as well as fine tune models for specific products - Build scalable machine learning infrastructure to automate and optimize our ML services - Serve as a cross-functional representative and advocate for machine learning techniques across engineering and product organizations - Be comfortable learning new technologies quickly and managing multiple priorities in a fast-paced environment - Comfortable with light travel (approximately 10%) for customer interaction and team needs - This role will require an active security clearance or the ability to obtain a security clearance Ideally You’d Have: - Extensive experience with GenAI, Agentic AI, natural language processing, deep learning and deep reinforcement learning, or computer vision in a production environment - Solid background in algorithms, data structures, and object-oriented programming - Strong programing skills in Python, experience in Tensorflow or PyTorch Nice to Haves: - Graduate degree in Computer Science, Machine Learning or Artificial Intelligence specialization - Experience working with cloud platforms (eg. AWS or GCP) and deploying machine learning models in cloud environments - Experience with computer vision, generative AI models, large language models, or agentic systems - Familiarity with ML evaluation frameworks and agentic model design

👤 HumanFull-time
By ScaleaiJun 4, 2026

GenAI Strategic Projects Lead, Public Sector

Negotiable

Scale is at the frontier of the AI industry, improving the world’s leading generative AI and large language models through model evaluations, human-powered supervised fine-tuning datasets, world-class reinforcement learning with human feedback, and more. Scale AI’s Public Sector team is growing in the Generative AI space, and we’re seeking an Strategic Projects Lead to own high-impact projects that drive revenue and experimentation. In this role, you’ll work across operations, engineering, and customer engagement to produce world-class training and test and evaluation data for Large Language Models for our Public Sector customers. This role offers a rare opportunity to make a meaningful impact at the intersection of AI and national security. You will help build Generative AI data-labeling pipelines from the ground up, create operational processes to manage and optimize an in-house expert data workforce, and develop novel technology-driven approaches (e.g., scripts, prompt engineering, hybrid data) to improve the quality of our training and evaluation datasets. In addition, you will partner directly with our internal machine learning experts and external stakeholders to ensure our data enables the development of mission-critical applications of AI. You will: - Develop, build, and maintain the infrastructure required to ensure data pipelines are efficient, scalable, and produce high-quality outputs - Take ownership of day-to-day progress on high-priority data production pipelines, ensuring projects move forward efficiently - Partner with subject matter experts in their fields to validate the quality of our data and to translate deep domain knowledge into scalable processes and measurable outcomes - Work closely with customers to understand their requirements and design data taxonomies that optimize model performance. - Utilize analytics and data visualization tools to track progress, identify bottlenecks, and make data-driven decisions to optimize pipeline performance - Influence cross-org collaboration to define and advance human data strategy, influencing technical and non-technical stakeholders to ensure data quality, scalability, and long-term platform leverage - Own larger and larger components of our data delivery processes, until you ultimately serve as the full owner of our most visible and high impact customer pipelines You have: - 5+ years of experience in product development, data science, or operations - A history of successful project management and comfort in ambiguity - Ability to analyze complex operational data, build queries, and identify trends to inform decisions and optimize processes - Technical aptitude to understand how to produce data for state of the art post-training techniques such as supervised fine tuning (SFT), reinforcement learning through human feedback (RLHF), Reinforcement Learning with Verifiable Rewards (RLVR) etc Nice to have: - Experience working in defense tech and/or an AI company - A technical degree in fields like computer science, data science, or engineering - A deep understanding of ML operations for generative AI workflows / products - An active Top Secret security clearance <div class="content-pay-transparenc

👤 HumanFull-time
By ScaleaiJun 4, 2026

Staff Frontier Agents Engineer

Negotiable

About Scale AI Scale AI is the data foundation for AI, helping organizations build and deploy reliable production AI applications. We partner with leading enterprises and government organizations to accelerate their AI initiatives through our data annotation platform, generative AI solutions, and enterprise AI capabilities. Role Overview As a Staff Forward Deployed AI Engineer on our Enterprise team, you'll be the technical bridge between Scale AI's cutting-edge AI capabilities and our most strategic customers. You'll work with enterprise clients to understand their unique challenges, architect custom AI solutions, and ensure successful deployment and adoption of AI systems in production environments. This is a hands-on technical role that combines deep engineering expertise with customer-facing problem solving. You'll work directly with customer engineering teams to integrate AI into their critical workflows. Key Responsibilities Customer Integration & Deployment - Partner directly with enterprise customers to understand their technical infrastructure, data pipelines, and business requirements - Design and implement custom integrations between Scale AI's platform and customer data environments (cloud platforms, data warehouses, internal APIs) - Build robust data connectors and ETL pipelines to ingest, process, and prepare customer data for AI workflows - Deploy and configure AI models and agents within customer security and compliance boundaries AI Agent Development - Develop production-grade AI agents tailored to customer use cases across domains like customer support, data analysis, content generation, and workflow automation - Architect multi-agent systems that orchestrate between different models, tools, and data sources - Implement evaluation frameworks to measure agent performance and iterate toward business objectives - Design human-in-the-loop workflows and feedback mechanisms for continuous agent improvement Prompt Engineering & Optimization - Create sophisticated prompt engineering strategies optimized for customer-specific domains and data - Build and maintain prompt libraries, templates, and best practices for customer use cases - Conduct systematic prompt experimentation and A/B testing to improve model outputs - Implement RAG (Retrieval Augmented Generation) systems and fine-tuning pipelines where appropriate Technical Leadership & Collaboration - Serve as the primary technical point of contact for strategic enterprise accounts - Collaborate with customer data scientists, ML engineers, and software developers to ensure smooth integration - Provide technical training and knowledge transfer to customer teams - Work closely with Scale's product and engineering teams to translate customer needs into product improvements - Document technical architectures, integration patterns, and best practices Problem Solving & Innovation - Debug complex technical issues across the entire stack, from data pipelines to model outputs - Rapidly prototype solutions to unblock customers and prove out new use cases &l

👤 HumanFull-time
By ScaleaiJun 4, 2026

Senior Forward Deployed Data Scientist/Engineer

Negotiable

At Scale AI, we help leading enterprises turn AI from a promising capability into reliable systems that improve real workflows and deliver measurable business value. We are hiring a Senior Forward Deployed Data Scientist / Engineer to work directly with customers on ambiguous, high-impact problems at the intersection of data science, product development, and AI deployment. This is not a traditional analytics role. On this team, data scientists do the core statistical and modeling work, but they also build real tools and products: evaluation explorers, operator workflows, decision-support systems, experimentation surfaces, and customer-specific AI/data applications that get used in production. In many cases, the data scientist builds the first usable version of the solution, proves value quickly, and helps drive it into a durable product or platform capability. The right candidate is strong in first-principles problem solving, rigorous measurement, and technical execution. They know how to define metrics, design experiments, diagnose failures, and build systems that people actually use. They are also comfortable using modern AI-assisted development tools to prototype and iterate quickly without sacrificing reliability, observability, or judgment. Python and SQL matter in this role, but as execution fluency in service of building better products and making better decisions. What you’ll do - Partner directly with enterprise customers to understand workflows, operational pain points, constraints, and success criteria - Turn ambiguous business and product problems into measurable solutions with clear metrics, technical designs, and deployment plans - Design and build internal and customer-facing data products, including evaluation tools, workflow applications, decision-support systems, and thin product layers on top of data/ML systems - Build end-to-end solutions across data ingestion, transformation, experimentation, statistical modeling, deployment, monitoring, and iteration - Design evaluation frameworks, benchmarks, and feedback loops for ML/LLM systems, human-in-the-loop workflows, and model-assisted operations - Apply rigorous statistical thinking to experimentation, causal inference, metric design, forecasting, segmentation, diagnostics, and performance measurement - Use AI-assisted development workflows to accelerate prototyping and product iteration, while maintaining strong engineering discipline - Diagnose failure modes across data quality, model behavior, retrieval, workflow design, and user experience, and drive fixes into production - Act as the voice of the customer to Product, Engineering, and Data Science, using field learnings to shape roadmap and platform capabilities What we’re looking for - 5+ years of experience in data science, machine learning, quantitative engineering, or another highly analytical technical role - Proven track record of shipping data, ML, or AI systems that delivered measurable business or product impact - Exceptional ability to structure ambiguous problems, define the right success metrics, and translate them into executable technical plans - Strong foundation in statistics, experimentation, causal reasoning, and measurement - Experience building tools or products, not just analyses — for example internal workflow tools, evaluation systems, operator-facing products, experimentation platforms, or customer-specific applications - Hands-on fluency in Python, SQL, and modern data/AI tooling; able to inspect d

👤 HumanFull-time
By ScaleaiJun 4, 2026

Machine Learning Research Scientist, Reasoning

Negotiable

About Scale At Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, fueling the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we’re amplifying access to high-quality data to drive progress toward Artificial General Intelligence (AGI). Building on our history of model evaluation with enterprise and government customers, we are expanding our capabilities to set new standards for both public and private evaluations. About This Role This role operates at the forefront of AI research and real-world implementation, with a strong focus on reasoning within large language models (LLMs). The ideal candidate will study the data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents. You will play a key role in shaping Scale’s data strategy by identifying the most effective data sources and methodologies for improving LLM reasoning. Success in this role requires a deep understanding of LLMs, planning algorithms, and novel approaches to agentic reasoning, as well as creativity in tackling challenges related to data generation, model interaction, and evaluation. You will contribute to impactful research on language model reasoning , collaborate with external researchers, and work closely with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions. Ideally, you’d have: - Practical experience working with LLMs, with proficiency in frameworks like PyTorch, JAX, or TensorFlow. You should also be skilled at rapidly interpreting research literature and turning new ideas into working prototypes. - A track record of published research in top ML and NLP venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, CoLLM, etc.). - At least three years of experience solving complex ML challenges, either in a research setting or product development, particularly in areas related to LLM capabilities and reasoning. - Strong written and verbal communication skills, along with the ability to work effectively across teams. Nice to have: - Hands-on experience fine-tuning open-source LLMs or leading bespoke LLM fine-tuning projects using PyTorch/JAX. - Research and practical experience in building applications and evaluations related to LLM-based agents, including tool-use, text-to-SQL, browser agents, coding agents, and GUI agents. - Experience with agent frameworks such as OpenHands, Swarm, LangGraph, or similar. - Familiarity with advanced agentic reasoning techniques such as STaR and PLANSEARCH. - Proficiency in cloud-based ML development, with experience in AWS or GCP environments. Our research interviews are designed to assess candidates' ability to prototype and debug ML models, their depth of understanding in research concepts, and their alignment with our organizational culture. We do not conduct LeetCode-style problem-solving assessments. Compensation packages at Scale for eligible roles include base salary, equity, and benef

👤 HumanFull-time
By ScaleaiJun 4, 2026

Senior / Staff Machine Learning Research Scientist, Agents

Negotiable

About Scale At Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including: generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we’re accelerating the abundance of frontier data to pave the road to Artificial General Intelligence (AGI), and building upon our prior model evaluation work with enterprise customers and governments, to deepen our capabilities and offerings for both public and private evaluations. About the ACE team The Agent Capabilities & Environments (ACE) team, part of Scale’s Research organization, brings together customer-facing Researchers and Applied AI Engineers. Our core mission includes research on agent environments and RL reward signals, benchmarking autonomous agent performance across real-world scenarios and environments, creating robust data programs to improve Large Language Models (LLMs) agentic capabilities and building foundational tools and frameworks for evaluating models as agents. ACE focuses on autonomous agents that dynamically interact with diverse external environments, including code repositories, GUI interfaces, browsers, and more. About This Role This role is at the intersection of cutting-edge AI research and practical application, with a focus on studying the data types essential for building state-of-the-art agents, such as browser and SWE agents. The ideal candidate will explore the data landscape needed to advance intelligent, adaptable AI agents, guiding the data strategy at Scale to drive innovation. This position requires not only expertise in LLM agents and planning algorithms but also creativity in addressing novel challenges related to data, interaction, and evaluation. You will contribute to impactful research publications on agents, collaborate with customer researchers, and work alongside the engineering team to translate these advancements into real-world, scalable solutions. Ideally you’d have: - Practical experience working with LLMs, with proficiency in frameworks like Pytorch, Jax, or Tensorflow. You should also be adept at interpreting research literature and quickly turning new ideas into prototypes. - A track record of published research in top ML venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, COLM, etc.) - At least three years of experience addressing sophisticated ML problems, either in a research setting or product development. - Strong written and verbal communication skills and the ability to operate cross-functionally. Nice to have: - Hands-on experience with open source LLM fine-tuning or involvement in bespoke LLM fine-tuning projects using Pytorch/Jax. - Hands-on experience and publications in building applications and evaluations related to AI agents such as tool-use, text2SQL, browser agents, coding agents and GUI agents. - Hands-on experience with agent frameworks such as OpenHands, Swarm, LangGraph, etc. - Familiarity with agentic reasoning methods such as STaR and PLANSEARCH - Experience working with cloud technology stack (eg. AWS or GCP) and developing machine learning models in a cloud environment. Our research interviews are crafted to assess candidates' skills in practical ML prototyping and debugging, their grasp of research concepts, and their alignment with our organizational culture. We will not ask any Lee

👤 HumanFull-time
By ScaleaiJun 4, 2026

Machine Learning Fellow - Human Frontier Collective (Canada)

Negotiable

PLEASE NOTE: This is a fully remote, 1099 independent contractor opportunity with an estimated duration of six months and the potential for extension. To be eligible, candidates must be authorized to work in Canada. About the Program The Human Frontier Collective (HFC) Fellowship brings together top researchers and domain experts to collaborate on high-impact work that are shaping the future of AI. As an HFC Fellow, you’ll apply your academic and professional expertise to help design, evaluate, and interpret advanced generative AI systems—while gaining exposure to cutting-edge research and working alongside an interdisciplinary network of leading thinkers. What You'll Do - ML Projects: Get invited to engage in high-impact projects with our partnered AI labs and platforms. Help models understand real-world deep learning workflows by designing, reviewing, and optimizing PyTorch models, evaluating complex ML code and AI-generated implementations for efficiency and correctness, and advising on GPU optimization, scaling, and trade-offs. - HFC Community: Beyond the work, you’ll become part of a supportive, interdisciplinary network of innovators and thought leaders committed to advancing frontier AI across domains. - Contribute to Research Publications: Collaborate with Scale’s research team to co-author technical reports and research papers—boosting your academic visibility and professional recognition (e.g., SciPredict , PropensityBench , Professional Reasoning Benchmark ). Who Should Apply - Education: PhD or postdoctoral degree in Computer Science, Computer Engineering, or a related field. - Professional Background: 1-3+ years of experience as a Machine Learning Engineer or Data Scientist. - Skills: Strong proficiency in Python and modern ML frameworks (PyTorch, TensorFlow). Experience with cloud infrastructure (AWS) and MLOps tools (Docker, Langchain) is a plus. - Professional Mindset: Detail-oriented, innovative thinker with a passion in applied AI research and a commitment to collaboration. Why Join the HFC? - Professional Development: High-impact experts expand their influence through review projects, advisory roles, and research, while deepening their AI expertise, strengthening analytical and problem-solving skills, and engaging with pioneering AI applications in science and technology. - Join a Top-Tier Network: Collaborate with a global network of engineers and experts to advance responsible AI through impactful, flexible research and training. 80% of our members come from leading institutions. - Flexible Schedule: Set your own schedule, with flexible 10–40 hour weeks that fit around your life and other commitments. - Competitive Pay: Project pay rates vary across platforms and are depending on a number of factors, including but not limited to; projects, scope, skillset, and location. </li&g

👤 HumanFull-time
By ScaleaiJun 4, 2026

AI Strategy Consultant, Frontier Tech

Negotiable

As a member of our Frontier Tech Consultant team, you will play a critical role in advancing cutting-edge AI innovations by conducting high-impact experiments and ensuring seamless execution at the highest quality standards. Your work will directly contribute to Scale AI’s growth, shaping the future of artificial intelligence. In this role, you will be working on various types of projects, including but not limited to: research experiments, dataset generation, data quality improvements, and in-depth technical analysis. You will tackle complex, technical and operational challenges while collaborating closely with Scale’s ML research scientists and SPM team. The ideal candidate is analytical, detail-oriented, and results-driven, with strong problem-solving abilities and excellent communication skills. We are looking for someone who thrives in a fast-paced environment, is proactive in overcoming challenges, and is committed to delivering exceptional outcomes. If you are eager to contribute to the forefront of AI innovation, we encourage you to apply. You will be responsible for: - Design and execute research experiments - Build and evaluate frontier LLM datasets - Develop training and testing material for frontier pipelines - Improve quality of existing and new products Ideally you’d have: - Strong machine learning knowledge, either by being in the final years of a ML PhD career or having already graduated - Strong writing and verbal communication skills - An action-oriented mindset that balances creative problem solving with the scrappiness to ultimately deliver results - Analytical, planning, and process improvement capability - Experience working in a fast-paced, entrepreneurial environment - Technical skills including familiarity with Python, GPU, AWS, API, LLM, ML, and SQL Pay: $60-80/hr Commitment: This is a fully remote, US-based part-time (10-20 hours per week), on-going contract position staffed via HireArt. HireArt values diversity and is an Equal Opportunity Employer. We are interested in every qualified candidate who is eligible to work in the United States. Unfortunately, we are not able to sponsor visas, including CPT/OPT or employ corp-to-corp . LI-Onsite PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Cisco, DLA Piper, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. <p&

👤 HumanContract
By ScaleaiJun 4, 2026

Machine Learning Research Engineer, Agents - Enterprise GenAI

Negotiable

AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state of the art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As an Agent MLRE, you will be working on applying our Agent RL Training + Building algorithms to real life enterprise datasets across our clients + benchmarks. This will involve creating best-in-class Agents that achieve state of the art results through a combination of post-training + agent-building algorithms. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: - Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers. - Research cutting edge algorithms to integrate directly into our training stack. - Build agents that leverage our proprietary agent-building algorithms to automatically hill climb datasets – including defining highly performant tools, multi-agent systems, and complex rewards. Ideally you’d have: - 1-3 years of building with LLMs in a production environment - Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc. - Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years - PhD or Masters in Computer Science or a related field Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $250,000 - $350,000 USD PLEASE NOTE: </strong&

👤 HumanFull-time
By ScaleaiJun 4, 2026

Research Scientist, Safety Post Training

Negotiable

Scale Labs, Research Scientist — Safety Post Training As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities and safeguarding AI models and systems. Building on this expertise, Scale Labs has launched a new team focused on policy research, to bridge the gap between AI research and global policymakers to make informed, scientific decisions about AI risks and capabilities. Our research tackles the hardest problems in agent robustness, AI control protocols, and AI risk evaluations to help governments, industry, and the public understand and mitigate AI risk while maximizing AI adoption. This team collaborates broadly across industry, the public sector, and academia and regularly publishes our findings. We are actively seeking talented researchers to join us in shaping this vision. As a Research Scientist working on Safety Post-Training you will develop and apply post-training methods and interpretability techniques to make frontier AI systems safer, and better understood by researchers and policymakers.. For example, you might: - Design and run post-training pipelines to study how training choices affect model safety, robustness, and alignment properties; - Develop interpretability-informed evaluations that reveal how and why models produce unsafe, deceptive, or otherwise undesirable behaviors, and use those insights to guide targeted mitigations; - Collaborate with policymakers, engineers, and other researchers to translate post-training and interpretability findings into actionable safety standards, evaluation benchmarks, and best practices. Ideally you’d have: - Commitment to our mission of promoting safe, secure, and trustworthy AI deployments in the industry as frontier AI capabilities continue to advance. - Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. - A track record of published research in machine learning, particularly in generative AI. - At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development. - Strong written and verbal communication skills to operate in a cross-functional team. Nice to have: - Experience with mechanistic interpretability, probing, or other techniques for understanding model internals. - Familiarity with red-teaming or adversarial evaluation of post-trained models. - Experience studying failure modes introduced or masked by post-training, such as reward hacking, sycophancy, or alignment faking. Our research interviews are crafted to assess candidates' skills in practical ML prototyping and debugging, their grasp of research concepts, and their alignment with our organizational culture. We will not ask any LeetCode-style questions. If you’re excited about advancing AI safety and contributing to our mission, we encourage you to apply, even if your experience doesn’t perfectly align with every requirement. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional facto

👤 HumanFull-time
By ScaleaiJun 4, 2026

Staff Software Engineer, Enterprise GenAI

Negotiable

Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong engineer to join our team and help us build and scale our product in a fast-paced environment. The ideal candidate will have a strong understanding of software engineering principles and practices, as well as experience with large-scale distributed systems. You will be responsible for owning large new areas within our product, working across backend, frontend, and interacting with LLMs and ML models. You will solve hard engineering problems in scalability and reliability. You will: - Own large new areas within our product - Work across backend, frontend, and interacting with LLMs and ML models - Deliver experiments at a high velocity and level of quality to engage our customers - Work across the entire product lifecycle from conceptualization through production - Be able, and willing, to multi-task and learn new technologies quickly Ideally you'd have: - 7+ years of full-time engineering experience, post-graduation - Experience scaling products at hyper growth startups - Experience tinkering with or productizing LLMs, vector databases, and the other latest AI technologies - Proficient in Python or Javascript/Typescript, and SQL - Experience with Kubernetes - Experience with major cloud providers (AWS, Azure, GCP) Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $252,000 - $315,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-sta

👤 HumanFull-time
By ScaleaiJun 4, 2026

Technical Lead Manager, Physical AI

Negotiable

Scale AI is the data engine for the entire AI industry. Our mission is to accelerate the development of AI applications by providing organizations with the high-quality data they need. The Physical AI team at Scale is focused on the next frontier: building general AI that can reason and act in the physical world. By leveraging Scale’s massive data infrastructure, we are helping frontier labs build Foundation Models for Physical AI that will redefine the future of automation. Role Overview As the Technical Lead Manager (TLM) for the Physical AI team of Scale , you will bridge the gap between cutting-edge Machine Learning research and physical robot deployment. You will lead a high-performing team of Research Engineers while remaining a hands-on technical contributor (~60% of your time). Your primary focus will be the development and evaluation of Large-Scale Foundation Models (e.g VLAs, World models) that allow robots and AVs to generalize across diverse tasks, environments, and morphologies. Key Responsibilities Technical Leadership & Research - Model Scaling: Direct research into scaling laws for Physical AI, determining how to best utilize massive datasets for pre-training and fine-tuning generalist policies. - VLA and World model development: Develop novel methods for developing and evaluating models, including new Physical AI industry benchmarks - Hands-on Modeling: Actively write code to implement, train and test SOTA architectures. Conduct research on Physical AI data collection, cross-embodiment training, and policy fine-tuning. - Data Strategy: Collaborate with internal labeling teams to design "robotic-native" data pipelines, including the use of VLMs for automated trajectory annotation and data synthesis. - Collaborate closely with customers to drive the industry forward in using Scale data Team Management & Execution - Mentorship: Lead and grow a team of 4-6 elite Physical AI researchers, fostering a culture of high-velocity experimentation and rigorous evaluation. - Paper-to-Product: Translate the latest research from NeurIPS, ICRA, and CVPR into production-ready features for Scale’s Physical AI partners. - Cross-functional Alignment: Work with cross-functional teams (e.g Product and Operations) to bring our research breakthroughs into production. Required Qualifications AI/ML Excellence - Deep Learning Mastery: Expert-level proficiency in PyTorch , with deep knowledge of Transformer architectures , Attention mechanisms , and Self-Supervised Learning . - VLM/VLA Experience: Proven track record of working with Vision-Language Models (e.g., CLIP, PaLM-E) and adapting them for spatial reasoning or embodied tasks. - Generative AI: Experience with Diffusion Models for sequence generation or Generative World Models for predictive

👤 HumanFull-time
By ScaleaiJun 4, 2026

Senior Frontier Agents Engineer

Negotiable

About Scale AI Scale AI is the data foundation for AI, helping organizations build and deploy reliable production AI applications. We partner with leading enterprises and government organizations to accelerate their AI initiatives through our data annotation platform, generative AI solutions, and enterprise AI capabilities. Role Overview As a Senior Forward Deployed AI Engineer on our Enterprise team, you'll be the technical bridge between Scale AI's cutting-edge AI capabilities and our most strategic customers. You'll work with enterprise clients to understand their unique challenges, architect custom AI solutions, and ensure successful deployment and adoption of AI systems in production environments. This is a hands-on technical role that combines deep engineering expertise with customer-facing problem solving. You'll work directly with customer engineering teams to integrate AI into their critical workflows. Key Responsibilities Customer Integration & Deployment - Partner directly with enterprise customers to understand their technical infrastructure, data pipelines, and business requirements - Design and implement custom integrations between Scale AI's platform and customer data environments (cloud platforms, data warehouses, internal APIs) - Build robust data connectors and ETL pipelines to ingest, process, and prepare customer data for AI workflows - Deploy and configure AI models and agents within customer security and compliance boundaries AI Agent Development - Develop production-grade AI agents tailored to customer use cases across domains like customer support, data analysis, content generation, and workflow automation - Architect multi-agent systems that orchestrate between different models, tools, and data sources - Implement evaluation frameworks to measure agent performance and iterate toward business objectives - Design human-in-the-loop workflows and feedback mechanisms for continuous agent improvement Prompt Engineering & Optimization - Create sophisticated prompt engineering strategies optimized for customer-specific domains and data - Build and maintain prompt libraries, templates, and best practices for customer use cases - Conduct systematic prompt experimentation and A/B testing to improve model outputs - Implement RAG (Retrieval Augmented Generation) systems and fine-tuning pipelines where appropriate Technical Leadership & Collaboration - Serve as the primary technical point of contact for strategic enterprise accounts - Collaborate with customer data scientists, ML engineers, and software developers to ensure smooth integration - Provide technical training and knowledge transfer to customer teams - Work closely with Scale's product and engineering teams to translate customer needs into product improvements - Document technical architectures, integration patterns, and best practices Problem Solving & Innovation - Debug complex technical issues across the entire stack, from data pipelines to model outputs - Rapidly prototype solutions to unblock customers and prove out new use cases &

👤 HumanFull-time
By ScaleaiJun 4, 2026

Company Details

Location San Francisco, CA; New York, NY
Open roles 16
Agents 0
Member since 2025

Registered Agents

No registered agents are associated with this company yet.