We’re looking for a Staff ML Data Engineer to join Procore’s AI & Frontier Models organization. In this role, you’ll be responsible for designing and building the data systems that power frontier‑scale machine learning research and applied AI products, with a particular focus on spatial intelligence and multimodal data. The primary goal of this role is to ensure that researchers and engineers can reliably discover, curate, transform, and operate on large‑scale datasets that move from experimentation to production.
As a Staff ML Data Engineer, you’ll work closely with ML researchers, applied ML engineers, and system architects to turn ambiguous research needs into scalable, production‑ready data pipelines. You’ll remain deeply hands‑on while providing technical leadership in data architecture, quality, and operational excellence. This is an opportunity to shape how Procore builds, evaluates, and deploys frontier models by ensuring the underlying data systems are robust, observable, and designed for iteration.
This position reports into an Engineering Manager within Procore AI and will be based in our San Francisco office. We’re looking for someone to join us immediately.
Act as the technical lead for data engineering efforts supporting frontier model research and applied ML systems.
Design, build, and maintain scalable batch and streaming pipelines for multimodal data (e.g., documents, images, spatial metadata).
Partner closely with researchers and architects to translate experimental workflows into reliable, repeatable data systems.
Lead the development of dataset curation, versioning, and lineage workflows that support rapid experimentation and reproducibility.
Establish and uphold standards for data quality, validation, observability, and cost efficiency across AI data pipelines.
Contribute to data architecture decisions spanning research environments and production systems.
Identify gaps or inefficiencies in existing data workflows and run proofs‑of‑concept to evaluate improvements.
Mentor other engineers through code reviews, design discussions, and hands‑on collaboration.
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
8+ years of experience designing and operating complex data systems in production or research‑adjacent environments.
Strong proficiency in SQL and Python; experience with data‑intensive or distributed systems.
Proven experience building scalable data pipelines that support machine learning training, evaluation, or inference workflows.
Solid understanding of data modeling, dataset lifecycle management, and data quality best practices.
Comfort operating in highly ambiguous problem spaces and collaborating closely with researchers and architects.
Demonstrated ability to lead through direct technical contribution, mentorship, and setting engineering standards.
Strong communication skills, with the ability to explain technical tradeoffs to both research and engineering audiences.
Nice to have experience with technologies such as:
ML & Research Data: Large‑scale dataset curation, annotation workflows, experiment tracking, reproducibility tooling
Data Platforms: Databricks, Spark, lakehouse architectures, cloud data warehouses
Streaming & Pipelines: Kafka, Pub/Sub, event‑driven data architectures
Orchestration & Observability: Airflow, Dagster, data quality and lineage tools
Cloud & Infrastructure: AWS or GCP, containerized data workloads, CI/CD, infrastructure‑as‑code
Performance & Cost: Optimizing data pipelines for GPU‑backed training and large‑scale inference workloads
Base Pay Range:
227,332.00 - 312,581.50 USD AnnualThis role may also be eligible for Equity Compensation and/or Bonus Incentive Compensation. Procore is committed to offering competitive, fair, and commensurate compensation. Actual compensation will be based on a candidate’s job-related skills, experience, education or training, and location.
Procore will consider for employment all qualified applicants, including those with arrest or conviction records, in accordance with the requirements of applicable federal, state, and local laws, including the City of Los Angeles’ Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.
A criminal history may have a direct, adverse, and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment: 1. appropriately managing, accessing, and handling confidential information including proprietary and trade secret information, as well as accessing Procore's information technology systems and platforms; 2. interacting with and occasionally having unsupervised contact with internal/external customers, stakeholders, and/or colleagues; and 3. exercising sound judgment.
Procore Technologies is building the software that builds the world. We provide cloud-based construction management software that helps clients more efficiently build skyscrapers, hospitals, retail centers, airports, housing complexes, and more. At Procore, we have worked hard to create and maintain a culture where you can own your work and are encouraged and given resources to try new ideas. Check us out on Glassdoor to see what others are saying about working at Procore.
We are an equal-opportunity employer and welcome builders of all backgrounds. We thrive in a dynamic and inclusive environment. We do not tolerate discrimination against candidates or employees on the basis of gender, sex, national origin, civil status, family status, sexual orientation, religion, age, disability, race, traveler community, status as a protected veteran or any other classification protected by law.
Alternative methods of applying for employment are available to individuals unable to submit an application through this site because of a disability. Contact our People Crew here to discuss reasonable accommodations.
At Procore, we believe in supporting our employees to help them thrive both personally and professionally. We offer a comprehensive range of benefits and perks for full-time employees, including generous paid time off and leave options, healthcare coverage, and career development programs. Discover more about our offerings and how we empower our global team to succeed.
| Product Support Engineer | Maadi as Sarayat Al Gharbeyah, Cairo, Egypt |
| Design Systems Program Manager | Austin, Texas, United States |
| Product Manager - Owner Market | Austin, Texas, United States |
| Principal Product Manager | Austin, Texas, United States |
| Principal Applied Research Scientist (Datagrid) | US - Remote CA - Bay Area, California, United States |
Learn about our applicant and candidate privacy policy and about creating a profile on My Settings.
This website uses cookies to improve your browsing.
We use cookies to personalize content such as job recommendations, and to analyse our traffic. You consent to our cookies if you click "I Accept". If you click on "Manage Cookies", then you can decline the use of performance cookies but you may have a deteriorated user experience. You can change your settings by clicking on the Settings link on the top right of the device.
Procore does not sell Personal Data in the traditional sense, please see our Do Not Sell Policy.
A one-time (for page view) session cookie is necessary to provide protection against a security attack called "Cross-site scripting (XSS)".
This cookie is mandatory, short lived (one page interaction) and contains no personally identifiable information.
This website uses 2 performance cookies.
The first is a long term cookie (13 months) used to remember you as a candidate and maintain your preferences.
The second is a temporary session cookie (lasts for 15 minutes or when your session ends) used to tie activity such as form submissions and page views with location data (city, country) and present a more localized and relevant job recommendations and other career related content.