We’re looking for an Engineering Manager for our Evaluation Platform team to join Procore’s Construction Intelligence organization. In this role, you’ll build the infrastructure and tooling that enables users and internal teams to measure, benchmark, and improve the quality of AI agents — including Search Agent, RFI Create Agent, Invoice Agent, and future agentic products. You will own the end-to-end evaluation lifecycle: from defining quality metrics and building evaluation frameworks, to delivering intuitive interfaces that surface actionable insights about agent performance.
This position reports into Sr Director of the Procore AI Engineering team and will be 2 days per week hybrid role in our Austin office. We’re looking for someone to join us immediately.
What you’ll do:
Lead and grow a team of engineers focused on evaluation infrastructure, quality measurement, and developer tooling for AI agents.
Define the technical vision and roadmap for the Evaluation Platform — covering offline evaluations (batch benchmarks, regression suites) and online evaluations (live traffic quality monitoring, A/B testing).
Partner with AI/ML, Product, and Agent teams to define quality metrics for agents (relevance, accuracy, latency, safety, user satisfaction, token usage) and build automated pipelines to compute them at scale.
Design and deliver user-facing evaluation tools that allow customers and internal teams to assess agent output quality, compare model versions, and identify regressions.
Build frameworks for human-in-the-loop evaluation — annotation workflows, rating interfaces, and inter-rater reliability measurement.
Establish CI/CD quality gates so that new agent versions cannot ship without passing evaluation thresholds.
Drive engineering excellence: code quality, system reliability, test coverage, on-call health, and technical debt management.
Recruit, mentor, and develop engineers — fostering a culture of ownership, curiosity, and rigorous experimentation.
What we’re looking for:
5+ years managing engineering teams or technical leads, with 7+ years total in software engineering.
Experience building evaluation, quality measurement, or observability platforms for LLM-based or agentic systems (RAG pipelines, multi-step agents, tool-use agents).
Strong understanding of evaluation methodologies: precision/recall, LLM-as-judge, human annotation, A/B testing, and statistical significance frameworks.
Proven ability to translate ambiguous problem spaces into clear technical strategies and executable roadmaps.
Hands-on technical depth in backend systems, data pipelines, or distributed infrastructure (Python, Go, or similar)
Familiarity with evaluation frameworks such as RAGAS, DeepEval, LangFuse, or custom eval harnesses.
Background in search relevance (NDCG, MRR) or information retrieval quality systems.
Experience with construction-tech, procurement, or enterprise B2B SaaS domains.
Base Pay Range:
168,560.00 - 231,770.00 USD AnnualThis role may also be eligible for Equity Compensation and/or Bonus Incentive Compensation. Procore is committed to offering competitive, fair, and commensurate compensation. Actual compensation will be based on a candidate’s job-related skills, experience, education or training, and location.
Procore will consider for employment all qualified applicants, including those with arrest or conviction records, in accordance with the requirements of applicable federal, state, and local laws, including the City of Los Angeles’ Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.
A criminal history may have a direct, adverse, and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment: 1. appropriately managing, accessing, and handling confidential information including proprietary and trade secret information, as well as accessing Procore's information technology systems and platforms; 2. interacting with and occasionally having unsupervised contact with internal/external customers, stakeholders, and/or colleagues; and 3. exercising sound judgment.
Procore Technologies is building the software that builds the world. We provide cloud-based construction management software that helps clients more efficiently build skyscrapers, hospitals, retail centers, airports, housing complexes, and more. At Procore, we have worked hard to create and maintain a culture where you can own your work and are encouraged and given resources to try new ideas. Check us out on Glassdoor to see what others are saying about working at Procore.
We are an equal-opportunity employer and welcome builders of all backgrounds. We thrive in a dynamic and inclusive environment. We do not tolerate discrimination against candidates or employees on the basis of gender, sex, national origin, civil status, family status, sexual orientation, religion, age, disability, race, traveler community, status as a protected veteran or any other classification protected by law.
Alternative methods of applying for employment are available to individuals unable to submit an application through this site because of a disability. Contact our People Crew here to discuss reasonable accommodations.
At Procore, we believe in supporting our employees to help them thrive both personally and professionally. We offer a comprehensive range of benefits and perks for full-time employees, including generous paid time off and leave options, healthcare coverage, and career development programs. Discover more about our offerings and how we empower our global team to succeed.
| Product Manager - Domain-specific AI Enablement (Datagrid) | San Francisco, California, United States |
| Integration Software Engineer, Boomi (Remote) | Remote, California |
| Engineering Manager, Evaluation Platform | Austin, Texas, United States |
| Senior Engineering Manager, Agent Studio Platform | Austin, Texas, United States |
| API Specialist | Remote, Costa Rica |
Learn about our applicant and candidate privacy policy and about creating a profile on My Settings.
This website uses cookies to improve your browsing.
We use cookies to personalize content such as job recommendations, and to analyse our traffic. You consent to our cookies if you click "I Accept". If you click on "Manage Cookies", then you can decline the use of performance cookies but you may have a deteriorated user experience. You can change your settings by clicking on the Settings link on the top right of the device.
Procore does not sell Personal Data in the traditional sense, please see our Do Not Sell Policy.
A one-time (for page view) session cookie is necessary to provide protection against a security attack called "Cross-site scripting (XSS)".
This cookie is mandatory, short lived (one page interaction) and contains no personally identifiable information.
This website uses 2 performance cookies.
The first is a long term cookie (13 months) used to remember you as a candidate and maintain your preferences.
The second is a temporary session cookie (lasts for 15 minutes or when your session ends) used to tie activity such as form submissions and page views with location data (city, country) and present a more localized and relevant job recommendations and other career related content.