top of page

June 30, 2026: An Insurance Company Just Made an AI Software Engineer a Standard Engineering Tool

  • Writer: James Sale
    James Sale
  • 1 day ago
  • 6 min read

In this post.

  • Hippo deploys Devin, Cognition's AI software engineer, across its entire engineering team

  • Openprise cuts AI token costs by up to 80% for agentic sales and RevOps workflows

  • Salesforce bets on pay-per-resolution pricing, and why the definition matters more than the price

  • RAISE US launches with $500M+ from Amazon, Microsoft, Anthropic, and others to reskill displaced workers

  • Two vendor signals on human-in-the-loop AI design in contact centers and recruiting


Hippo Holdings just deployed Devin, Cognition's AI software engineer, across its full engineering organization to accelerate software development across the insurance lifecycle. Not a team pilot. Not a proof of concept. Engineering-wide, in a regulated industry where the code touches claims processing, underwriting, and policy management.


That's the kind of named, production-grade commitment that separates this week from the typical wave of vendor announcements. There are three other substantive moves alongside it, plus a $500M workforce transition bet that points at the other end of the AI adoption equation.


Hippo's Engineering-Wide Devin Deployment Sets a Regulated-Vertical Benchmark

Insurance software carries compliance obligations that most industries don't. Code errors in claims systems or policy management don't just create technical debt, they create regulatory exposure. Deploying an AI software engineer (a tool that writes, tests, and iterates on code autonomously, with engineers reviewing its output) across the entire engineering function in that context is a deliberate organizational decision, not an experiment.


The engineering-wide scope is the meaningful signal here. Hippo isn't testing whether Devin works on one codebase. The company has decided this is how its software development function will operate going forward.


If you lead or contribute to an engineering team in a regulated vertical, the question your organization should be asking isn't whether AI coding agents are production-ready in complex environments. A named insurance company just provided that answer. The more useful question is whether your review and approval processes for AI-generated code are robust enough to catch errors before they reach production. Teams that have clear checkpoints will absorb these tools faster and more safely than those improvising the governance on the fly.


Openprise Finds the Hidden Cost Driver in Agentic Sales Workflows

The ROI conversation around AI in sales has focused mostly on outputs: more contacts reached, faster outreach, better lead scoring. Openprise is pointing at the input problem.


According to the company, enterprises using its RevOps data automation platform can reduce AI token costs, the fees paid per unit of text processed by a large language model, by up to 30% on general workflows, 40-60% on data-intensive tasks, and up to 80% on agentic use cases. The agentic category includes AI SDR outreach, automated account research, and lead scoring. The mechanism is data preparation handled before the AI model ever sees the data: cleaning, deduplication, and standardization upstream so the model processes less redundant material.


For a workflow processing 10,000 contacts per month, Openprise reports, the cost reduction compounds significantly. Agentic workflows amplify the waste most, because an AI agent makes repeated model calls to complete a multi-step task, each call carrying any upstream data inefficiency forward.


These figures come from Openprise's own analysis and haven't been independently verified, so treat the specific percentages as directional rather than guaranteed. The underlying logic, however, is straightforward and merits pressure-testing against your own AI spend data. If you manage RevOps or sales operations and your AI usage bills have been climbing faster than expected, data quality going in is the variable your team can actually control.


Salesforce Prices Its Customer Service AI on Whether It Actually Works

Salesforce introduced pay-per-resolution pricing for its Agentforce Help Agent, tying the cost of AI customer service directly to whether the AI resolves a customer's issue. A Futurum Group survey puts roughly 18.7% of enterprises as currently using outcome-based pricing for AI (noted as a vendor-adjacent survey, so treat as a directional figure rather than a hard industry count).


The structural shift matters more than the pricing itself. Seat-based or usage-based pricing gives the vendor no financial stake in performance. Pay-per-resolution inverts that relationship: if the agent doesn't resolve the issue, the customer doesn't pay. Salesforce is betting its own revenue on resolution rates.


For operations and customer service leaders, the relevant negotiation has changed. The question is no longer primarily "what does the license cost?" It becomes "what counts as a resolution, who defines it, and how is it verified?" Those definitions will be contested in practice. A resolved ticket in a system log and a customer who actually had their problem solved are not always the same thing. Getting the definition language right in the contract matters more than the per-resolution price.


$500 Million Is Now Flowing Toward Workers, Not Just Infrastructure

Every week in this coverage has included some form of AI-driven displacement news. Oracle disclosed 21,000 jobs cut attributing the reductions to AI deployment. Government filings are getting more explicit about workforce reductions tied to automation. This week brought a different kind of signal.


Former Commerce Secretary Gina Raimondo and former Indiana Governor Eric Holcomb launched RAISE US, a bipartisan nonprofit with more than $500 million in initial funding. Anchor partners include Amazon, Microsoft, Anthropic, the OpenAI Foundation, Bank of America, UPS, General Motors, Eli Lilly, Mastercard, AMD, Cisco, and IBM. Initial state-level pilots are running in Arkansas, Connecticut, Maryland, and Utah, focused on education, training, and workforce transition programs.


Several of the organizations named in that partner list are among those most directly accelerating AI-driven automation across industries. Their participation in a reskilling fund doesn't resolve that dynamic, but it does indicate that the workforce transition problem is now being treated as a shared corporate obligation, not something to leave entirely to government or individuals.


For HR and workforce leaders, RAISE US is an early signal that state-level transition infrastructure is being built at scale. Whether it reaches the workers who need it, and on what timeline, remains the open question. The funding is substantial. The program design and delivery still need to prove themselves in practice.


Two Vendors Building Human Approval Into the Default Architecture

A secondary pattern appears in vendor announcements this week. Several products are explicitly designed to keep humans in the decision chain rather than treating oversight as optional.


Retell AI launched Conductor, described as the first graph-native review system for production voice agents. It shows every proposed change inside the agent's workflow and requires human approval before any change executes. The tool also helps enterprises identify failures, run simulation tests, and improve voice agents at scale for contact centers.


uRecruits launched version 2.0 with seven connected hiring capabilities and a new AI layer called uR Agent. Per CEO Thomas Alexander: "AI assists, humans decide." The platform does not automatically advance, reject, or hire candidates.


Both products are positioning human oversight as a core feature, not a constraint on capability. That positioning reflects where enterprise buyers are right now: willing to deploy AI agents in consequential workflows, but not willing to remove the human checkpoint. Whether these tools deliver on that promise in production depends on implementation quality, as any design that lets humans approve things quickly at high volume can become a rubber stamp rather than a genuine review. For now, they are market signals of where buyer expectations are landing.


Act on These This Week

  • Audit your AI token spend by workflow type before your next budget review. If you're running agentic processes at volume, the cost driver is likely how much data you're passing to the model, not the model itself. Check data quality and deduplication upstream before scaling.


  • If you're evaluating outcome-based AI contracts, draft the resolution definition before you discuss price. The Salesforce pay-per-resolution model shows this pricing structure is becoming available. The risk is ambiguous outcome definitions. Get legal and operations aligned on what constitutes a resolved interaction before signing.


  • Map which engineering workflows in your organization have clear human review checkpoints. The Hippo deployment shows that production-grade AI coding agents are now in use in regulated environments. The limiting factor isn't the AI, it's whether your review and approval process can absorb AI-generated output safely.


  • If you work in HR, workforce planning, or L&D, track the RAISE US state pilot outcomes over the next 12 months. The funding and partner network are real. The delivery model will determine whether this reaches workers who need it or becomes a well-resourced credential program that misses the most exposed roles.


  • Which roles in your organization face the most exposure to AI-driven workflow changes in the next 18 months, and do those employees currently have access to any transition pathway?


If you want to stay current on how AI is reshaping enterprise software development, workforce economics, and the operational decisions underneath it all, Agenticism covers those stories every day. For the curated weekly, monthly, and quarterly digest delivered to your inbox, subscribe at Agenticism on Substack.


Sources

Recent Posts

See All
bottom of page