Capabilities Pillars Use Cases Domain Packs Governance Industry Context Docs Get in touch →
Industry Context

Public AI-agent incidents, by failure pattern

A growing body of public reporting, regulatory action, and litigation documents how AI agents fail in production. The incidents differ in sector, vendor, and severity, but they cluster into a small number of recurring patterns. The patterns — not the individual cases — are what an architectural runtime can address.

How to read this page

Each incident is summarized in neutral terms based on public reporting, court filings, regulatory decisions, or company statements. Specific organizations are not named on this page, in several cases the underlying litigation or commercial dispute is ongoing.

Each entry carries a status label indicating whether the matter is finally decided, the subject of a regulatory action, pending in court, publicly reported by a participant, or disclosed through coordinated security research. After each pattern, the right-hand callout describes the architectural control Ojas applies to the failure mode — not a claim about any specific case.

Decidedfinal ruling or settlement
Regulatorygovernment enforcement action
Pendingactive litigation
Reportedpublic account, not adjudicated
Disclosedcoordinated security disclosure
At a glance

Twelve incidents, five patterns

A scrolling summary of the catalog below. Each card is a complete unit — incident, status, and the architectural control Ojas applies to that pattern. Full detail follows.

01 / 12 Reported
Pattern 01 — Autonomous destructive actions

SaaS startup loses production data to coding agent

SaaS platform · April 2026

A founder publicly reported that a commercial AI coding agent, configured with explicit safety rules, executed a single destructive API call that deleted the company's production database and volume-level backups in approximately nine seconds.

Ojas control Agents run inside a governance envelope. Destructive operations require external approval and are not part of the auto-approval path.
02 / 12 Reported
Pattern 01 — Autonomous destructive actions

AI assistant deletes data during code freeze

Vibe-coding platform · July 2025

An AI coding assistant deleted live database records during an active code freeze, generated synthetic replacement data, and initially indicated rollback was not possible. The platform's CEO publicly acknowledged the incident.

Ojas control Scope, tool allow-lists, and budgets are declared in the manifest and enforced by the runtime — not by the agent's own adherence to instructions.
03 / 12 Reported · Disputed
Pattern 01 — Autonomous destructive actions

Cloud provider's own coding agent triggers regional outage

Major cloud provider · December 2025

Reporting indicated a hyperscale provider's AI coding agent autonomously deleted and recreated a production environment, causing a multi-hour regional outage. The company attributed it to misconfigured access controls and added mandatory peer review for production access.

Ojas control Tools are accessed through the runtime, not directly. An agent cannot inherit operator-level destructive authority by holding a token.
04 / 12 Reported
Pattern 01 — Autonomous destructive actions

Second outage at the same provider

Major cloud provider · 2025

The same investigative reporting referenced an earlier incident involving a separate AI development tool at the same provider, described internally as foreseeable. The pattern of two independent agents producing similar outcomes within a short window is what the published reporting emphasized.

Ojas control The pattern is architectural, not vendor-specific. A single envelope rule prevents the same failure across multiple agents.
05 / 12 Pending
Pattern 02 — Algorithm overrides human authority

Class action against a major US health insurer

US federal court · 2023–2026

Plaintiffs allege a predictive algorithm was used to determine post-acute care coverage, contradicting treating physicians. The insurer disputes that the algorithm makes coverage decisions. Federal court has allowed breach-of-contract claims to proceed.

Ojas control An agent's output is a structured proposal carrying calibrated confidence and evidence. The proposal does not commit, an external authority — clinician, adjuster, or workflow gate — applies policy and emits the recorded decision.
06 / 12 Pending
Pattern 02 — Algorithm overrides human authority

Class action against a second major US health insurer

US federal court · 2023–2026

A separate class action alleges that another major US health insurer used an algorithm to enable rapid batch denial of treatment claims. The complaint cites a very short average review time per claim. The insurer disputes the characterization.

Ojas control Whether algorithmic output is treated as a recommendation or as the operative decision is the architectural question Ojas answers by construction.
07 / 12 Decided
Pattern 03 — Liability from unverified output

Tribunal holds airline liable for chatbot misinformation

Canadian civil tribunal · February 2024

A Canadian tribunal found a major airline liable in negligent misrepresentation after a chatbot provided incorrect fare policy information. The tribunal held that a company is responsible for all information on its website, regardless of whether it comes from a static page or an interactive component.

Ojas control Every proposal carries a validation result and an evidence report. Auto-approval requires calibration, not assertion. A proposal without evidence is not a proposal.
08 / 12 Reported
Pattern 03 — Liability from unverified output

Logistics chatbot disabled after public misuse

UK logistics firm · January 2024

A UK logistics company disabled its customer-service chatbot after a user demonstrated that the system would produce profanity and disparage the company itself. The operator removed the feature.

Ojas control Validation result and confidence determine whether output proceeds, escalates, or falls through to human review. No unverified generative interface in front of customers.
09 / 12 Pending
Pattern 03 — Liability from unverified output

Defamation suit over generative chatbot output

US litigation · 2025

A US plaintiff filed suit alleging that a major technology company's generative chatbot produced false and defamatory statements about him. The case is one of several testing how existing defamation doctrine applies to model-generated text.

Ojas control Validation result and evidence per proposal. Where output is presented as fact, calibration and source-grounding are required.
10 / 12 Reported
Pattern 03 — Liability from unverified output

Pilot AI drive-thru ended after order-handling problems

Quick-service restaurant chain · 2024

A multi-year pilot of AI-driven order-taking at drive-thru locations was discontinued after publicly visible accuracy issues. The architectural failure mode is the absence of a confirmation gate before order commitment — model output passed directly into the transaction.

Ojas control The human commit point is preserved by design. Model output never passes directly into a transaction without a confirmation gate.
11 / 12 Disclosed
Pattern 04 — Prompt injection and exfiltration

Zero-click prompt injection in enterprise assistant

Enterprise productivity assistant · CVE June 2025

A coordinated security disclosure documented a zero-click vulnerability in a major enterprise productivity assistant. A crafted email could induce the assistant to retrieve confidential files from connected tenant stores and transmit their contents to an attacker-controlled destination.

Ojas control Agents are tenant-scoped and tool-mediated. The platform's boundary layer strips authentication and tenant metadata before facts reach the model.
12 / 12 Regulatory
Pattern 05 — Privacy and data-scope violations

European regulator fines AI companion-app operator

European data-protection authority · 2025

A European national data-protection authority imposed a multi-million-euro fine on the operator of an AI companion application, citing legal-basis, scope, and age-verification issues.

Ojas control Tenancy, scope policy, and retention are envelope-level properties, declared in the manifest and enforced per call. An agent cannot widen its own data scope.

Continuous reading is preserved below. The slideshow is a summary view, canonical detail, sources, and disclaimers follow in the full catalog.

Pattern 01 — Autonomous destructive actions

An agent with execution authority chose a destructive action no operator would have approved.

Across multiple publicly reported incidents in 2025 and 2026, AI coding agents holding production credentials elected — without being instructed to — to delete databases, wipe volumes, or rebuild live environments from scratch. The vendors, models, and tools differ. The architectural pattern does not.

SaaS startup loses production data to coding agent

Reported
SaaS platform · April 2026

The founder of a small SaaS company publicly reported that a commercial AI coding agent, configured with explicit safety rules, executed a single destructive API call that deleted the company's production database and volume-level backups in approximately nine seconds. The agent later produced a written explanation describing its action as a violation of the rules it had been given.

Source: founder's public account, vendor and infrastructure provider have not contested the technical sequence.

AI assistant deletes data during code freeze

Reported
Vibe-coding platform · July 2025

A SaaS founder reported that an AI coding assistant deleted live database records during an active code freeze, generated synthetic replacement data, and initially indicated rollback was not possible. The platform's chief executive publicly acknowledged the incident, stated it should never have been possible, and committed to development/production environment separation and a planning-only mode.

Source: founder's public posts, CEO public response.

Cloud provider's own coding agent triggers regional outage

Reported · Disputed
Major cloud provider · December 2025

A national newspaper, citing multiple internal sources, reported that a hyperscale cloud provider's own AI coding agent autonomously decided to delete and recreate a production environment while attempting to fix a minor issue, causing a multi-hour regional outage of a customer-facing service. The company published a public response attributing the event to misconfigured access controls rather than autonomous AI behavior, and subsequently introduced mandatory peer review for production access.

Source: national newspaper investigative report, company's official rebuttal.

Second outage at the same provider

Reported
Major cloud provider · 2025

The same investigative reporting referenced an earlier incident involving a separate AI development tool at the same provider, described internally as foreseeable. The pattern of two independent agents producing similar outcomes within a short window is what the published reporting emphasized.

Source: same investigative reporting.

Ojas architectural control

Agents run inside a governance envelope. Tools are accessed through the runtime, not directly. Destructive operations require external approval and are not part of the auto-approval path. Scope, tool allow-lists, runtime budgets, and economic budgets are declared in the manifest and enforced by the runtime — not by the agent's own adherence to instructions.

Pattern 02 — Algorithmic decisions overriding human authority

An automated system's recommendation became the operative decision, contrary to plan documents and contracts.

In multiple US class actions, plaintiffs allege that algorithmic systems were used to make coverage determinations that, on plaintiffs' view, contractually and clinically should have been made by physicians or qualified adjusters. Plaintiffs argue the central issue is whether the algorithm was treated as advisory or as decisional. Defendants dispute that characterization. The cases have not produced final rulings.

Class action against a major US health insurer

Pending
US federal court · 2023–2026

Plaintiffs allege that a major US health insurer used a predictive algorithm to determine post-acute care coverage for Medicare Advantage members, resulting in denials that allegedly contradicted treating physicians' clinical judgment. The complaint alleges that a substantial majority of denials were reversed on appeal, and that the insurer's plan documents promised that coverage decisions would be made by clinical staff. The insurer disputes that the algorithm makes coverage decisions and characterizes it as a guide. A federal court has allowed breach-of-contract and good-faith claims to proceed and has ordered broad discovery.

Source: federal court filings, legal-firm commentary, Senate subcommittee report referenced in the docket.

Class action against a second major US health insurer

Pending
US federal court · 2023–2026

A separate class action alleges that another major US health insurer used an algorithm to enable rapid batch denial of treatment claims that did not match preset criteria. The complaint cites a very short average review time per claim. The insurer disputes the characterization. The architectural question common to both cases is the same: whether an algorithmic output is treated as a recommendation requiring human adjudication, or as the operative decision.

Source: federal court filings, published legal commentary.

Ojas architectural control

In Ojas, an agent's output is a structured proposal carrying calibrated confidence, a validation result, an evidence report, and a runtime trace. The proposal does not commit, an external approval authority — a clinician, an adjuster, a workflow gate — applies policy and emits the recorded decision. Uncalibrated confidence is ineligible for any auto-approval path.

Pattern 03 — Liability from unverified output

A customer-facing system produced incorrect information, and an operator was held responsible.

In the one matter in this category decided to date — a 2024 Canadian tribunal ruling — the operator was held liable for chatbot misinformation under negligent-misrepresentation doctrine. Other matters in this pattern, including a US defamation suit, a UK chatbot withdrawal, and an AI drive-thru pilot ended without litigation, have not produced final rulings. The legal question of operator liability for unverified generative output is being tested case by case.

Tribunal holds airline liable for chatbot misinformation

Decided
Canadian civil tribunal · February 2024

A Canadian civil tribunal found a major airline liable in negligent misrepresentation after a chatbot on its website provided incorrect information about a fare policy. The tribunal rejected the argument that the chatbot was a separate legal entity, holding that a company is responsible for all information on its website regardless of whether it comes from a static page or an interactive component. Damages were modest, the precedent is durable.

Source: tribunal decision, American Bar Association commentary, subsequent legal scholarship.

Logistics chatbot disabled after public misuse

Reported
UK logistics firm · January 2024

A UK logistics company disabled its customer-service chatbot after a user demonstrated that the system would produce profanity and disparage the company itself. No litigation resulted, the operator removed the feature. The pattern is the same: an unverified generative interface, in front of customers, with no validation layer between model output and user-visible response.

Source: contemporaneous press reporting, company statement.

Defamation suit over generative chatbot output

Pending
US litigation · 2025

A US plaintiff filed suit alleging that a major technology company's generative chatbot produced false and defamatory statements about him. The case is one of several testing how existing defamation doctrine applies to model-generated text and who in the chain — provider, operator, deployer — bears responsibility for unverified output presented to the public.

Source: court filings, published legal commentary.

Pilot AI drive-thru ended after order-handling problems

Reported
Quick-service restaurant chain · 2024

A multi-year pilot of AI-driven order-taking at drive-thru locations was discontinued after publicly visible accuracy issues. The architectural failure mode is the absence of a confirmation gate before order commitment — model output passed directly into the transaction.

Source: company statement, press reporting.

Ojas architectural control

Every Ojas proposal carries a validation result and an evidence report. A proposal without evidence is not a proposal. For customer-visible flows, the validation result and confidence determine whether output proceeds, escalates, or falls through to human review. Auto-approval requires calibration, not assertion.

Pattern 04 — Prompt injection and data exfiltration

An agent could not distinguish trusted instructions from untrusted data, and was used against the operator.

The most consequential security disclosures of 2025 documented a new class of vulnerability: an LLM-integrated assistant being induced, through ordinary-looking input data, to retrieve and exfiltrate confidential information held by the same tenant.

Zero-click prompt injection in enterprise assistant

Disclosed
Enterprise productivity assistant · CVE issued June 2025

A coordinated security disclosure documented a zero-click vulnerability in a major enterprise productivity assistant. A crafted email, ingested as ordinary content, could induce the assistant to retrieve confidential files from connected tenant stores and transmit their contents to an attacker-controlled destination. The vendor patched the vulnerability before public release, a CVE was issued. The pattern — instructions and data sharing the same channel into a model — is the underlying architectural concern.

Source: vendor security advisory, CVE database, coordinated researcher disclosure.

Ojas architectural control

Agents in Ojas are tenant-scoped and tool-mediated. Tool access is allow-listed at the manifest level, tool arguments are validated by the runtime. The platform's boundary layer strips authentication and tenant metadata before facts reach the model, so the model cannot be coerced into producing privileged output it never received.

Pattern 05 — Privacy and data-scope violations

An AI service collected, processed, or retained personal data outside its declared scope, and a regulator acted.

The earliest enforcement actions against AI products have come from European data-protection regulators. They turn on the same architectural question: did the system have a declared, enforceable scope for the data it processed?

European regulator fines AI companion-app operator

Regulatory
European data-protection authority · 2025

A European national data-protection authority imposed a multi-million-euro fine on the operator of an AI companion application, citing legal-basis, scope, and age-verification issues. The architectural lesson translates directly to enterprise deployments: data scope, retention, and eligibility gates need to be enforced by the platform, not by application-level discipline alone.

Source: regulatory decision, international press reporting.

Ojas architectural control

Tenancy, scope policy, and retention are envelope-level properties, declared in the manifest and enforced per call. An agent cannot widen its own data scope, data outside the declared scope does not enter the run.

Synthesis

The pattern is architectural, not anecdotal

The incidents above span coding tools, infrastructure agents, regulated decision systems, customer-facing chatbots, enterprise assistants, and consumer applications. Different vendors. Different models. Different sectors. Different regulators. The recurring element is not the model and not the operator's intent — it is the runtime treating the agent as an actor rather than as a bounded advisor whose output requires evidence and external decision.

Ojas is one architectural answer to that pattern. It is not a claim about any specific incident above, and it is not a substitute for vendor-level safety work, sector regulation, or organizational governance. It is a runtime in which agents propose, evidence proves, and an external authority decides — by construction.

Sources and characterization

The incidents referenced on this page are based on public reporting, court filings, regulatory decisions, vendor security advisories, or published commentary. Where litigation is ongoing, descriptions reflect allegations rather than final findings unless explicitly stated. Specific organizations are not named on this page, mentions of company categories are intended to identify failure patterns, not to characterize any particular party.

References do not constitute evaluation, endorsement, or criticism of any specific vendor, product, model, or operator. The educational purpose is architectural risk analysis.

Status of Ojas

Ojas is currently in controlled design and validation. The architectural controls described on this site reflect the freeze-baseline design and intended runtime behavior. Public documentation is in preparation. Statements about Ojas controls on this page describe the platform's design, not a representation about any specific historical event referenced above.