Behavioural verification

Find the bug your tests never scripted — and prove it against your own data.

A population of AI personas explores your app like real users, reaching states your scripts never visit. When one breaks a rule, Abantu confirms it against your app's own server-side state — a pass/fail verdict you can block a release on, not a model's opinion.

abantu · combined report
VERDICTWEAKNESS FOUND

An order shipped while its payment was still pending.

The required PAID step never happened — confirmed against Sylius's own order history, not the persona's say-so.

NEWorder placed
AWAITING PAYMENTnot settled
PAIDDev · order-skipperstep skipped
SHIPPEDDev · order-skippershipped anyway
order #19 — payment: AWAITING_PAYMENT · shipment: SHIPPED (shipped before paid)
Why Abantu

Real users don't follow your happy path. Neither does Abantu.

Scripted tests check the steps you anticipated. Real people do the unexpected — and that's where business logic cracks. Abantu behaves like real users and pushes on the rules, surfacing the weaknesses and flakiness scripts never reach.

01 / behave

Behaves like real users

Personas pursue real goals across roles — with the messiness, impatience and improvisation of actual people.

02 / probe

Stresses your business logic

Out-of-order actions, invalid state transitions, permission gaps and race conditions — probed, not assumed.

03 / expose

Exposes flakiness & weak spots

Intermittent failures and states that should never be reachable, caught before your users hit them.

The verification gap

AI writes the code. Something has to prove it still works.

Agents now generate features in loops — and their own tests along with them — faster than any team can read the diffs. The question moves from “did it compile?” to “does the behaviour still hold for real users?” That gap is where Abantu lives.

∞ / volume

Code is cheap now

Agents ship features and their own passing tests on repeat. Volume stopped being the constraint.

? / trust

Trust is the bottleneck

Knowing the behaviour is right — across roles, edge cases and state — is the hard part. Reading every diff doesn't scale.

✓ / verify

Verify behaviour, not code

Abantu checks what users actually experience and whether your business rules hold — the layer agent-written tests skip.

Live demo

Watch an order ship before it's paid.

In a self-hosted Sylius store, Abantu sends four real users through the shop at once — they browse, fulfil, and probe whether the rules actually hold: can an order reach a state it shouldn't? One ships goods whose payment never cleared. The rail below is read from Sylius's own order ledger; the dashed gap is the required PAID step that never happened.

NEWorder placed
AWAITING PAYMENTnot settled
PAIDstep skipped
SHIPPEDshipped anyway
verified transition severed — shipped before payment cleared
How it works

Three steps. Minutes, not test scripts.

01 / describe

Describe your users

Write a persona — goal, role, tech-savviness, patience. No brittle test code to maintain.

02 / drive

Abantu drives your app

Each persona observes the live page, decides like that user, and acts — until it reaches the goal or realistically gives up.

03 / verdict

You get the truth

A hard pass/fail on the workflow, the business-logic weaknesses and flaky behaviour it found, and the friction along the way.

Pricing

Priced on the logic you put behind the gate.

Not per-seat, not per-run, not per-app. One app, gated — you climb as you verify more of its business logic: more critical flows, deeper verifier types, regression baselines. The bill grows because your coverage did.

Free / OSS

$0
forever · one app
  • Leads only — the model's findings, no gate
  • Personas + combined HTML report
  • Capped runs
  • Community support
Start free

Team

$300–600
per month · one app
  • The verified gate — server-confirmed, CI-safe
  • REST & GraphQL verifiers
  • CI gate (GitHub & GitLab), run history
  • Saved-session auth (SSO/MFA), regression baselines
Start here
Most popular

Business

$1.5–3k
per month · one app, deeper
  • Advanced verifiers · temporal, cross-record, SQL upcoming
  • More gated flows & invariants
  • Multiple environments (staging → prod)
  • SSO & priority support
Book a demo

Enterprise

$30k+
per year · multi-app
  • Unlimited apps & flows
  • On-prem / VPC runner
  • SLA & security review
  • Dedicated support
Talk to us

Priced as CI infrastructure — a gate on every PR, not a per-seat IDE tool. Anchored against the cost of one prevented business-logic incident, not hours saved.

Responsible by design

Powerful AI, kept accountable.

Tech that earns trust. The AI assists; people stay in control.

authorized

Authorized use only

Abantu runs only against systems you own or are verified to control. It identifies its own traffic and ships no anti-detection tooling — a guardrail, not a weapon.

human-in-loop

Human-approved

Agents surface findings and drive flows — your team reviews and signs off. No silent autonomy over what ships.

eu / gdpr

EU-based & GDPR-aware

Built and run from the EU, with data handling designed around GDPR from the start.

transparent

Transparent runs

Every run produces a readable trace — what each persona did, decided, and where it got stuck. No black box.

Get in touch

See where your users get stuck.

Book a 20-minute demo, or send a note. No pitch — we'll see if it fits.