Tech Reads
Operations7 min read

AP/AR Automation: The Reality Behind the 80% Reduction Claims

One of our clients spent eight months stuck at 67% automation after a vendor promised 80%. That plateau is not a bug — it is the natural ceiling for AP automation deployed without confronting the hard 20% of invoices that no demo ever shows you.

Where the 80% figure comes from

The 80% processing time reduction claim has a real source. It comes from studies on straight-through processing rates for standard invoices — clean PDFs with a single PO line, a vendor that exists in your supplier master, an amount that matches to the cent. For that document type, modern OCR plus rules-based matching does reduce manual touchpoints by 75–85%. The studies are not fabricated.

The problem is what “standard invoice” actually means in practice. Vendors define it as the documents their system handles well. Then they measure reduction rates on those documents. The rest — disputed amounts, multi-currency PO matching, handwritten amendment notes, partial delivery invoices — those do not appear in the case study. They are not excluded intentionally. They just never make it into the denominator.

So the claim is technically accurate and practically misleading. You will get 80% reduction on the invoices that were already closest to automated. The ones that cost you time and headcount? Those are the ones that do not move.

72–75%the realistic automation ceiling for enterprise AP without significant process change — not the 80–90% in vendor materials

What the hard 20% actually looks like

We got this wrong on the first two projects. We scoped the automation based on a sample of recent invoices the client provided — which, naturally, were the clean ones they could pull quickly. Then we went live and met the real document corpus.

The hard documents fall into a few recurring categories. First: disputed invoices where the supplier has billed for a quantity that does not match the goods receipt, and someone has annotated the paper copy or replied to an email thread with a revised figure. That revised figure is not in your ERP. It exists in an inbox or a file share. Automation cannot resolve it without access to that context.

Second: multi-currency orders. A PO raised in USD, invoiced in EUR, with a spot rate that does not match the hedge rate on the contract. Your three-way match fails not because the data is wrong but because the currency conversion logic is ambiguous. Rules-based systems do not handle ambiguity — they reject and escalate.

Third: partial deliveries. A supplier invoices for 60% of a 10-line PO. Some lines are fully received. Some are not received at all. One line was received but the quantity was wrong. Each of those sub-scenarios requires different handling, and the permutations compound quickly. At 500 invoices per day, even a 15% exception rate is 75 documents that need a human.

The three-way match problem at scale

Three-way match — comparing the purchase order, the goods receipt, and the supplier invoice — is the backbone of AP controls. At low volume it is manageable manually. At scale it is where automation either proves its value or exposes its limits.

The match logic itself is not complicated. What breaks at scale is the quality of the inputs. GR notes entered with wrong quantities. POs that were amended verbally without an ERP change order. Invoice line items that do not map cleanly to PO line items because the supplier uses different product codes. At 200 invoices per day these discrepancies are manageable. At 2,000 per day, a 3% discrepancy rate means 60 daily exceptions — and each one requires someone with context to resolve it.

The failure mode we see most often is not that the automation is bad at matching. It is that the data quality in the ERP was never good enough to support high-accuracy matching in the first place. Automation makes that visible fast. Before automation, a human could fill in gaps with memory and institutional knowledge. The system cannot.

Watch out

If your PO data quality is poor going into an AP automation project, expect a 4–6 week data remediation phase before your exception rate drops to a manageable level. Scope this explicitly or it will consume your ROI timeline.

Exception handling: the part no vendor demo shows

Every AP automation vendor demo runs the same three invoices. Clean PDF, single PO line, exact match. The demo takes four minutes and the automation rate is 100%. Then you go live.

The question that actually determines your ROI is not what happens to the clean invoices. It is what the system does with exceptions. Does it route to a queue? Who owns the queue? What information does the reviewer see? Can they resolve the exception in the tool or do they have to go back to the ERP? How long does an exception sit before it escalates? What happens to payment terms while an invoice is in exception?

Most implementations we have reviewed either route everything to a generic “exceptions” queue with minimal context, or they route to email, which means the exception is now untracked. Neither approach is acceptable at volume. A well-designed exception workflow requires the same engineering attention as the happy-path automation. It usually gets a fraction of the investment.

In SAP-based environments, the exception interface is almost always worse than the main AP workflow. In NetSuite, the customisation required to build a good exception queue adds 3–5 weeks to implementation timelines. Plan for it.

8 monthshow long one client was stuck at 67% automation before exception handling was properly redesigned — the plateau almost every enterprise AP project hits

A real case: the 67% plateau

A manufacturing client with around 1,800 invoices per month went live with AP automation in Q1 of last year. Initial results were strong — they moved from 15% straight-through processing (almost everything had a human touchpoint) to 67% in the first six weeks. The vendor declared success. The client was pleased.

Then they stayed at 67% for eight months. The remaining 33% of invoices — roughly 600 per month — required human intervention. The AP team had not shrunk. They had just changed what they spent their time on. Instead of data entry, they were resolving exceptions. The workload was different. The headcount was the same.

We came in at month nine. The exceptions clustered into four types: partial delivery invoices (31% of exceptions), multi-currency mismatches (24%), invoices from three specific high-volume suppliers who used non-standard layouts that the OCR model had never been trained on (28%), and disputed line items with supporting email threads (17%).

The fix required three things. A targeted fine-tuning pass on the three supplier layouts. A currency conversion rule that pulled the hedge rate from a treasury API rather than using the invoice rate. And a partial delivery matching rule that allowed configurable tolerance thresholds by supplier category. After implementation: 84% straight-through processing. The remaining 16% are genuine edge cases that require human judgment — and that is fine.

The 67% plateau is not a failure. It is where most AP automation lands without deliberate exception engineering. It just does not look like the 80% the vendor promised.

The 80% reduction claim selects its own denominator. Any vendor can hit 80% if they get to decide which invoices count. The number that matters is your straight-through processing rate across your full document volume — including the messy suppliers, the amended POs, and the disputed line items.

What realistic AP automation ROI actually looks like

Let's run real numbers. A company processing 2,000 invoices per month at an average manual cost of $12 per invoice (a conservative figure that includes AP staff time, approval routing, and ERP data entry) spends $24,000 per month on AP processing. That is $288,000 per year.

At 80% automation, assuming the vendor's figure holds, you are processing 1,600 invoices automatically at roughly $1.50 each and 400 manually at $12 each. Monthly cost: $2,400 + $4,800 = $7,200. Annual saving: ~$249,000. That is a compelling ROI and it is why the pitch works.

Now run it at 67% — the realistic plateau before exception engineering. You process 1,340 invoices automatically and 660 manually. Monthly cost: $2,010 + $7,920 = $9,930. Annual saving: ~$169,000. Still good, but 32% below the headline figure. And that assumes no exception handling headcount. If you needed to hire one additional exceptions analyst at $55,000 per year, your net saving is $114,000 — about 40% of what the vendor's ROI model predicted.

The ceiling for most enterprise AP automation deployments, without significant process re-engineering and supplier onboarding, sits at 72–75% straight-through processing. Not 80–90%. Scope your business case accordingly.

How to evaluate an AP vendor honestly

The vendor demo will be perfect. Here is how to stress-test it before you sign.

Pull your last 90 days of invoices. Not a curated sample — all of them. Sort by exception rate: which suppliers generate the most manual touchpoints today? Send those invoices to the vendor and ask for a processing demo on that subset specifically. Most vendors will push back. The ones who do not are the ones worth talking to.

Ask what the system does when a three-way match fails within a 2% tolerance vs a 15% tolerance. Ask whether exception routing is configurable by supplier, by invoice amount, or by exception type. Ask how long an invoice can sit in exception before it auto-escalates to a manager. Ask whether you can see the confidence score the OCR model assigned to each extracted field.

Our take

The vendors who have genuinely solved exception handling can answer these questions in detail without checking with the product team. The ones who cannot have a good UI on top of an exception queue they have not thought through.

Finally: ask for a reference from a client who has been live for at least 18 months and processes more than 1,500 invoices per month. Ask that reference what their actual straight-through processing rate is. The answer tells you more than any case study.

Share

Related reading