In early 2024, a finance employee at the engineering firm Arup joined what looked like a routine video call. The CFO was there. So were several colleagues he recognised. Over the call, he was instructed to make a series of transfers. He made fifteen of them, totalling roughly $25 million. Every person on that call except him was a deepfake. The faces, the voices, the mannerisms — all AI-generated.
That case was a warning. In 2026, it has become a pattern. Deepfake fraud has moved from a novelty that security researchers demonstrated at conferences to one of the fastest-growing categories of enterprise crime. Deloitte's Center for Financial Services projects that generative-AI-enabled fraud losses in the US alone will reach $40 billion by 2027, up from $12.3 billion in 2023, a compound annual growth rate of 32 percent. Finance teams, executives, and HR departments are the primary targets.
What makes this threat different is that it defeats the instinct we have all relied on our entire lives: believing what we see and hear. A wire-transfer request from a stranger over email triggers suspicion. The same request from your CFO's face and voice on a video call does not. That gap between human trust and synthetic reality is exactly what attackers are exploiting.
This article explains how deepfake fraud actually works in 2026, why traditional controls fail against it, and a practical, layered playbook every enterprise should put in place now.
How Deepfake Fraud Actually Works in 2026
The technology has crossed three thresholds that together made this threat industrial in scale.
Voice cloning is near-instant and nearly free. In 2026, a usable voice clone can be generated from as little as three seconds of audio. Attackers harvest that audio from earnings calls, conference talks, podcasts, webinars, YouTube videos, and even voicemail greetings. The result is a real-time voice clone that can hold a live conversation, complete with the target's accent, cadence, and filler words.
Real-time video deepfakes are now possible on consumer hardware. What required a render farm two years ago now runs live on a high-end laptop. Face-swap and full-synthesis tools can put a convincing version of a known executive into a live video call, responding in real time. The quality is no longer perfect under close scrutiny, but it does not need to be. It needs to be good enough for a 90-second call where the victim is not looking for fakery.
Fraud has been automated and productised. Criminal groups now sell deepfake-fraud-as-a-service on dark web marketplaces. For a few hundred dollars, a low-skill attacker can buy voice cloning, video synthesis, spoofed caller ID, and social-engineering scripts as a bundle. This productisation is why the volume has exploded. You no longer need to be a sophisticated actor to run a deepfake scam.
The typical attack chain in 2026 looks like this: reconnaissance on the target organisation and its executives using public data and LinkedIn; harvesting of voice and face samples; creation of the synthetic media; a pretext (an urgent acquisition, a confidential deal, a regulatory deadline) that justifies secrecy and speed; and then the live contact — a call, a video meeting, a voice message — that pressures an employee into an irreversible action before they can verify.
The Five Most Common Enterprise Attack Patterns
- CEO and CFO impersonation (executive fraud). The classic and most lucrative. A finance or treasury employee receives a call or video meeting from a synthetic executive instructing an urgent transfer, often framed around a confidential acquisition or a regulatory emergency. The Arup case is the template.
- Vendor and supplier impersonation. Attackers clone the voice of a known supplier contact to authorise a change in bank details for an upcoming payment. Business email compromise meets deepfake voice confirmation, defeating the callback control many companies rely on.
- Recruitment and onboarding fraud. Deepfake candidates pass video interviews for remote roles, gaining insider access to systems. This has become a serious problem for technology companies hiring globally, with state-linked actors using synthetic identities to infiltrate organisations.
- Help-desk and credential-reset attacks. Attackers use a cloned voice of an employee to call IT support and request a password reset or MFA bypass, exploiting the human at the help desk rather than the technology.
- Investor and market manipulation. Deepfake videos of executives making false announcements to move markets, damage reputation, or manipulate stock prices. Several incidents in 2025 and 2026 targeted listed companies with fake executive statements distributed on social media.
Why Traditional Controls Fail
The reason deepfake fraud is so effective is that the controls most enterprises rely on were designed for a world where seeing and hearing were proof of identity. That assumption is now false.
- Caller ID and video presence are not identity. Caller ID is trivially spoofed. A familiar face on a video call is no longer evidence of who you are talking to. Yet most approval cultures still treat both as implicit verification.
- Email security does not cover voice and video channels. Enterprises have invested heavily in email-based phishing defence. Deepfake fraud simply moves the attack to voice and video, channels with far weaker controls.
- Callback verification is defeated by voice cloning. The standard control of calling back a known number fails when the attacker can clone the voice that answers, or when the attacker has already socially engineered a number change.
- Urgency and authority bypass process. Deepfake attacks deliberately invoke senior authority and time pressure, the two factors most likely to make employees skip verification steps. The technology amplifies a social-engineering technique that already worked.
- Detection technology is in an arms race. Automated deepfake detection tools exist and are improving, but generation technology improves just as fast. Detection alone is necessary but not sufficient. It cannot be the only line of defence.
The Layered Defence Playbook
There is no single product that solves deepfake fraud. Effective defence in 2026 is layered across process, people, and technology. The enterprises defending themselves well have built all three layers, not just bought a detection tool.
Layer 1: Process and Controls (the most important layer)
The strongest defences against deepfake fraud are procedural, because they do not depend on detecting the fake at all. They make the fake useless.
- Out-of-band verification for all high-value actions. Any transfer, bank-detail change, or sensitive action above a threshold must be verified through a separate, pre-established channel that the requester did not choose. Not a callback to a number given on the call. A verification through a known system, a pre-agreed app, or an independently sourced contact.
- Multi-person authorisation. No single individual should be able to execute a high-value transfer based on a single instruction, regardless of who appears to be giving it. Dual or triple control removes the single point of human failure that deepfakes exploit.
- Pre-agreed verification protocols and code words. Executives and finance teams agree in advance on out-of-band verification steps or challenge phrases for unusual requests. Simple, low-tech, and highly effective.
- Mandatory cool-down on urgent requests. A policy that explicitly removes time pressure: any urgent, secret, high-value request automatically triggers a slow-down and verification rather than fast compliance. Attackers rely on speed. Build friction precisely where they want none.
- Hardened vendor bank-detail change process. Changes to supplier payment details require independent verification through a channel established at onboarding, never through the channel requesting the change.
Layer 2: People and Awareness
- Train for the new reality. Employees, especially in finance, treasury, executive assistance, HR, and IT support, need to understand that voice and video are no longer proof of identity. This is a mindset shift as much as a training topic.
- Run deepfake simulations. Just as phishing simulations transformed email security awareness, controlled deepfake voice and video simulations build the instinct to verify. Several enterprises now run these quarterly.
- Make verification socially acceptable. The biggest cultural barrier is that employees feel awkward asking to verify a senior executive. Leadership must explicitly authorise and praise verification, so that an employee who slows down a CEO's request is rewarded, not punished.
- Reduce the executive attack surface. Be deliberate about how much executive voice and video is publicly available, and brief executives on how their public media is used as raw material for clones.
Layer 3: Technology
- Deepfake detection at key channels. Deploy real-time deepfake detection on video conferencing and call-centre channels where high-value interactions happen. Treat it as one signal among many, not a verdict.
- Liveness and identity verification. For onboarding, recruitment, and high-risk access, use robust liveness detection and identity verification designed to resist synthetic media, not just basic face matching.
- Voice biometrics with anti-spoofing. Where voice is used for authentication, ensure it includes anti-spoofing and synthetic-voice detection, and never rely on voice alone for high-value actions.
- Content provenance and watermarking. Adopt and require content credentials (C2PA) for official executive communications, so that genuine corporate media can be cryptographically verified and synthetic impersonations stand out by their absence of provenance.
- Monitoring for executive impersonation. Brand and executive-impersonation monitoring across social and web channels to catch deepfake videos and fake announcements quickly, before they spread.
A 60-Day Action Plan
- Days 1 to 15: Assess exposure. Map your highest-risk processes (wire transfers, vendor payment changes, credential resets, remote hiring) and your highest-risk people (finance, treasury, executive assistants, IT help desk, HR). Identify where a deepfake instruction could currently trigger an irreversible action.
- Days 15 to 30: Close the process gaps. Implement out-of-band verification, multi-person authorisation, and mandatory cool-downs on the highest-risk actions first. These procedural controls deliver the most protection for the least cost and do not depend on any detection technology.
- Days 30 to 45: Train and simulate. Roll out targeted awareness training for the highest-risk roles and run your first controlled deepfake simulation. Make verification culturally safe from the top down.
- Days 45 to 60: Add technology layers. Deploy detection, liveness, and provenance controls at the key channels, and stand up executive-impersonation monitoring. Integrate the signals into your existing fraud and security operations.
The sequence matters. Process first, people second, technology third. Enterprises that start by buying a detection tool and skip the process layer remain exposed, because no detector is perfect and the procedural controls are what actually make a successful deepfake useless.
Questions Every Board and Executive Team Should Be Asking
- Could a single employee, acting on a convincing video or voice instruction, move significant money or grant significant access in our organisation today?
- Do our high-value financial and access processes require out-of-band, multi-person verification that does not depend on caller ID or video presence?
- Have our finance, treasury, HR, and IT help-desk teams been trained specifically on deepfake fraud, and have we tested them?
- What is our incident response plan if an executive is deepfaked in a fraud attempt or a fake announcement?
- How much executive voice and video is publicly available, and have we briefed our leaders on how it is used against us?
- Are we using content provenance so that our genuine executive communications can be verified?
The Bigger Picture
Deepfake fraud is the first mass-scale example of AI being used as a weapon against enterprises rather than a tool for them. It will not be the last. The same generative capabilities driving productivity gains are available to attackers, and the gap between what is real and what is synthetic will keep narrowing.
The organisations that handle this well are not the ones with the most advanced detection technology. They are the ones that accepted early that seeing and hearing are no longer proof, redesigned their highest-risk processes around verification rather than trust, and built a culture where slowing down to verify is a sign of competence, not paranoia. In an era of synthetic media, verification is the new trust, and the enterprises that internalise that will be far harder to defraud.
How Ellvero Helps Enterprises Defend Against Deepfake Fraud
At Ellvero, we help enterprises understand and defend against AI-enabled fraud, combining deep expertise in computer vision and AI with practical security and process design. Our work in this area typically spans four pillars:
- Deepfake Risk Assessment. We map your exposure across processes, people, and channels, identify where a synthetic instruction could trigger irreversible harm, and prioritise the controls that close the highest-risk gaps first.
- Process and Control Redesign. We help you design and implement out-of-band verification, multi-person authorisation, vendor-change hardening, and cool-down policies that make a successful deepfake useless regardless of how convincing it is.
- Detection and Verification Technology. We advise on and integrate deepfake detection, liveness, voice anti-spoofing, and content provenance into your existing security and fraud operations, with the AI and computer vision depth to separate real capability from vendor hype.
- Awareness, Simulation, and Culture. We design targeted training and controlled deepfake simulations for your highest-risk teams, and help leadership build a culture where verification is expected and rewarded.
If you are concerned about deepfake fraud exposure, preparing for a board discussion on AI-enabled threats, or want an honest assessment of where your organisation actually stands, we would welcome the conversation.