Fraud Detection 2026

Fraud detection has been an ML success story for years. The 2026 wave shifts the landscape: transformer-based sequence models compete with classical gradient-boosting baselines, and graph neural networks address the parts neither handles well. The architectures that ship now combine all three.

What’s actually working in production fraud detection.

The classical baseline: tabular ML on engineered features#

XGBoost / LightGBM / CatBoost on carefully engineered features (transaction amount, velocity, merchant category, device fingerprint, geographic patterns) remains the production baseline for most fraud workflows.

Strengths: explainable, fast, well-understood operationally, predictable cost.

Limitations: misses temporal patterns longer than the feature window, misses relationships between entities, misses subtle behavioral signals.

This is still the right starting point. Don’t skip it.

Transformer models on event sequences#

Sequence models (LSTM, Transformer, sometimes the same architectures used for language) trained on user/account/device event sequences detect fraud patterns that tabular features miss:

Bot-driven activity with characteristic timing patterns
Account takeover sequences across multiple events
Long-term behavioral drift before fraudulent action

Strengths: capture temporal dependencies; learn features automatically.

Limitations: harder to explain individual decisions, higher compute cost, more sensitive to training-data quality.

Graph neural networks#

Fraud involves networks of entities — accounts, devices, addresses, merchants, IPs. Graph neural networks (GNNs) learn from the relationships, not just the entity attributes.

Best for:

Synthetic identity fraud (clusters of fake accounts)
First-party fraud rings
Money mule networks
Merchant collusion

Strengths: capture relational signals invisible to tabular or sequence approaches.

Limitations: graph construction is non-trivial, scaling to billions of edges is engineering work, explainability is hard.

The hybrid architecture#

The 2026 production pattern we see at credible institutions:

Fast path: tabular ML for sub-100ms decisions on every transaction
Behavioral path: transformer model on the account’s recent sequence, scoring slightly higher latency
Network path: GNN run on a scheduled basis (hours to daily) producing entity-level risk scores
Ensemble layer: combines all three plus rules
Human review for the ambiguous middle band

Each model has a specific job. Ensembling is where the production gains come.

The compliance layer#

Fraud detection sits alongside AML and broader financial-crime compliance. The architecture must handle:

Customer notification requirements (timing varies by regulation)
Adverse-action analogs where a decline is also a compliance event
Suspicious activity reporting (SAR / STR) workflows
Recordkeeping requirements (often 5+ years)

Our data engineering practice builds fraud architectures that satisfy these.

Where AI doesn’t (yet) earn its place#

Eliminating human review. Even the best models have a ~5% review band where human judgment is required.

Replacing rules entirely. Hard-rule patterns (sanctions lists, known-bad devices) execute before ML; you don’t want a model deciding to score a sanctioned entity.

Continuous learning without oversight. Models that update online without governance produce drift you don’t want.

What we ship for banks and fintechs#

For fraud engagements:

Hybrid architecture matched to the institution’s risk profile
Tabular baseline first
Transformer and GNN augmentation as the program matures
Explainability layer for human reviewers
Compliance integration with SAR workflows
Performance monitoring and drift detection

The cost reality#

Fraud detection is one of the few AI use cases where the model’s quality directly maps to dollars saved or lost. Marginal improvement in detection at constant false-positive rate maps to seven or eight figures of avoided losses for mid-large institutions.

Investment in the architecture pays back; under-investment quietly bleeds.

Production fraud detection in 2026 is hybrid — tabular + sequence + graph + rules + humans. Our team builds fraud architectures that ship and hold up. Tell us about the program.

The classical baseline: tabular ML on engineered features#

Transformer models on event sequences#

Graph neural networks#

The hybrid architecture#

The compliance layer#

Where AI doesn’t (yet) earn its place#

What we ship for banks and fintechs#

The cost reality#

Related posts.

Banking AI Roadmap: What to Build First in 2026

Climate Risk Analytics for Finance and Insurance

Insurance Underwriting AI: From Rules to Learned Models