Updated on December 10, 2025
Compilation as defense: using tensor optimizations to confuse attackers
This work shows how applying compiler-driven tensor optimizations can cut side-channel model reconstruction success by up to 43 percent without redesigning architectures.
Key Takeaways
  • Side-channel accelerator attacks like DeepSniffer can reconstruct model architectures by watching low-level kernel behavior.
  • We use TVM and AutoTVM to apply tensor optimizations to ONNX models such as ResNet18, DenseNet121, YoloV4, and RoBERTa.
  • After 500 optimization trials, attack fidelity drops by up to roughly 43 percent with no manual architecture changes.
  • This approach trades GPU time for improved robustness and hints at “moving target” defense strategies for AI workloads.

When people talk about adversarial machine learning, they often focus on inputs and outputs. But there is another class of attacks that never touches the model’s visible interface. Instead, they watch the hardware.

Side-channel accelerator attacks do exactly that. By monitoring kernel-level metrics such as cache reads and writes during inference, tools like DeepSniffer can infer which sequence of operators a model is running and reconstruct a close approximation of the underlying architecture.

For organizations that treat their architectures as trade secrets, or that share accelerators in multi-tenant environments, this is a serious concern. The question is how to defend without rewriting every model from scratch.

The “Compilation as a Defense” work explores a promising option. Instead of changing model architectures, we use tensor optimization within the TVM compiler stack to change how those architectures are implemented at the kernel level. By doing so, we reduce the effectiveness of side-channel attacks while also improving runtime performance.

How side-channel model extraction works

In the DeepSniffer style of attack, an adversary monitors hardware-level traces as a model runs on a GPU. Every operator, such as convolution or pooling, generates a characteristic pattern of memory accesses and cache usage. By learning to associate those patterns with specific operators, the attacker can reconstruct the sequence of layers that make up the model.

The attack does not rely on access to training data or weights. It only needs visibility into low-level kernel metrics, which can sometimes be obtained in shared environments or via performance tooling that was not designed with security in mind.
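
As a deliberately simplified illustration (this is not DeepSniffer itself, and the feature values below are invented), the sketch shows the core idea: summarize each observed kernel by a few metrics, classify those summaries into operator types, and read off a partial layer sequence.

```python
# Toy illustration only: a nearest-neighbor "operator classifier" over per-kernel
# trace features. DeepSniffer's real pipeline is more sophisticated; the numbers
# below are made up for the example.
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical reference kernels: (duration in microseconds, DRAM reads, DRAM writes).
reference_features = [
    [120.0, 4.1e6, 1.0e6],  # conv2d
    [15.0, 2.0e5, 2.0e5],   # pooling
    [95.0, 3.3e6, 8.5e5],   # conv2d
    [25.0, 6.0e5, 6.0e5],   # elementwise (relu/add)
]
reference_labels = ["conv2d", "pool", "conv2d", "elementwise"]

clf = KNeighborsClassifier(n_neighbors=1).fit(reference_features, reference_labels)

# A trace observed from a victim model is decoded kernel by kernel, recovering a
# partial operator sequence without ever touching weights or training data.
victim_trace = [[110.0, 3.9e6, 9.5e5], [14.0, 1.8e5, 1.9e5], [27.0, 5.7e5, 5.9e5]]
print(clf.predict(victim_trace))  # -> ['conv2d' 'pool' 'elementwise']
```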

Defenses in the literature often propose larger architectural changes or framework modifications. While useful, these approaches can be costly for engineering teams that have already invested heavily in particular models and pipelines.

Using compilation to blur the picture

Instead of changing the model itself, this research takes aim at the implementation layer. Modern deep learning frameworks rely on shared libraries like cuDNN for core operator implementations. These libraries produce predictable kernel behavior, which is exactly what side-channel attacks exploit.

We take ONNX versions of four popular architectures:

  • ResNet18
  • DenseNet121
  • YoloV4
  • RoBERTa

We then feed these models into TVM, an optimizing compiler for deep learning workloads, and apply AutoTVM’s tensor optimization routines. AutoTVM uses simulated annealing and a learned cost model to generate many candidate schedules for each operator, selecting those that minimize runtime on a target accelerator.
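
As a rough sketch of what this flow looks like in code (the file name, input shape, and input tensor name below are illustrative assumptions rather than details from the paper), an ONNX model is imported into TVM's Relay IR, tunable tasks are extracted, each task is tuned, and the module is rebuilt with the best schedules found:

```python
# A minimal sketch of the standard AutoTVM flow, assuming a CUDA target and a
# ResNet18 ONNX file whose input is named "input" with shape (1, 3, 224, 224).
import onnx
import tvm
from tvm import autotvm, relay

onnx_model = onnx.load("resnet18.onnx")
shape_dict = {"input": (1, 3, 224, 224)}
mod, params = relay.frontend.from_onnx(onnx_model, shape_dict)

target = "cuda"

# Extract the tunable operator tasks (convolutions, dense layers, and so on).
tasks = autotvm.task.extract_from_program(mod["main"], target=target, params=params)

measure_option = autotvm.measure_option(
    builder=autotvm.LocalBuilder(),
    runner=autotvm.LocalRunner(number=10, repeat=1, min_repeat_ms=100),
)

log_file = "resnet18_tuning.log"  # hypothetical log path
for task in tasks:
    # XGBTuner pairs a learned cost model with simulated-annealing search.
    tuner = autotvm.tuner.XGBTuner(task)
    tuner.tune(
        n_trial=500,  # the study varies this from 0 (baseline) up to 500
        measure_option=measure_option,
        callbacks=[autotvm.callback.log_to_file(log_file)],
    )

# Rebuild the model using the best schedules discovered during tuning.
with autotvm.apply_history_best(log_file):
    with tvm.transform.PassContext(opt_level=3):
        lib = relay.build(mod, target=target, params=params)
lib.export_library("resnet18_tuned.so")
```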

From a security standpoint, the key idea is simple. If you change how operators are scheduled, tiled, and fused, you change the observed kernel-level behavior. That, in turn, makes it harder for a side-channel classifier trained on standard implementations to correctly infer the architecture.
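
The small tensor-expression example below makes this concrete. It uses a generic matrix multiply rather than one of the paper's operators, and lowers the same computation with two different tile sizes; the resulting loop nests, and therefore the memory-access patterns a profiler would observe, differ even though the math is identical.

```python
# Same computation, two schedules: changing the tile factor changes the lowered
# loop nest, and with it the kernel-level memory-access behavior an attacker sees.
import tvm
from tvm import te

N = 1024
A = te.placeholder((N, N), name="A")
B = te.placeholder((N, N), name="B")
k = te.reduce_axis((0, N), name="k")
C = te.compute((N, N), lambda i, j: te.sum(A[i, k] * B[k, j], axis=k), name="C")

def lowered_with_tile(factor):
    s = te.create_schedule(C.op)
    # Tile the two output axes; the factor controls blocking and data reuse.
    s[C].tile(C.op.axis[0], C.op.axis[1], x_factor=factor, y_factor=factor)
    return tvm.lower(s, [A, B, C], simple_mode=True)

# Comparing the two lowered forms shows different loop structure for identical math.
print(lowered_with_tile(8))
print(lowered_with_tile(64))
```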

Measuring the impact on DeepSniffer

To quantify this, we:

  1. Compiled each model with varying numbers of AutoTVM optimization trials, from a baseline of zero up to 500.
  2. Ran inference in the TVM runtime while collecting kernel metrics using the Nsight Systems profiler (a minimal runtime sketch follows this list).
  3. Fed these traces into DeepSniffer and measured how closely its predicted architecture matched the real one, using a fidelity metric from zero to one.
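
A minimal sketch of step 2 is shown below, reusing the (assumed) artifact and input name from the earlier compilation example; the script would then be launched under Nsight Systems, for instance with `nsys profile -o trace python run_inference.py`, to capture the kernel-level trace.

```python
# run_inference.py: load the tuned module and run inference in the TVM runtime so
# that an external profiler can observe the kernel metrics. The artifact name and
# input tensor name follow the assumed earlier example.
import numpy as np
import tvm
from tvm.contrib import graph_executor

lib = tvm.runtime.load_module("resnet18_tuned.so")
dev = tvm.cuda(0)
module = graph_executor.GraphModule(lib["default"](dev))

# Random input matching ResNet18's expected shape; realistic inputs are not
# needed just to exercise the kernels.
x = np.random.uniform(-1, 1, size=(1, 3, 224, 224)).astype("float32")
module.set_input("input", x)
module.run()
out = module.get_output(0).numpy()
print(out.shape)
```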

The results show a clear trend. As the number of optimization trials increases, DeepSniffer’s fidelity drops. For example:

  • DenseNet121’s fidelity falls from about 0.1678 in the baseline to 0.1013 after 500 trials, a reduction of roughly 39.6 percent.
  • YoloV4’s fidelity falls from about 0.2220 to 0.1259, a reduction of roughly 43.3 percent.

RoBERTa behaves differently, in part because its heterogeneous language model operators were already unfamiliar to DeepSniffer’s classifier. But across the vision models, the trend is consistent. More diverse and optimized schedules make the attack less successful.

Tradeoffs and costs

There is no free lunch. AutoTVM’s search process is computationally expensive. Generating and benchmarking large numbers of candidate schedules required around 83 GPU hours across all models and trials in the study.

However, this cost is paid once per deployment configuration, not per inference. After the compiler finds an optimized schedule, the resulting model not only becomes more robust against this specific side-channel attack but also runs faster.

For operators of high value models, that can be an acceptable trade. You spend GPU time up front to buy both performance gains and additional security margins.

Toward moving target defenses

One of the more interesting ideas in the discussion is the concept of using compilation as part of a moving target defense. Instead of treating optimization as a one time step, defenders could:

  • Periodically recompile models with new tensor schedules after detecting suspicious activity (a rough sketch follows this list).
  • Focus optimization on operator types that contribute most to side-channel success, in order to shift the most informative parts of the trace.
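
Purely as a sketch of how the first idea might be realized (nothing here is prescribed by the paper; the tuning-log names and the deploy() hook are hypothetical), a defender could keep several independently tuned schedule sets and rebuild from a randomly chosen one whenever rotation is triggered:

```python
# Speculative sketch: rotate among several independently produced AutoTVM tuning
# logs so the deployed kernels, and hence the side-channel trace, keep changing.
import random

import tvm
from tvm import autotvm, relay

def rebuild_with_rotated_schedules(mod, params, target, tuning_logs):
    """Pick one of several AutoTVM tuning logs at random and rebuild the model."""
    chosen_log = random.choice(tuning_logs)  # e.g., logs from separate tuning runs
    with autotvm.apply_history_best(chosen_log):
        with tvm.transform.PassContext(opt_level=3):
            return relay.build(mod, target=target, params=params)

# Rotation trigger, e.g. after monitoring flags suspicious profiling activity:
# lib = rebuild_with_rotated_schedules(mod, params, "cuda", ["run_a.log", "run_b.log"])
# deploy(lib)  # hypothetical deployment hook that swaps in the new artifact
```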

This would force adversaries to constantly retrain their side-channel classifiers and would shrink the window in which any particular model of kernel behavior remains valid.

From Mindgard’s perspective, this aligns with a broader principle. Many of the most effective security strategies in traditional computing rely on making systems less predictable over time: address space layout randomization, key rotation, or dynamic sandboxing. Applying similar thinking to AI workloads via compilers is a natural extension.

What this means for practitioners

For teams responsible for securing accelerator-heavy AI workloads, this research offers several practical takeaways:

  • View compilation as a security control
    Do not treat the compiler stack as solely a performance tool. Configuration choices in TVM or similar systems can meaningfully change how observable and reconstructable your model is from low-level traces.
  • Harden high value models first
    Side-channel defenses are most important for proprietary architectures that run on shared infrastructure or that you expect to be attractive IP targets.
  • Combine with higher layer protections
    Compilation will not stop other forms of extraction or adversarial abuse. It should complement access control, monitoring, and adversarial testing that focus on the input-output surface.

Side-channel attacks remind us that AI models are not just mathematical functions. They are software artifacts running on complex hardware stacks, and every layer leaks some information. Work like this shows that defenders can use the same toolchains that power performance to quietly reshape what an attacker sees.

Read the full paper on arXiv