In the future, the world might want international agreements to govern AI development. Such agreements are likely to include provisions about what AI chips can and cannot be used for (e.g. no undeclared models are in deployment), and verification mechanisms to ensure every side is following the rules. One proposed method for this chip use verification is network taps. These are devices that intercept traffic between the AI accelerators and any of their users, or traffic between servers, to help detect malicious data transfer. For network taps to be effective, they must intercept all or almost all communication with the AI accelerators.
I surveyed the literature on side-channels and possible mitigations to find out if—under the scenario named above—air-gapping can be made secure enough that evasion would likely be caught. The setup is a prover owning the facility and a verifier installing monitors (e.g., network taps) and sending inspectors. This prover is assumed to be a “covert adversary”1 who agrees to monitoring but will exploit any gaps they can without a high detection risk. It is the attacker who is defeated if they are caught cheating once, which reverses the “defender’s dilemma” of traditional cybersecurity, where one successful breach means defeat for the defender.
Tolerating single-digit kilobits per second of physically possible2 covert bandwidth substantially reduces the cost and complexity of the defenses needed. With this requirement, my expectation (based on a three-week research sprint) is that defense-dominance is achievable in guarded facilities (personnel, cameras, fences etc.).
I present an example of a defense architecture towards the end of this post. It leverages a synergy of cheap enclosures, sensors, jamming, unannounced hardware inspections and physical security to achieve defense-dominance at an estimated Bill-of-Materials (BOM) below 1% of the cost of the monitored cluster.
A promising idea for how to verify claims about the usage of AI-specialized hardware has emerged recently: Rather than attempting to create custom AI accelerators trusted even by mutually distrusting adversaries, one can target the communication links in and out of such hardware.
In principle, this enables a retrofittable architecture that can make many concrete claims about the usage of that AI verifiable, even if the hardware is untrusted (the network taps need to be trusted/monitored, however, which I expect to be a tractable problem). The intercepted traffic could either be within a datacenter (i.e. one server rack transferring model activations to another), or between the hardware and any users outside the datacenter.
Both kinds of traffic are rich in information that could prove claims such as the following (to an automated process preserving confidentiality, or a trusted third party):
Among other possible, governance-relevant claims.
The viability of such a verification setup (network taps + automated data analysis and compliance-check) critically depends on the network taps not being bypassed: Even if all optical fiber cables out of a cluster are tapped, data could plausibly be smuggled around them.
This is the challenge that this post addresses, with the goal of de-risking this verification approach specifically for the toughest case: Nation-state actors exploiting any physically available information channel they can, going as far as hiding hardware inside servers in a way they would expect even a trained inspector to not find.
This may seem a lost cause at first, given the extreme cyber offensive capabilities of advanced nation states, but there are reasons why I think this is tractable:
I can hardly make assumptions about the cyber-offensive capabilities of the most advanced attackers in the world4, other than that they are bound by the laws of physics. For creating robust side-channel defenses, minimal other assumptions should be made anyway.
In short, I think that none of the required defenses are blocked by the manufacturing cost of the components needed to achieve an upper bound on possible information bandwidth (around kilobits per second) in and out of server racks or pods, besides the monitored optical communication links. I explain this conclusion of my research further below, but we start with the fundamentals:
An attacker’s covert information rate is fundamentally constrained by two variables: The frequency of their signal and the signal-to-noise ratio (SNR).
Wikipedia:
In information theory, the Shannon–Hartley theorem tells the maximum rate C, at which information can be transmitted over a communications channel of a specified bandwidth B in the presence of noise. The simplified form (frequency-independent SNR) looks as follows5:
C =B * log₂(1 + SNR)
We measure relative amplitudes in decibels (dB), for example when comparing signal-to-noise. Absolute power is often measured relative to 1 milliwatt (dBm).
From the defender’s perspective, the objective is suppressing C for all physically possible transmission channels. The number of such independent channels is finite and can be covered comprehensively. I know this is a strong claim, so I appreciate adversarial feedback for the list below:
Some information channels are physically viable, but inherently bandwidth-constrained and are therefore not addressed by the defenses discussed in this post. In fact, this contributes to the argument that upper-bounding covert bandwidth is far cheaper and viable at scale than preventing all information leakage. Magnetic7, thermal, radioactive, and gravitational wave channels are limited by their frequency –and therefore channel rate– as dictated by the Shannon-Hartley theorem.
Defense-dominance is achieved when the risk-reward calculation for the adversary is negative: The expected reward of successfully evading the defenses (in this setup, orchestrating and obtaining results from secret ML workloads), must be (far) outweighed by the risk –and expected cost– of being caught.
I think that this is achievable in our specific scenario, at reasonable cost, by combining multiple layers of defense in a way that these layers reinforce each other.
The attacker’s signal power would then be far below the noise level (i.e. SNR << 1), leading to channel capacity C << B by the Shannon-Hartley theorem. Where a is the attenuation factor and Pcovert signal is the signal power achievable while remaining undetected by inspections and sensors, we have:
Psignal ≤ Pcovert signal·a.
Pnoise ≥ Pjamming.
⇒ SNR ≤ Pcovert signal·a / Pjamming.
⇒ C ≤ B · log2(1 + Pcovert signal·a / Pjamming).
Thus, to upper-bound the channel capacity at 1% of the bandwidth (i.e. to achieve C ≤ 0.01B), it would suffice to have Pcovert signal·a / Pjamming ≤ 20.01 -1 = -22dB. And to upper-bound the capacity of a 1GHz bandwidth to 1 kb/s (i.e. to achieve C ≤ 0.000001B), it would suffice to have Pcovert signal·a / Pjamming ≤ 20.000001 -1 = - 62dB. For reference, testing indicates that some existing enclosures designed to meet NSA 94-106, an NSA specification for side-channel security, already achieve -100dB of attenuation for at least electromagnetic frequencies of 100MHz to 10GHz.10
This section considers what perimeters would need to be subject to defenses.
AI datacenter hardware is hierarchical: individual accelerators comprise nodes, nodes gather in racks, racks assemble into so-called pods or “Scalable Units” (SU, NVIDIA terminology). Some vendors even consider assemblies of pods/SUs as “SuperPODs”.
The verifier has two kinds of communication to monitor: 1) The prover/outside world communicating with the cluster, and 2) the cluster communicating internally.
For inference, pods are reasonably the largest individual units needed11. Bandwidth in-and-out can be surprisingly low: Apart from occasional file transfers, customer service even at thousands of tokens per second per GPU requires only single-digit Gbps and can be monitored cheaply on the so-called in-band fabric, with no other optical fiber links linking a pod to the rest of the datacenter. However, hardware utilization is typically optimized by high-bandwidth communication (of KV caches) between pods using specialized hardware for prefill and decode stages, making the smallest computing unit a pipeline of two pods12. This could be sized down though, since both such pods would usually contain multiple prefill and decode units each.
Training can require extremely high-bandwidth between pods for gradient all-reduce and model checkpoint storage, but an attacker can exploit far lower covert bandwidth at moderate cost to model performance per FLOP.
The finer-grained the perimeters, the smaller the compute units that can communicate unconstrained and unmonitored internally. The drawback is the cost-difference between e.g. per-pod perimeters and per-rack perimeters.
If a randomly sampled batch of devices (sample size n) is checked and every check finds a flaw if there is one, the random sample can upper-bound the total fraction of flawed devices (p) with a certain probability P(detect ≥ 1)
P(detect ≥ 1) = 1 − (1 − p)n

There are levels of inspection, each one with their own tradeoff between thoroughness and invasiveness.
A superficial inspection is the least invasive, visually or with dedicated measuring devices. It can catch blatant violations, such as leakage in RF enclosures. The inspector turns on a jammer inside and measures the leakage from the outside. All verifier-supplied components can be designed to be easily inspectable. For instance, a power filter's or CLT device’s electronics could be embedded in transparent epoxy resin, filled with reflective particles whose exact distribution is known to the auditing body.13
At this shallow level, a majority of a datacenter’s pods could be checked.
Opening or disassembling a server inevitably introduces downtime, but can catch devices hidden inside. The most egregious supply chain attacks can bury malicious hardware deep in superficially unsuspicious hardware, but any commodity hardware beyond agreed specs can be caught this way. Example: Hard drives the attacker did not declare, which may be used to evade scheduled memory wipes.
X-rays can spot deviations from official design spec. Comparing component internals against a known-good reference is already established practice in counterfeit electronic parts detection (SAE AS6171), and is mandated for AI security contexts by the SL5 Standard (controls SR-10 and SR-5(2): "radiographic examination, material analysis, and electrical testing"), following NIST SP 800-161 Rev. 1 supply chain risk management guidance.
Destructive teardowns could deter even egregious supply chain attacks in the best hiding spots.
I am not sure, however, whether this is defense-dominant or offense-dominant. I can only state my intuition that a fully destructive teardown by a capable inspection agency would go a long way towards changing the risk-reward calculation for the attacker. If the inspecting body is allowed to take hardware with them, they have practically unlimited time to find anything suspicious.
Destructive analysis can be selective. Some components are cheaper to replace than others, while being the most critical potential hiding spots.
I can not give confident recommendations here, only my intuition that intelligence services would know what to look for in inspections, or whether any deviation from an official design spec14 could be hidden from them. Off-site inspections do however have an element of psychological deterrence, as the attacker must make unfalsifiable assumptions about what the inspector would miss.
The example defense setup below draws perimeters around 8-rack NVIDIA Blackwell SUs15, but aims many of its defenses against both inter-rack and inter-SU side-channels.
In short, one can combine the following layers of defense:
Let us consider all conductors that can carry electrical signal across racks, or out of the enclosure:
The last two are trivially solved by breaking conductive contact between racks and enclosures. Metal pipes and the cooling water inside can be grounded to the enclosure walls, ideally not right next to each other.
Combating all physically possible communication in power cables, while still keeping insertion losses at a minimum, is more challenging: Low-pass filters for the required currents and voltages are readily available on the market, but they inevitably have a frequency passband up until at least one order of magnitude over the power line’s frequency (~60Hz), otherwise they will introduce significant power loss. Additionally, crude LC filters leak low amplitudes of high-frequency signals as well.
Technically, this is a solved problem for the stop-band: Vendors like Astrodyne TDI, Premier Filters, MPE, and Signal Defense sell filters tested against the NSA’s classified specs (referred to via the “TEMPEST” codename). The components of such devices are not exotic (in fact they are already mass-manufactured), and sceptical readers can convince themselves of this via the readily available technical documentation of such filters.18
This only addresses leakage in the stop-band though, and commercial TEMPEST-grade filters typically leave a passband in the kHz range19 (page 3). Here, CLT is a cheap defense that can push Shannon capacity down to tens of bps per cable. The primary cost drivers of a CLT device would not be the signal generation or processing hardware, but any electronics contacting the high-voltage cable. Both are commodity hardware though, and readily available for cheap.
Against the range of possible channels discussed in the side-channel survey above, the defender can use either isolation, deterrence, or both.
An isolation strategy would be to not allow any prover-produced photons to leave the enclosure. This means verifier-controlled Optical-Electrical-Optical (OEO) conversion at the boundary. If the enclosure fully contains a self-sufficient inference unit (including prefill and decode stages), such OEO conversion can be as cheap as a handful of 200G or 400G transceivers for in-band links, while the more expensive compute and storage fabrics can remain unmodified within the perimeter. For the most challenging case, which is pre-training, this would mean hundreds of high-performance transceivers per pod, which would be within single-digit percent cost of the monitored servers.20
Deterrence can either work by inspecting optical hardware for un-specified components or sensitivities, or by not specifying the sensing capabilities of the verifier's passive optical taps. Where OEO conversion is deemed too costly (or constrained by transceiver supply), passive taps can observe multiple optical links with one sensor, by randomly routing/shuttering across many (e.g. with optical circuit switches). This sensor could detect suspicious signals outside of the usual spectra and modulation, but importantly, the exact capabilities could be made unknown to the prover, which creates psychological deterrence.
ILT is particularly effective here, as sound waves propagate much slower than RF and the electronic’s required latency is much less extreme than for RF ILT. CLT is also viable. The RF enclosure and metal sheets between racks effectively shield ultrasound as well21: Penetration through metal is poor, and around double-right angles as well. Air vents of neighbouring enclosures should not face each other, but this is usually done for airflow reasons anyway.
Lastly, (ultra)sonic transducers have no legitimate presence in ML server hardware and would need to be exposed to air to be effective (for the airborne channel), which makes them difficult to hide from inspectors.
As explained in the side-channel overview above, (some) solids can carry ultrasound at high bandwidth, over large distances. Generally, transmission works best through uninterrupted, uniform material. This makes defense trivial: Air between racks, vibration isolation underneath.
That leaves coolant pipes, which I address below.
Both the coolant pipes and the cooling water can carry ultrasound. One way to interrupt a water column is by letting it fall in free air. A water tank inside the enclosure could be made from transparent plastic. An added benefit is that this would also prevent cabling hidden in coolant lines. Grounding the water inside the tank against the enclosure also negates the conducted channel through water.
The attacker may try tunnelling under the datacenter and drilling holes into servers from below. This can be made evident by placing a tempered glass plate under each rack, which explodes when drilled into. A metal plate between the glass and the rack also shields RF signals.
The prices listed below assume economies of scale, for worldwide deployment in AI clusters.

Bottom line: The BOM is likely dominated by power filters and perhaps RF jamming+monitoring electronics. I do not expect the BOM of the enclosure to be the main cost driver, when factoring in the cost of physical guarding and inspection.
Reliably enforcing access controls requires staff around the facility, likely even trained guards. Destructive teardown inspection of racks at a sample size of 10 would disassemble or destroy tens of millions of dollars of hardware each time.
Given the covert adversary threat model and the softened requirement of upper-bounding covert bandwidth to kbps rather than eliminating it entirely, I think that a multi-layered setup of cheap enclosures, filters, monitors and inspection can be deployed in datacenters quickly and efficiently. Before trusting a design such as the preliminary example setup I proposed above, potential members of an AI agreement will probably want to build and red-team a prototype. There are some remaining uncertainties. For example, I expect the main cost driver of the BOM to be power filters, but pinning down how much exactly mass manufacturing would cost has proven difficult.
In a rushed, time-sensitive scenario, one would first prioritize the side-channels that are high-bandwidth without purposefully hidden, malicious hardware. I expect that the most critical side-channel would be RF emanation from stock components and conducted channels in power lines. As the defender suspects the adversary to go to increasing lengths to evade monitoring, and as manufacturing of components such as power filters ramps up, more defenses can be added incrementally.
The actual monitoring of the computing hardware likely requires retrofitted devices, which I expect to be the most effective on optical links, i.e. via network taps. While these come with their own technical challenges, I hope that this work contributes to advancing defenses against side-channels evading such monitors.