Active Directory security · pre-registered study

The detection illusion

Before running anything, we registered a specific prediction: that more than 30% of successful Active Directory attacks would slip past native Windows logging. We then executed real Kerberoasting and credential-use attacks across eight defensive configurations on a live forest. None of the successful attacks evaded the native logs. Our prediction was wrong, and we are reporting it as a null result. The detection gap turns out to be real but located differently than we expected: it lies between a monitored network and an unmonitored one, not between native logs and a SIEM.

execution 80 real attacks on a live forest signal Windows events 4769 and 4624 result pre-registered null

01The prediction

It is widely believed that many successful attacks are invisible to default logging — that a network can be compromised without leaving a usable trace. We thought this was likely true even for common techniques, so we wrote it down in advance: more than 30% of successful attacks would evade native Windows event logs. Stating the threshold beforehand is what lets a negative result mean something.

02How we measured it

We executed two real attacks — a Kerberoasting service-ticket request (ATT&CK T1558.003) and an SMB credential-use logon (T1078) — across eight held-out defensive postures on a live multi-domain forest. For every attack that succeeded, we recorded whether the corresponding Windows Security event fired: event 4769 for the Kerberoast, event 4624 for the credential-use logon. We scored the same attacks under four detection configurations.

config Ano monitoringthe baseline: nothing is collected, so nothing can be seen.

config B / Dnative event logsWindows events 4769 and 4624, read directly from the live domain controllers.

config CWazuh SIEMa security-monitoring layer with tuned rules over the same events.

figure 1 the four detection configurations. all four observe the same attacks; they differ only in what is collecting and watching the events.

03What we found

Of 30 successful attacks under native logging, all 30 were recorded. The native detection gap was zero. With no monitoring at all, by definition every successful attack was invisible. The contrast that matters is between those two configurations, not among the monitored ones.

100%A · unmonitored
every successful attack invisible

0%B · native event logs
successful attacks that evaded

0%C · Wazuh SIEM
detected all 45 successful attacks

0%D · detect-aware
native logging at full coverage

detection gap by configuration, among successful attacks
configuration	successful	detected	evaded	% evaded
A · no monitoring	30	0	30	100%
B · native event logs	30	30	0	0.0%
C · Wazuh SIEM	45	45	0	0.0%
D · detect-aware (full)	15	15	0	0.0%

Why does the gap close so completely? Both techniques are intrinsically loud in the Windows Security log. A Kerberoast requests a service ticket, and the domain controller records event 4769 every time. A credential-use logon records event 4624. There is no quiet variant in this attack set: when delegation is hardened to AES-only encryption, the Kerberoast does not become stealthy — it simply fails, producing no crackable ticket, so it never enters the "successful but undetected" category at all.

04Native logs versus a SIEM

To complete the comparison, we deployed a Wazuh SIEM — a manager and indexer on the forest, agents on the domain controllers, and rules keyed to the relevant accounts — and re-ran the attacks with both detection sources recording each execution at once. This gives a direct, same-execution comparison rather than two separate runs.

Wazuh detected every one of the 45 successful attacks, the same zero-percent gap as the native logs. The two sources agreed on 95.6% of executions. In every case where they disagreed, Wazuh was the one that caught the event: on two credential-use logons, its continuous collection recorded an event that the native point-in-time query happened to miss. The reverse never occurred.

0%Wazuh detection gap
45 of 45 successful attacks caught

95.6%native–Wazuh agreement
on the same executions

2caught by Wazuh, missed by native
continuous collection vs one-shot query

0caught by native, missed by Wazuh

The difference between the two sources is therefore not about what the logs contain — both see the same Windows events — but about how the events are collected. A SIEM that ingests continuously is a more forgiving collector than a single query run against a live log. The choice of detection source did not open a gap; it narrowly closed a collection-timing one. This is consistent with our broader reading of the result: the events exist, and the operational question is whether anything is collecting and watching them.

note

Config C was not part of the initial run — the SIEM had not yet been deployed — and was added afterward on the same forest with the same postures, which is why we present it as a direct same-execution comparison. One honest detail: the follow-up run recorded 45 successful attacks rather than the original 30, because Active Directory's NTLM grace period lets a just-rotated credential continue to authenticate for a short window, so several credential-rotation environments that blocked the attack the first time allowed it on the re-run. This does not change the detection result — every successful attack was caught — but it is worth recording.

05What this means

We do not read this as "detection is solved." Two techniques is a small sample, and both are unusually loud. The useful conclusion is narrower and, we think, more accurate than the common framing: for these attacks, the native Windows logs are not the weak link. The weak link is operational. The events are written, but only an organization that is collecting and alerting on them benefits. The unmonitored configuration — the one that misses everything — is also the common one in practice. That is where the illusion lives: in believing one is covered because the logs could see the attack, when nothing is watching them.

06Limitations

scope

We measured two high-signal techniques, Kerberoasting and credential-use. The zero-percent gap is specific to them and should not be generalized to stealthier techniques such as directory-replication abuse, token theft, or pre-authentication attacks.

scope

This is a lab forest and native or Wazuh logging, not enterprise scale or production endpoint detection. A real environment has the log volume and alert fatigue that turn a clean technical result into a harder operational one, and a production EDR is a different detection surface that we did not test.

reproducibility

Real attack execution on a live multi-domain forest, scored against the real Windows event log and a real SIEM. The model is a frozen black box, and the full four-configuration design, including this hypothesis, was committed before the run. The null result is reported in full.