Traditional SIEMs follow a predictable model: collect logs from many sources, ingest them into a central store, normalize them into a readable schema, index them, and then use queries, dashboards, and alerts to investigate and respond. This design exists because data is scattered, inconsistent, and hard to retrieve. Centralization solves those problems but introduces challenges in cost, performance, and complexity. SIEMs demand a large centralized data store and significant storage capacity, and they carry high operational costs. They also depend on specialized query languages and Python scripting for analysis, along with complex system maintenance and tuning that can make them difficult to manage effectively.

I have been thinking about what happens if those foundations are disrupted. What if we no longer need to collect, normalize, and query everything in one place? Imagine an AI-based system that can read and learn directly from data at its source. Instead of moving every log into a human-readable index, it would learn behaviors, store only what is legally required, and retain that data in a compact, raw, non-human-readable form. The AI would understand how the environment behaves and could identify patterns or anomalies without needing constant data ingestion or manual queries.
In this model, analysts would not depend on complex query languages or static dashboards. Instead, an AI-based dashboard would surface what matters most in real time, highlighting emerging risks, summarizing trends, and answering direct questions conversationally. You could simply ask, “What new risks should I know about?” or “Show me recent activity for this device.” The AI would reason through available data and respond with context, not just rows of data. It would learn from your feedback and adjust over time, reducing noise and emphasizing insights that align with your priorities.
Both SIEMs and modern data lake architectures exist in part to handle long-term archival. Retaining logs for years can satisfy compliance requirements, enable audits, and provide historical insight. But there is another side to that coin: the longer you keep sensitive data, the greater your legal and privacy risk. If a regulation requires you to store logs for three years, keeping them for ten could expose a previously undetected data breach or create unnecessary liability. It is not unlike choosing when to purge your tax records or clear your browser history.
An AI-driven platform could change how we think about retention entirely. Instead of static retention policies, it could learn what information is useful or legally required and selectively forget what is not. You might instruct the AI to discard data that is older than a specific timeframe, to remove records related to certain privacy protections, or even to exclude certain individuals such as executives. The AI could even respond dynamically to changes in data retention regulations in the countries where you operate. The concept of teaching an AI not only what to remember but also what to forget could become a cornerstone of responsible data management in future SOCs.
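As a minimal sketch of what policy-as-predicates might look like, the snippet below encodes hypothetical retention rules (per-jurisdiction maximum ages, excluded subjects) as plain Python and applies them to records. All names, fields, and limits here are illustrative assumptions, not any vendor's API; in the model described above, the AI would derive and update these rules itself from regulation and feedback.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical retention rules expressed as data rather than static
# per-index retention settings. Values are illustrative only.
RETENTION_DAYS = {"us": 365 * 3, "eu": 365}   # per-jurisdiction maximums
PROTECTED_SUBJECTS = {"exec-0042"}            # e.g. executives excluded from retention

def should_forget(record: dict, now: datetime) -> bool:
    """Return True if this record should be discarded under current policy."""
    age = now - record["timestamp"]
    max_age = timedelta(days=RETENTION_DAYS.get(record["region"], 365))
    if age > max_age:
        return True                 # older than the jurisdiction allows us to keep
    if record["subject"] in PROTECTED_SUBJECTS:
        return True                 # subject is excluded from retention entirely
    return False

now = datetime(2025, 1, 1, tzinfo=timezone.utc)
records = [
    {"timestamp": now - timedelta(days=30),   "region": "us", "subject": "user-1"},
    {"timestamp": now - timedelta(days=2000), "region": "us", "subject": "user-2"},
    {"timestamp": now - timedelta(days=30),   "region": "us", "subject": "exec-0042"},
]
kept = [r for r in records if not should_forget(r, now)]
print(len(kept))  # → 1
```

The interesting shift is not the filter itself but who maintains it: today an engineer edits these rules; in an AI-driven platform, the system would rewrite them as regulations and instructions change.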
Role-based access could also be reimagined. Instead of relying on complex and often over-permissive access models, AI could enforce user personas defined through natural language system prompts. For example, one class of users might only be permitted to see data from the United States, another might see personal information such as addresses or social security numbers only in redacted form, and yet another might be restricted from viewing data related to peers or immediate management. These personas could be expressed as clear, human-readable statements rather than intricate permission hierarchies, simplifying access control while maintaining strict compliance and data protection.
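To make this concrete, here is a hedged sketch of a persona defined as human-readable statements. The persona names, rules, and the `pre_filter` helper are all hypothetical; the statements would be injected into the AI's system prompt, while deterministic pre-filters could enforce the hard rules before any data reaches the model.

```python
# Personas as human-readable access statements (illustrative, not a real API).
PERSONAS = {
    "us_analyst": [
        "You may only show data originating from the United States.",
        "Redact personal fields such as addresses and social security numbers.",
    ],
}

def system_prompt(persona: str) -> str:
    """Assemble the persona's rules into a system prompt for the AI layer."""
    rules = "\n".join(f"- {r}" for r in PERSONAS[persona])
    return f"You are a SOC assistant. Follow these access rules:\n{rules}"

def pre_filter(events: list[dict], persona: str) -> list[dict]:
    """Deterministic enforcement of the geographic rule before the model sees data."""
    if persona == "us_analyst":
        return [e for e in events if e.get("country") == "US"]
    return events

events = [{"country": "US", "msg": "login"}, {"country": "DE", "msg": "login"}]
visible = pre_filter(events, "us_analyst")
print(len(visible))  # → 1
```

Pairing a natural-language prompt with a deterministic filter matters: the prompt shapes the AI's behavior, but hard compliance boundaries should not rely on the model alone.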
Early Signs of Change
Nearly every major security provider now incorporates some form of AI enhancement into their products. What was once marketed as breakthrough innovation has quickly become a baseline expectation across the industry. From behavior analytics and anomaly detection to automated investigation and incident summarization, AI features are now standard components of modern security platforms. The challenge is no longer whether AI is present, but how deeply it is integrated into these services.
To explore how the industry is evolving in this direction, I used ChatGPT to gather perspective on leading vendors that are advancing toward more intelligent, AI-augmented operations. Palo Alto Networks’ XSIAM, Google Security Operations (formerly Chronicle), CrowdStrike’s Falcon LogScale, Databricks’ Lakehouse for Security, and Microsoft’s Security Copilot each reflect aspects of this transformation.
There is also growing adoption of open data standards such as the Open Cybersecurity Schema Framework (OCSF) and “security data lake” architectures. Microsoft's Sentinel data lake and AWS Security Lake, for example, store normalized logs in Parquet format and allow analytics tools to read them directly. This shift toward interoperable, lake-based architectures supports a future where AI can analyze data in place and reason across sources seamlessly, setting the stage for more adaptive and autonomous security operations.
Despite this progress, most of these solutions still feel like an AI layer placed on top of traditional architectures such as SIEM, EDR, AV, and IDS, rather than a complete redesign of how these systems fundamentally operate. They enhance what exists but rarely replace the core model of centralized collection, schema-based parsing, and rule-driven detection.
The Road Ahead
The SIEM as we know it will not disappear, but it will be disrupted. I believe that deeper adoption of AI and large language models will eventually transform the SIEM model entirely by rethinking long-standing assumptions around centralization, long-term retention, human readability, query operations, and storage requirements. Just as important, it may reshape the business and cost structures that have defined SIEMs for many years.
In today’s competitive landscape, nearly every vendor promotes some level of AI integration, and that alone is no longer impressive. The conversation has shifted toward aesthetics, brand loyalty, and pricing rather than true innovation. This environment creates an opportunity for a major disruption, whether from an established leader willing to rebuild the model from the ground up or from a new entrant bold enough to design a truly AI-native solution.
Eventually, the data itself may become completely hidden from the operator, protected from the user, encrypted, and even unreadable to a human. The only interface will be the AI, which will handle reasoning, correlation, and visualization. This evolution could improve accessibility, reduce cost, shrink data size, simplify auditing, and strengthen security by removing direct data exposure. In essence, table data would be replaced with tokens: numerical representations that a model can understand and process mathematically.
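A toy illustration of that last idea: the snippet below maps the words of a log line to integer IDs. A real system would use a learned tokenizer and embeddings, and the vocabulary here is built on the fly purely for demonstration, but it shows how the stored form becomes compact numbers rather than readable text.

```python
# Toy word-level tokenizer: maps each distinct word to an integer ID.
# Real systems use learned subword tokenizers; this only illustrates the
# shift from human-readable text to numeric representations.
vocab: dict[str, int] = {}

def tokenize(text: str) -> list[int]:
    ids = []
    for word in text.lower().split():
        if word not in vocab:
            vocab[word] = len(vocab)   # assign the next unused ID
        ids.append(vocab[word])
    return ids

log = "Failed login for admin from 10.0.0.5"
tokens = tokenize(log)
print(tokens)  # → [0, 1, 2, 3, 4, 5]
# The AI reasons over these IDs (and the vectors behind them); once the
# model has consumed the mapping, the raw text need not be retained at all.
```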
The next major evolution in cybersecurity operations may not be a new SIEM at all, but the complete abstraction of one.