AI: The New Attack Surface

As large-language models (LLMs) and AI assistants become increasingly integrated into enterprise workflows, boards of directors are intensifying the pressure for demonstrable productivity gains. However, the very features that make these AI tools valuable – their ability to browse live websites, retain user context across interactions, and connect to various business applications – simultaneously introduce new and potentially significant cybersecurity vulnerabilities.

Recent research by Tenable highlights this escalating risk. Dubbed “HackedGPT,” the research details a series of vulnerabilities and attack vectors centered around indirect prompt injection and related techniques. These attack methods can potentially enable malicious actors to exfiltrate sensitive data and establish persistent malware within corporate systems. While some of these issues have reportedly been addressed, Tenable asserts that others remain exploitable, according to the company’s advisory.

Mitigating these inherent risks requires a fundamental shift in how organizations govern, control, and operate AI assistants. These tools should be treated not merely as productivity enhancers, but as distinct users or devices within the IT ecosystem, subject to rigorous audit and monitoring protocols.

Tenable’s findings underscore the potential for AI assistants to become conduits for security breaches. Indirect prompt injection, for example, involves embedding malicious instructions within web content that the AI assistant accesses during its browsing activities. These instructions can then trigger unintended data access and actions without the user’s awareness or consent. Another attack vector exploits front-end queries to inject and propagate malicious commands.

The business implications of these vulnerabilities are substantial, encompassing incident response costs, legal and regulatory scrutiny, and the potential for significant reputational damage should a breach occur.

Prior research has also demonstrated the susceptibility of AI assistants to leaking personal or sensitive information through injection techniques. AI vendors and cybersecurity experts are engaged in a continuous cycle of identifying and patching these emerging vulnerabilities, as exemplified by OpenAI’s response to a zero-click vulnerability detailed by security researchers.

This pattern is familiar to seasoned technology professionals: expanding functionality inevitably introduces new potential failure modes. A proactive approach that treats AI assistants as live, internet-facing applications – rather than solely as productivity tools – is critical for enhancing overall cybersecurity resilience.

Governing AI Assistants: A Practical Approach

1) Establish a Comprehensive AI System Registry

Maintain a detailed inventory of every LLM, AI assistant, or agent deployed within the organization, regardless of whether it resides in the public cloud, on-premises infrastructure, or within software-as-a-service (SaaS) applications. This aligns with frameworks like the NIST AI RMF Playbook. For each AI system, meticulously record its owner, intended purpose, specific capabilities (such as browsing and API connector access), and the data domains it is authorized to access. Without such a registry, “shadow agents” can persist with undocumented privileges, creating significant security blind spots. The rise of “Shadow AI,” once encouraged by some vendors, presents a considerable threat that IT departments need to handle seriously.

2) Implement Segregated Identities for Users, Services, and Agents

Traditional identity and access management (IAM) systems often conflate user accounts, service accounts, and automated devices. AI assistants that interact with websites, invoke external tools, and modify data require distinct identities. These identities should be governed by zero-trust principles and the principle of least privilege. Implementing robust agent-to-agent chain mapping (tracking who requested what action from whom, involving which data, and at what time) provides a basic audit trail and enhances accountability. It is important to recognize that while agentic AI can generate highly creative output and actions, it lacks the inherent constraints of human employees, such as disciplinary policies.

3) Apply Context-Aware Constraints to Risky Features

Make features like web browsing and autonomous actions opt-in functionalities for each specific use case. For customer-facing AI assistants, establish short data retention periods unless there is a compelling justification and a lawful basis for longer retention. When deploying AI assistants for internal engineering tasks, limit their use to isolated projects with comprehensive logging enabled. Implement data loss prevention (DLP) measures on connector traffic to prevent unauthorized access to file stores, messaging platforms, or email systems. Past vulnerabilities in plugins and connectors have highlighted how integrations can significantly increase an organization’s attack surface.

4) Institute Robust Monitoring of AI Systems

Capture all assistant actions and tool calls as structured logs.
Implement alerts for anomalous behavior, such as:
- Sudden increases in browsing activity to unfamiliar domains.
- Attempts to summarize obfuscated code blocks.
- Unusual memory write patterns.
- Connector access outside of established policy boundaries.
Integrate prompt injection tests into pre-production security assessments.

5) Invest in Human Capital and Expertise

Provide training to developers, cloud engineers, and security analysts to enable them to recognize the warning signs of prompt injection attacks. Encourage users to report any unusual AI assistant behavior, such as unexpectedly summarizing content from websites they did not explicitly open. Establish clear procedures for quarantining suspicious AI assistants, clearing their memory, and rotating their credentials after suspicious events. The skills gap in AI security is real; without continuous upskilling, governance will inevitably lag behind adoption.

Key Decision Points for IT and Cloud Leaders

Question	Why it matters
Which AI assistants can browse the web or write data?	Browsing and memory retention are common pathways for prompt injection and malware persistence. Limit these capabilities based on specific use cases.
Do AI agents have distinct identities and auditable delegation mechanisms?	This prevents traceability gaps when malicious instructions are introduced indirectly.
Is there a centralized registry of AI systems, including owners, scopes, and data retention policies?	This supports robust governance, facilitates the right-sizing of security controls, and provides budget transparency.
How are connectors and plugins being governed?	Third-party integrations have a history of security vulnerabilities. Enforce the principle of least privilege and robust DLP measures.
Are you conducting pre-deployment testing for zero-click and one-click attack vectors?	Public research has demonstrated the feasibility of these attacks through crafted links and content.
Are the AI vendors patching vulnerabilities promptly and publishing detailed fix information?	The rapid pace of feature development means new security issues will emerge frequently. Verify vendor responsiveness.

Risks, Cost Visibility, and the Human Factor

Hidden costs: AI assistants that browse the web or retain memory consume significant compute, storage, and network egress, which may not be properly accounted for by finance teams or those responsible for XaaS usage monitoring. A comprehensive registry and accurate metering are crucial for preventing unexpected cost overruns.
Governance gaps: Audit and compliance frameworks designed for human users will not automatically capture agent-to-agent delegation chains. Align security controls with established frameworks like the OWASP Top 10 for LLMs and the NIST AI RMF.
Security risk: Indirect prompt injection can be invisible to end users and can be transmitted through various media formats, including text and code formatting.
Skills gap: Many organizations have not yet fully integrated AI/ML and cybersecurity practices. Invest in comprehensive training that covers AI assistant threat modeling and prompt injection testing methodologies.
Evolving security posture: Expect a continuous stream of new vulnerabilities and corresponding fixes. OpenAI’s remediation of a zero-click vulnerability serves as a reminder that vendor security postures can change rapidly and require ongoing verification.

The Bottom Line

The key takeaway for executives is clear: treat AI assistants as powerful, networked applications with their own lifecycle and a high propensity for both being the target of cyberattacks and for taking unpredictable actions. Implement a comprehensive AI system registry, enforce segregated identities, restrict risky features by default, log all significant activity, and regularly rehearse incident containment procedures.

By establishing these critical guardrails, organizations can increase the likelihood that agentic AI will deliver measurable efficiency and resilience – without inadvertently becoming their newest and most insidious breach vector.

Original article, Author: Samuel Thompson. If you wish to reprint this article, please indicate the source:https://aicnbc.com/12314.html

Governing AI Assistants: A Practical Approach

1) Establish a Comprehensive AI System Registry

2) Implement Segregated Identities for Users, Services, and Agents

3) Apply Context-Aware Constraints to Risky Features

4) Institute Robust Monitoring of AI Systems

5) Invest in Human Capital and Expertise

Key Decision Points for IT and Cloud Leaders

Risks, Cost Visibility, and the Human Factor

The Bottom Line

About Author

Samuel Thompson

Related News

Flawed AI Benchmarks Endanger Enterprise Budgets

Switzerland Launches Fully Open-Source AI Model

Promise, Skepticism, and Their Implications for Southeast Asia