North Korean macOS Malware Injects Hidden Prompts to Mislead AI Security Tools

A newly identified strain of North Korean malware targeting Apple computers uses a clever trick to evade detection: it injects hidden text designed to manipulate the AI analysis tools that security teams now rely on. Researchers have dubbed this threat macOS.Gaslight.

The malware came to light after an Apple XProtect signature update drew attention to a sample on VirusTotal in late May. The binary initially evaded detection by standard antivirus engines, underscoring its stealthy nature.

The core innovation of macOS.Gaslight is its use of prompt injection. While the malware functions as a remote access tool, it contains a hidden payload specifically crafted for large language models (LLMs) used in security analysis. This payload aims to influence the AI's output, coaxing it into producing misleading reports that downplay the malware's severity or misclassify it as a harmless file. The tactic seeks to deceive both automated systems and the human analysts who depend on their summaries.

Written in Rust, the malware points to a broader strategic shift by its developers. The language's cross-platform capabilities mean the same tool could be adapted to target Windows or Linux systems with relative ease. Researchers have linked the development and infrastructure to operators associated with the Democratic People's Republic of Korea (DPRK).

This campaign marks a notable evolution for DPRK-affiliated cyber actors, who have traditionally focused on Windows environments. The deliberate move to macOS, combined with this pioneering evasion technique, suggests a new phase in their offensive playbook.

For security teams, this discovery is a stark warning. It proves adversaries are actively developing methods to subvert AI-powered defenses. The immediate takeaway is clear: any workflow that feeds untrusted files directly to an LLM for inspection is now susceptible to manipulation. The conclusions drawn by these systems can no longer be implicitly trusted.

As LLMs become integral to security operations centers, their resilience must be rigorously validated. This incident confirms that prompt injection is a practical evasion technique in live malware, not just a theoretical chatbot risk. Defending against such threats will require more than just deploying AI tools—it demands securing and actively verifying the entire AI analysis pipeline. The cybersecurity landscape is now entering a phase of direct "AI versus AI" confrontation.

一種新近識別的朝鮮半島惡意軟件，針對蘋果電腦採用巧妙手法規避偵測：它注入隱藏文字，旨在操控保安團隊現時依賴的人工智能分析工具。研究人員已將此威脅命名為macOS.Gaslight。

該惡意軟件於五月底因Apple XProtect特徵庫更新而引起對VirusTotal上某樣本的關注。該二進制檔案最初成功規避了標準防毒引擎的偵測，突顯了其隱蔽特性。

macOS.Gaslight的核心突破在於其採用「提示注入」手法。雖然該惡意軟件的功能是遠端存取工具，但它包含一個專為保安分析所用的大型語言模型精心設計的隱藏有效負載。此負載旨在影響人工智能的輸出，誘使其生成誤導性報告，以淡化該惡意軟件的嚴重性或將其錯誤分類為無害檔案。此策略旨在同時欺騙自動化系統及依賴系統摘要的人類分析人員。

該惡意軟件以Rust語言編寫，反映其開發者正進行更廣泛的策略轉向。Rust的跨平台能力意味著同一工具可相對輕鬆地改裝以針對Windows或Linux系統。研究人員已將相關開發及基建歸因於與朝鮮民主主義人民共和國相關的操作者。

此次行動標誌著朝鮮相關網絡行為者的一個顯著演進，傳統上他們專注於Windows環境。此次刻意轉向macOS平台，結合此開創性的規避技術，預示其攻擊策略已進入新階段。

對保安團隊而言，此發現是一個嚴厲的警示。它證實對手正積極研發方法，以顛覆人工智能驅動的防禦機制。最直接的啟示是明確的：任何將不受信任檔案直接輸入大型語言模型進行檢查的工作流程，現已易受操縱。這些系統所得出的結論不能再被默認為可信。

隨著大型語言模型成為保安營運中心不可或缺的部分，其穩健性必須接受嚴格驗證。此次事件證實，「提示注入」是現實惡意軟件中一種實用的規避技術，而不僅僅是聊天機器人的理論風險。防範此類威脅，不僅需要部署人工智能工具，更需確保並主動驗證整個人工智能分析流程的安全性。網絡保安領域現已進入一個「人工智能對抗人工智能」的正面交鋒階段。

新聞來源 / Original News Source

Hong Kong Linux User Group 香港Linux用家協會 (HKLUG)

North Korean macOS Malware Injects Hidden Prompts to Mislead AI Security Tools