Cybersecurity threats are constantly evolving, and the latest warnings from security researchers highlight significant vulnerabilities within the machine learning (ML) software supply chain. Impress Computers is committed to staying ahead of these threats and helping businesses protect their operations from the risks associated with ML platforms.

Recent discoveries have revealed over 20 vulnerabilities in ML operations (MLOps) platforms that could be exploited by attackers. These vulnerabilities, categorized as inherent and implementation-based flaws, could lead to severe consequences, including arbitrary code execution and the introduction of malicious datasets.

Understanding the Vulnerabilities

MLOps platforms are essential for designing and executing ML model pipelines, with a model registry serving as a repository to store and version trained ML models. These models can be embedded within applications or made available for other clients to query via an API, a pattern known as model-as-a-service.
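To make the model-as-a-service pattern concrete, here is a minimal client-side sketch of querying a hosted model over HTTP. The endpoint URL, request schema, and token below are hypothetical placeholders, not any specific platform's API:

```python
# Minimal sketch of the model-as-a-service pattern: a client sends
# inputs to a hosted model over HTTP and receives a prediction back.
# The endpoint, schema, and token are hypothetical placeholders.
import requests

API_URL = "https://models.example.com/v1/predict"  # hypothetical endpoint
API_TOKEN = "replace-with-a-real-token"            # hypothetical credential

def query_model(features):
    """Send a feature vector to the hosted model and return its prediction."""
    response = requests.post(
        API_URL,
        json={"inputs": [features]},
        headers={"Authorization": f"Bearer {API_TOKEN}"},
        timeout=10,
    )
    response.raise_for_status()
    return response.json()

if __name__ == "__main__":
    print(query_model([5.1, 3.5, 1.4, 0.2]))
```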

However, the very processes and formats used within these platforms can introduce inherent vulnerabilities. For example, certain ML models allow automatic code execution upon loading, a feature that can be exploited by attackers to run malicious code. This risk extends to specific dataset formats and libraries, potentially opening the door to malware attacks simply by loading a publicly available dataset.
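One widely known instance of this behavior is Python's pickle format, which several ML serialization schemes build on. The self-contained snippet below (harmless here, it only prints a message) demonstrates why merely loading such a file executes attacker-controlled code; treat it as an illustration of the mechanism, not a recipe:

```python
# Demonstration of why loading certain model formats runs code:
# pickle lets an object specify a callable (via __reduce__) that the
# loader invokes DURING deserialization, before returning anything.
import pickle

class MaliciousModel:
    def __reduce__(self):
        import os
        # pickle.loads() will call os.system with this argument.
        return (os.system, ("echo 'code executed during model load'",))

payload = pickle.dumps(MaliciousModel())

# The victim only has to load the "model"; no method call is needed.
pickle.loads(payload)  # prints the message, proving execution on load
```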

Key Examples of Inherent Vulnerabilities

One significant example involves JupyterLab, a popular web-based interactive computational environment that allows users to execute code blocks and view the results. An inherent issue with JupyterLab is its handling of HTML output when running code blocks: if Python code emits HTML and JavaScript, that output is rendered by the user's browser, creating a potential security risk.

This behavior can be exploited by an attacker who inserts malicious JavaScript code into a JupyterLab notebook, which could then automatically inject and execute arbitrary Python code. This risk is especially concerning when a cross-site scripting (XSS) vulnerability is present.
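To make the rendering behavior concrete, the cell below (meant to run inside a JupyterLab notebook, not as a standalone script) returns output declared as HTML containing a classic XSS-style payload. Whether the embedded JavaScript actually fires depends on the notebook's trust state and output sanitizer; that boundary is exactly what attackers probe:

```python
# Run inside a JupyterLab cell. Output declared as text/html is
# handed to the browser to render; a correctly configured sanitizer
# should strip event handlers like the one below.
from IPython.display import HTML

# Classic XSS-style payload: a broken image whose onerror handler
# attempts to execute JavaScript in the notebook's browser context.
HTML('<img src="x" onerror="alert(\'JS ran in the notebook UI\')">')
```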

Researchers also identified an XSS flaw in MLflow (CVE-2024-27132, CVSS score: 7.5) that could result in client-side code execution in JupyterLab due to insufficient sanitization when running untrusted recipes.

Implementation Weaknesses in MLOps Platforms

Beyond inherent vulnerabilities, there are also critical implementation weaknesses within MLOps platforms. For instance, the lack of proper authentication in some platforms can allow threat actors with network access to exploit the ML Pipeline feature and execute arbitrary code.

These vulnerabilities are not just theoretical. Financially motivated attackers have already exploited the unpatched flaw in Anyscale Ray (CVE-2023-48022, CVSS score: 9.8) to deploy cryptocurrency miners on compromised infrastructure.

Another example involves container escape vulnerabilities in Seldon Core: an attacker who uploads a malicious model to the inference server could break out of its container, move laterally across the cloud environment, and access other users’ models and datasets.

The potential for these vulnerabilities to be chained together presents a significant risk, as they could allow attackers to infiltrate, spread within an organization, and compromise critical servers.

The Importance of Securing ML Platforms

For organizations deploying platforms that enable model serving, it’s crucial to understand that anyone who can serve a new model could potentially run arbitrary code on the server. Impress Computers emphasizes the importance of ensuring that the environment running the model is fully isolated and hardened against container escapes and other vulnerabilities.
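Isolation is the primary defense, but code-level checks complement it. As one hedged sketch, the restricted unpickler below follows the pattern from the Python pickle documentation: it resolves only an explicit allow-list of globals (the entries shown are illustrative) and rejects anything else an uploaded model might try to smuggle in:

```python
# Hardening sketch (a complement to container isolation, not a
# substitute): a restricted unpickler that resolves only an explicit
# allow-list of globals, per the pattern in Python's pickle docs.
import io
import pickle

ALLOWED_GLOBALS = {
    # Illustrative entries; a real deployment would list exactly the
    # classes its model format legitimately needs.
    ("collections", "OrderedDict"),
}

class RestrictedUnpickler(pickle.Unpickler):
    def find_class(self, module, name):
        if (module, name) in ALLOWED_GLOBALS:
            return super().find_class(module, name)
        # Block e.g. os.system smuggled in via __reduce__.
        raise pickle.UnpicklingError(f"blocked global: {module}.{name}")

def safe_load(data: bytes):
    """Deserialize untrusted bytes with the allow-list enforced."""
    return RestrictedUnpickler(io.BytesIO(data)).load()
```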

Recent Security Disclosures in AI Frameworks

In addition to the risks associated with MLOps platforms, recent disclosures have highlighted vulnerabilities in artificial intelligence (AI) frameworks. For example, two now-patched vulnerabilities in the open-source LangChain generative AI framework (CVE-2023-46229 and CVE-2023-44467) could have allowed attackers to execute arbitrary code and access sensitive data.

Other recent findings, such as those revealed by Trail of Bits in the Ask Astro chatbot application, show how AI-powered applications can be exposed to threats like chatbot output poisoning and denial-of-service (DoS) attacks.

The Growing Threat of Data Poisoning in ML

Security researchers are also uncovering techniques to poison training datasets, with the goal of tricking large language models (LLMs) into producing vulnerable code. Unlike previous attacks that embedded malicious payloads in detectable sections of the code, new methods like CodeBreaker leverage LLMs for sophisticated payload transformation, making the detection of vulnerabilities more challenging.

Conclusion

As the landscape of cybersecurity threats continues to evolve, Impress Computers remains dedicated to providing comprehensive protection for businesses leveraging ML and AI technologies. The discovery of these vulnerabilities serves as a stark reminder of the importance of securing all aspects of the software supply chain, particularly in the rapidly growing field of machine learning. By partnering with Impress Computers, businesses can ensure they have the right defenses in place to mitigate these emerging risks.


Free Report:

The 7 Most Critical IT Security Protections Every Business Must Have In Place Now To Protect Themselves From Cybercrime, Data Breaches And Hacker Attacks