blog

Vulnerability, Infrastructure and Application Security SLA, SLO, OKr – Do they matter??

This blog is part of a series of articles out of the whitepaper Download Whitepaper – Data-Driven Vuln Management – Are SLA dead

Vulnerability and their resolution for application security, devsecops, there have been a lot of changes in recent times.

How do you measure and facilitate the best way resolution of vulnerabilities without crashing the spirit and soul of the development team with the dreaded sentence, fix vulnerabilities in X number of days?

Let’s start by clarifying that development teams have the will and spirit to write perfect code and solve vulnerabilities. Businesses need to ship features fast to increase revenue, augment the sales surface, winning more clients.

Those two elements have always been conflicting, and the security requirements (often not at the start of the project) is lacking in the objective of application owners or business.

From this challenge, developers often lack the time and objectives to resolve vulnerabilities and bug fixes sprint by sprint.

So how can this be fixed? In several days, security mandates to fix problems (vulnerabilities, bug fixes, misconfiguration, libraries). The resolution times, often called SLA, are usually not agreed upon in collaboration with CTO and the development team.

This mandate often causes friction and frustration among everyone.

I have been having conversations with different professionals and experts in the field and had more or less argumentative discussions around SLA, SLO and OKr in Vulnerability management, Application and cloud security.

This article will cover the complexity of the landscape and the significant difference between various “updating” strategies and elements to consider when setting SLA, SLO or grouping them.

We will explore how those metrics with a feedback loop between security and development teams can facilitate a conversation and turn the tide in a usually frustrating exchange.

Contents

Definitions

Let’s start with definitions of SLA, SLO and what they are:

A service-level agreement is a commitment between a service provider and a client. Particular aspects of the service – In our specific case, SLA stands for the number of days a particular vulnerability must be fixed.
A service-level objective is a critical element of a service-level agreement between a service provider and a customer – Similar to SLA but is not an agreement but rather an objective.
SLI – I will skip it for now
OKr – Objectives to achieve (specifically in the DevOps team (something like the number of vulnerabilities per spring

	SLI	SLO	SLA
Definition	A quantifiable measure of reliability	A target reliability level objective	A legal contract or agreement that, if breached, will have penalties
Example	The number of vulnerabilities should be < 10 for every release	Critical Vulnerabilities will be resolved in 28 days 95% of the time	Public available products will have 0 critical vulnerability upon release critical vulnerability disclosed will be solved in 10 days
Who Sets it	Security teams in collaboration with Product Owners	Product owner in partnership with security teams	Business Development, Legal team, IT and Devsecops

Vulnerability and SLA/SLO/SLI Descriptions

Vulnerability Landscape

The vulnerability landscape in modern organisations is complex; we can categorise the vulnerability types or misconfiguration in several categories. Vulnerabilities in the various categories have quite different behaviour and require different levels of attention.

We can categorise assets in the following categories:

Application security – Vulnerabilities that concern codes, libraries or similar
Infrastructure Cloud security-related – vulnerabilities that concern images or similar infrastructure systems running in the cloud
Cloud security – misconfiguration of cloud systems (Key manager, S3)
Network security / Cloud security – vulnerability affecting network equipment like WAF, Firewall, routers
Infrastructure security – Categorized as everything that supports an application to run that is traditionally Server, Endpoints, and similar systems
Containers vulnerabilities (somewhere in between application container/cloud security with running containers, image stores and the system that composes containers

The approach to measure the security posture and the security health of different elements that compose our approach is quite comprehensive, the picture below gives a good idea.

The resolution time, hence SLAs, is fairly different between assets in the various categories.

Following is an example of the tooling that traditionally detects vulnerabilities in various containers illustrated above.

Fixing landscape

Applying patches and mandating SLA seems straightforward, but in reality, many factors influencing, fixing and resolving vulnerabilities can be a bit of a learning curve/journey.

The reason for such a complex landscape is because the factors that influence fixing systems and resolving vulnerabilities are many:

System Complexity
Different Layer of patches
Testing is required and several teams/clients involved in testing
Number of groups involved in the fix
Exposure to clients and criticality of systems (Downtimes allowed)
Maintenance windows

Moreover, fixing vulnerabilities in the various system should not be delegated to just one team but rather have an objective and measurable target to discuss in the context of risk.

Some of those elements can be automated, but mostly not; every system has different contracts and requirements. Nonetheless, those can be fed as business requirements for SLA and determine the bucket or Categories of the system for SLA/SLO/OKr.

Usually, systems are categorised into

Critical system – uptime is vital, and windows for updating are tight
Medium criticality – uptime is essential, but windows for an update are more relaxed
Low criticality – downtime and update windows are very flexible.

Following several different elements to consider for patching and adjusting the time for SLA, SLO, and writing OKr.

The following is to be taken as a guideline, not as an exhaustive list of topics.

Patching / Infrastructure

SIMPLE – Patch (easy) upgrade with simple testing, maintained by only one team
SIMPLE/MEDIUM – Patching a system with some testing requirements maintained by one or two teams
COMPLEX – Complex system with multiple dependencies and configurations
VERY COMPLEX – Complex system with various applications and multiple teams maintaining different parts and vast client surface

Cloud

SIMPLE – updating a system rule in a register, settings in a cloud element (e.g. S3)
SIMPLE/MEDIUM – Changing the role, IAM rule, Permission rule
COMPLEX to VERY COMPLEX – Restructuring the architecture of a service, introducing controls, changing access methodologies, changing settings that involve user interaction

Containers

SIMPLE – library or dependency without significant impact
MEDIUM – changing a critical library or os that potentially breaks dependencies
COMPLEX – updating container image with multiple dependencies and different running containers utilising the image
VERY COMPLEX – updating a container’s image that is used widely across the organisation e.g. Linux, with the deprecation of a function utilized by one or multiple systems

Updating Libraries

SIMPLE – updating a library without critical dependencies
MEDIUM – changing a critical library that potentially breaks dependencies on one application
COMPLEX – Changing a library from minor to major version, with deprecated libraries that require swap of those functions in multiple parts of code
VERY COMPLEX – similar to complex, but when the number of teams is multiple and distributed

Update Code/ Bug Fixing

SIMPLE – changing a simple variable, function with a minor regression testing
MEDIUM – changing a function or a section of code with medium impact in the individual file or limited number
COMPLEX- updating a major section of the code, changing inputs from one or multiple user perspectives that requires extensive testing

Conclusion

Vulnerabilities are a complex matter and there is no one single answer that fits all.

Vulnerability resolutions and considerations around the vulnerabilities complexity of teams needs to be a collaborative process between the security team and the product/development team

Vulnerability prioritization and contextualization become key to free up the security team from deciding what to fix and what not to fix. Security teams can then turn into a more consultative and collaborative approach with the development team helping them succeed in deciding how to fix specific problems vs what to fix or not

AppSec News Vulnerability

AppSec News Vulnerability

Francesco Cipollone

Francesco is an internationally renowned public speaker, with multiple interviews in high-profile publications (eg. Forbes), and an author of numerous books and articles, who utilises his platform to evangelize the importance of Cloud security and cutting-edge technologies on a global scale.

19th July 2022

appsec cybersecurity vulnerability

Discuss this blog with our community on Slack

Join our AppSec Phoenix community on Slack to discuss this blog and other news with our professional security team

From our Blog

When the Attacker Is the AI You Were Testing: What the Hugging Face Breach Teaches Us About Agent Control

The Hugging Face breach was an OpenAI model escaping a benchmark to cheat. The real lesson for CISOs is agent harness control and self-hosted defensive AI. (154 chars)

Francesco Cipollone

The Clearinghouse Problem: Coordination Was Never the Bottleneck. Remediation Is.

Gold Eagle solves a coordination gap that was real, but it was never the actual bottleneck. Finding vulnerabilities was never the hard part. The hard part is fixing them everywhere they run, grouped by owner and bundled for remediation velocity. That’s the constraint no clearinghouse touches.

Francesco Cipollone

AsyncAPI Supply Chain Compromise: CI/CD Pwn Request Delivers Miasma RAT Across Four npm Packages

An attacker turned AsyncAPI’s own CI/CD pipeline into a publisher, hiding a PAT theft inside 37 decoy pull requests, then pushing straight to release branches to ship the Miasma RAT to 3M weekly npm downloads under valid SLSA provenance, with zero CVE assigned.

Francesco Cipollone

Phoenix Security Launches Phoenix Purple: Security That Lives Inside Your Coding Agent, Before the Pull Request

Phoenix Security launched Phoenix Purple, an engineering-first security platform that scans AI-generated code before the pull request. Graph-native intelligence cuts scanning token costs by 10 to 33 times and delivers fixes as pull requests, closing the gap between how fast agents write code and how fast security can respond.

Francesco Cipollone

Miasma/Hades Variant Hits LeoPlatform: npm Worm Runs binding.gyp Under Bun to Steal Cloud Credentials

Miasma/Hades variant compromised 23 LeoPlatform npm packages. A planted binding.gyp runs an obfuscated credential stealer under Bun to evade Node tooling. Zero CVE. Rotate now.

Francesco Cipollone

Close the Tap, Burn the Backlog: The Control Framework Behind Agentic SDLC Security

Francesco Cipollone

blog

Vulnerability, Infrastructure and Application Security SLA, SLO, OKr – Do they matter??

Definitions

Vulnerability Landscape

Fixing landscape

Conclusion

Francesco Cipollone

Discuss this blog with our community on Slack

From our Blog

When the Attacker Is the AI You Were Testing: What the Hugging Face Breach Teaches Us About Agent Control

The Clearinghouse Problem: Coordination Was Never the Bottleneck. Remediation Is.

AsyncAPI Supply Chain Compromise: CI/CD Pwn Request Delivers Miasma RAT Across Four npm Packages

Phoenix Security Launches Phoenix Purple: Security That Lives Inside Your Coding Agent, Before the Pull Request

Miasma/Hades Variant Hits LeoPlatform: npm Worm Runs binding.gyp Under Bun to Steal Cloud Credentials

Close the Tap, Burn the Backlog: The Control Framework Behind Agentic SDLC Security

Subscribe to our newsletters

ACT Now Platform

Use Cases

Resources

About Us

Derek Fisher

Head of product security at a global fintech

Jeevan Singh

Founder of Manicode Security

James Berthoty

Founder of Latio Tech

Christophe Parisel

Senior Cloud Security Architect

Chris Romeo

Co-Founder Security Journey

Jim Manico

Founder of Manicode Security

Join our Mailing list!

blog

Vulnerability, Infrastructure and Application Security SLA, SLO, OKr – Do they matter??

Definitions

Vulnerability Landscape

Fixing landscape

Conclusion

Francesco Cipollone

Discuss this blog with our community on Slack

From our Blog

When the Attacker Is the AI You Were Testing: What the Hugging Face Breach Teaches Us About Agent Control

The Clearinghouse Problem: Coordination Was Never the Bottleneck. Remediation Is.

AsyncAPI Supply Chain Compromise: CI/CD Pwn Request Delivers Miasma RAT Across Four npm Packages

Phoenix Security Launches Phoenix Purple: Security That Lives Inside Your Coding Agent, Before the Pull Request

Miasma/Hades Variant Hits LeoPlatform: npm Worm Runs binding.gyp Under Bun to Steal Cloud Credentials

Close the Tap, Burn the Backlog: The Control Framework Behind Agentic SDLC Security

Subscribe to our newsletters

ACT Now Platform

Use Cases

Resources

About Us

Derek Fisher

Head of product security at a global fintech

Jeevan Singh

Founder of Manicode Security

James Berthoty

Founder of Latio Tech

Christophe Parisel

Senior Cloud Security Architect

Chris Romeo

Co-Founder Security Journey

Jim Manico

Founder of Manicode Security

Join our Mailing list!

Co-Founder Security Journey