Reachability Analysis

As applications grow in size and complexity, so does the potential for vulnerabilities. The attack surface also expands with increased reliance on open-source software components. Given tight deadlines and competing priorities, it is often impractical to remediate every vulnerability. A pragmatic way to approach a near-zero risk state in a limited timeframe is to target the open-source software vulnerabilities that lie along the execution paths of an application i.e. are reachable.

Reachability Analysis allows you to identify these exposed components. When integrated into application scans, this analysis can detect which vulnerable components are accessible, labeling any associated policy violations as "Reachable" in the application report. The remediation efforts can then be targeted towards these "Reachable" policy violations.

How does Reachability Analysis Work?

Reachability Analysis is designed to run on Java (or any JVM language) binaries within the scan target. When enabled during application scans, it examines both the application binaries and their dependency binaries in the scan target folder. Reachability Analysis also supports JavaScript/Node.js projects.

If a vulnerable component is detected and the application code calls a specific method in that component which could be exploitable, the policy violation is labeled as "Reachable".

If a vulnerable component is found but the application code does not execute any calls to the vulnerable method, the policy violation is labeled as "Not Reachable".

If a vulnerable component is detected and the application code calls a specific method in that component which could be exploitable, the policy violation is labeled as "Reachable." Conversely, if a vulnerable component is found but the application code does not execute any calls to the vulnerable method, the policy violation is labeled as "Not Reachable."

Example

Consider an XML reader application that has a dependency on com.vulnerables.utils: vulnerable-deserializer:1.5. It has an endpoint that receives an XML file. The endpoint code reads the content of the XML file using the method XMLReader.readFile() and renders it on-screen. Based on our security data, the method readfile() has known vulnerabilities and public CVEs .

Reachability analyzes the .jar files created by the build tool and scans the method signatures to identify the vulnerable methods that are reachable. The policy violation for the component com.vulnerables.utils: vulnerable-deserializer:1.5 is labeled as "Reachable." This will be displayed in the policy evaluation report for the corresponding scan and on the Violations Dashboard.

The XML reader application also has a feature that allows users to write the XML input to a LaTeX document and export it. One of the methods of the LaTeX library is vulnerable to remote command execution. Due to its known vulnerability, this method is never called and so is not in the execution path. Reachability Analysis will not label the policy violation as "Reachable."

Sonatype Integrations Supporting Reachability Analysis

Reachability Analysis is currently supported on all Sonatype CI integrations. To enable the feature and configure namespaces, see the parameters section in the integration-specific documentation:

Using Java Reachability Analysis on Jenkins

The examples below are for the Sonatype Platform Plugin for Jenkins integration, which currently supports a fine-grained configuration for the feature. However, the principles apply to some extent to all CI integrations.

For Best Results with Reachability Analysis

Results for Reachability Analysis depend on the strategy used to select the types of methods to scan and the selection of namespaces to narrow down the scan entry points.

Strategies for Enabling Reachability Analysis

Minimal Configuration
The example below shows using the Reachability Analysis with minimal configuration.
```
reachability: [
  javaAnalysis: [
    enable: true
  ]
]
```
Method Selection
This is currently available only for Reachability Analysis performed on Sonatype Platform Plugin for Jenkins and Sonatype for Bamboo Data Center plugin.
You can use one of the following available strategies for method selection for scans with Reachability Analysis enabled:
JAVA_MAIN:Selects all methods matching public static void main(String[] args)
PUBLIC_CONCRETE: Selects public non-abstract/synthetic methods from non-interface/annotation classes
ACCESSIBLE_CONCRETE: Selects public/protected non-abstract/synthetic methods from non-interface/annotation classes. This is the default entry point strategy.
CONCRETE: Selects all non-abstract/synthetic methods from non-interface/annotation classes.
ALL: Selects all methods from all non-interface/annotation classes.
You can enable the Reachability Analysis feature with the minimal configuration as below:
Example:
The example below shows how to enable the Reachability Analysis feature with the ACCESSIBLE_CONCRETE strategy.
```
reachability: [
  javaAnalysis: [
    enable: true,
    entrypointStrategy: 'ACCESSIBLE_CONCRETE'
  ]
]
```
Selection of Namespaces
Select the methods that should be considered as entry points (points in code where execution begins) for the analysis.
Namespaces are specified at the level of packages. All packages nested under a namespace are considered for entry point selection. If multiple namespaces are specified, all of them will be included.
Reachability Analysis will start with the namespaces specified and subsequently analyze all others in the execution path.
Example:
Consider the following project structure:
```
	src
	|—main
	|	|—java
	|	|	|—com
	|	|	|	|—sonatype
	|	|	|	|	|—iq
	|	|	|	|	|	|—domain
	|	|	|	|	|	|	|—DomainClassOne.java
	|	|	|	|	|	|—application
	|	|	|	|	|	|—repository
	|	|	|	|	|—wrappers
	|	|	|	|	|	|—OktaLoginWrapper.java
	|	|	|	|	|—configs
	|—test
	|	|—java
	|	|	|—com
	|	|	|	|—sonatype
	|	|	|	|	|—iq
	|	|	|	|	|	|—domain
	|	|	|	|	|	|	|—DomainClassOne.java
	|	|	|	|	|	|—application
	|	|	|	|	|	|—repository
	|	|	|	|	|—wrappers
	|	|	|	|	|	|—OktaLoginWrapper.java
	|	|	|	|	|—configs
```
The Reachability Analysis feature is configured as follows:
```
reachability: [
  javaAnalysis: [
    enable: true,
    entrypointStrategy: 'ACCESSIBLE_CONCRETE'
    namespaces: [
      [namespace: 'com.sonatype.iq']
    ]
  ]
]
```
In the example above, the namespaces property specifies the namespace 'com.sonatype.iq', which will be considered as the entry point for the analysis. This namespace has domain, application, and repository packages under its scope. Methods belonging to DomainClassOne.java class (under the domain package in the 'com.sonatype.iq' namespace) will be analyzed before other methods in the package classes. Similarly, methods belonging to classes under other applications and repository packages will be analyzed at the start of the analysis for the package.
Packages of type wrappers, with namespace 'com.sonatype.wrappers' and config packages with namespace 'com.sonatype.configs' will be omitted when an entry point for the analysis is being established. Similarly, methods belonging to the OktaLoginWrapper class with namespace 'com.sonatype.wrappers.OktaLoginWrapper.java' will be omitted when establishing an entry point.
Notes:
- An entry point is any method signature that aligns with the selected strategy. For example, for JAVA_MAIN strategy, all entry point methods have public static void main as method signature.
- You can use regular expressions when specifying the namespace.
  Example: For org.foo.example, you can use regular expressions with '/' at the start and end of the string as /^org\\.+.*\\.example\$/
Using the Parameter Includes
Multi-module projects could have several .jar files when built. Many of these .jar files are dependencies of another .jar file, which could be the one containing the main application. By specifying the path to this specific .jar when running reachability analysis, you can avoid multiple evaluations of the same .jar files, which would occur when:
1. .jar files are evaluated separately, and
2. .jar files are evaluated when invoked by the main application.
The includes parameter specifies a target path for the artifacts to be analyzed. It limits the scope of the analysis, resulting in better precision and reducing the utilization of system resources.
Example:
```
reachability: [
  javaAnalysis: [
    enable: true,
    entrypointStrategy: 'ACCESSIBLE_CONCRETE'
    includes: [
      [pattern: '**/target/my-project-*.jar']
    ]
  ]
]
```
If includes is omitted, the target location for the analysis will be the same as specified in the iqScanPatterns of the nexusPolicyEvaluation in the Jenkins file. This may increase the scope of the target analysis, leading to reduced precision.
Analysis Algorithm
These are the supported algorithms for Java reachability analysis:
- CHA (Class Hierarchy Analysis): A static call analysis that considers all methods in all possible loaded subclasses.
- RTA (Rapid Type Analysis): Similar to CHA, but improves precision by analyzing only classes instantiated during program execution.
- RTA_PLUS: Sonatype’s version of RTA, offering even greater precision and serving as the default algorithm.

Error Handling

By default, Jenkins will mark the pipeline as FAILURE if there are any error conditions in executing Reachability Analysis.

To avoid a pipeline FAILURE, set the value of the failOnError parameter to false (it is true by default). Jenkins will mark the pipeline as UNSTABLE.

reachability: [
  failOnError: false,
  logLevel: 'DEBUG',
  javaAnalysis: [
    enable: true,
    entrypointStrategy: 'ACCESSIBLE_CONCRETE'
  ]
]

Expected Outputs

Reachability Analysis yields different outputs for a wide range of scenarios. The outputs depend on the type of artifacts analyzed, strategies used, and fine-tuning using the performance-enhancing parameters described above.

Reachability Analysis output is logged within the IQ Policy Evaluation log and can be found at the stage where policy evaluation is called within your pipeline.

Here are descriptions to a few sample outputs:

Sample output 1: Policy violations labeled as Reachable

On successful execution of Reachability Analysis, the number of "Reachable" components found will be logged as:

2024-07-25 15:08:32 GMT-05:00  [INFO] CallflowReachableMethodsCommand - Found 2 reachable methods

To view the actual method signatures of reachable methods, the logLevel should be set to DEBUG. However, this may lead to a lot of logging text and make the logs unreadable.

The Application Report will show the policy violations for components (belonging to the Maven ecosystem) that contain vulnerable method signatures.

Example

Click on the policy violation to open the violation details view.

Sample output 2: Policy violations labeled as "Not Reachable"

This occurs when Reachability Analysis does not find any "Reachable" methods. This means that there are no vulnerable components in the execution path of the analyzed application.

This scenario will appear in the log as:

2024-07-25 15:39:21 GMT-05:00 [INFO] CallflowReachableMethodsCommand - Found 0 reachable methods

Sample output 3: Reachability Analysis analysis skipped

This occurs when there are no vulnerable components found during the policy evaluation. Reachability Analysis is skipped.

This is logged in the pipeline log as:

2024-07-25 15:38:03 GMT-05:00 [INFO] Skipping callflow analysis; missing vulnerable component method data

Warning

The absence of "Reachable" methods does not guarantee safety. The analysis may not have been able to detect these methods due to misconfiguration of the feature. We recommend checking these configurations thoroughly, at the start of the analysis.

Other Considerations While Running Reachability Analysis

Running in multi-module projects
For a project that has multiple modules and produces multiple artifacts, Reachability Analysis should be configured with one of the strategies described above, for accurate results.
If a project produces two different artifacts, for example, one .jar file for a client service and one .jar file for a server, each of these .jar files should be evaluated separately. We recommend setting up a separate pipeline in Jenkins, one that produces the client .jar and one that produces the server .jar. This way you can use the includes parameter to specify the artifact you want to analyze on each pipeline.
If a multi-module project has modules that are meant to be used as a library for other projects, using JAVA_MAIN will not produce any "Reachable" methods. This is because none of the modules will have the methods with the signature public static void main. In such cases, it is best to use ACCESSIBLE_CONCRETE or PUBLIC_CONCRETE in the strategies section above.

Execution Times
Reachability Analysis can be a time and memory intensive process, depending upon the size of the project that is being analyzed. The execution involves going through all entry points specified, creating a call graph, and process the code to detect vulnerable methods in the execution path. This could be a huge overhead if your project has millions of lines of code, lots of dependencies and entry points.
To reduce the execution times, here are some recommendations:
1. If your project has a releaseable main branch, run Reachability Analysis on the main branch instead of all the feature branches on each new commit.
2. If you have changes in the project manifest, run Reachability Analysis in the feature branches. This is a trade off between build time and the extra analysis step due to Reachability Analysis.
3. Run Reachability Analysis at fixed times, for example, on nightly builds.
4. If you have multi-module projects, separate the projects before running Reachability Analysis.

Using JavaScript Reachability Analysis

JavaScript Reachability Analysis is currently supported only by the Sonatype Platform Plugin for Jenkins integration, which also offers fine‑grained configuration for the feature.

Minimal Configuration

To add JavaScript Reachability Analysis to an existing pipeline, add a jsAnalysis clause to the reachability section, enable it, and specify the project source file (Ant‑style glob patterns are supported). These source file serve as the starting point for the analysis. All paths or patterns are relative to the workspace directory. Do not include any files from node_modules; those are project dependencies, not source files. We also recommend omitting file extensions in your patterns, since the analyzer recognizes common JavaScript and TypeScript extensions.

The example below shows using the Javascript Reachability Analysis with a minimal configuration:

reachability: [
  jsAnalysis: [
    enable: true,
    sourceFiles: [
      [pattern: 'src/**/*']
    ]
  ]
]

Optional Parameters

Node.js Executable
Reachability Analysis requires a Node.js executable (v16+) on the pipeline’s PATH. This could be achieved via the Jenkins NodeJS plugin, which allow multiple versions to be configured as global tools in Jenkins. If no Node executable is on the pipeline's PATH, an absolute path can be explicitly specified as below:
```
reachability: [
  jsAnalysis: [
    enable: true,
    node: [
      executable: '/path/to/node/exec' // absolute path expected; may include env. vars. e.g. "${env.WORKSPACE}/path/to/node"
    ],
    sourceFiles: [
      [pattern: 'src/**/*']
    ]
  ]
]
```
File Exclusion
If your project contains other JavaScript files (e.g. tests) that shouldn’t be included in Reachability Analysis, they can be excluded in the same way as sourceFiles, via an excludeFiles section:
```
reachability: [
  jsAnalysis: [
    enable: true,
    sourceFiles: [
      [pattern: 'src/**/*']
    ],
    excludeFiles: [
      [pattern: 'test/**/*']
    ]
  ]
]
```
Project Directory
By default, reachability analysis considers the workspace root where package.json lives as the project directory. If your source lives elsewhere, set projectDirectory relative to the workspace:
```
reachability: [
  jsAnalysis: [
    enable: true,
    projectDirectory: 'app/root',   // relative to the workspace directory
    sourceFiles: [
      [pattern: 'src/**/*']
    ]
  ]
]
```

Note

To enable Java and Javascript Reachability Analysis in the same evaluation step, populate both the javaAnalysis and jsAnalysis sections.