Skip to content

Shortcuts to Explore the Graph

When analyzing software, there are some information which are interesting to explore. To facilitate accessing the information even without in-depth knowledge about the graph and the graph model, we provide a number of shortcuts which can be used on all nodes to find the nodes you're looking for.

All you have to do to use this functionality is to add the import de.fraunhofer.aisec.cpg.graph.*.

AST subtree traversal

It is often useful to find nodes which are in the AST subtree of another node. We provide the following shortcuts to gain a quick overview of relevant types of nodes:

Starting from node n...

  • ...get all function/method calls with n.calls
  • ...get all member calls (i.e., calls which are called on an object or class) with n.mcalls
  • ...get all method declarations with n.methods
  • ...get all function (and method) declarations with n.functions
  • ...get all field declarations with n.fields
  • ...get all parameters with n.parameters
  • ...get all record declarations (e.g. classes, structs) with n.records
  • ...get all namespaces with n.namespaces
  • ...get all variables with n.variables
  • ...get all literals with n.literals
  • ...get all references to variables, fields, functions, etc. with n.refs
  • ...get all assignments with n.assignments

Filtering the results

The lists you get here can be quite long, and it's a good idea to filter them. To do so, we provide different operators:

  • To retrieve a single element, you can use the [] (get) operator and specify your criterion inside the brackets.
  • To retrieve a single element and get an exception if there are multiple options, add the SearchModifiers.UNIQUE to the query.
  • To retrieve a list of nodes, you can use the () (invokes) operator to specify your criterion.

Both notations allow you to quickly filter for the name by providing the respective string or by accessing the fields and writing conditions on them.

Examples:

import de.fraunhofer.aisec.cpg.graph.*

// returns the first variable in the graph which has the name "a"
var a = result.variables["a"]

// returns the only variable with the name "a" or an exception otherwise
var theOnlyA = result.variables["a", SearchModifiers.UNIQUE]

// returns the first variable in the graph which does have an initializer
var anyWithInitializer = result.variables[{ it.initializer != null }]

// returns the only variable in the graph which does not have an initializer or throws an exception
var uniqueWithInitializer = result.variables[{ it.initializer != null }, SearchModifiers.UNIQUE]

// returns a list of all VariableDeclarations in the graph with the name "a"
var aList = result.variables("a")

// returns a list of FunctionDeclarations that have no parameter
var noArgs = result.functions { it.parameters.isEmpty() }

More information needed?

In some cases, the AST-based traversals won't suffice to filter the nodes that you're interested in. For this reason, there are a number of additional methods which search for other patterns in the graph. Note that these are often less stable than the information from above!

  • The size of an array is evaluated using SubscriptExpression.arraySize. Unfortunately, this only works if the size is given in the initialization. Updates are not considered.
  • Control dependencies are currently available via the extensions Node.controlledBy() and IfStatement.controls().
  • Node.eogDistanceTo(to: Node) calculates the number of EOG edges between this node and to.
  • FunctionDeclaration.get(n: Int): Returns the n-th statement of the body of this function.
  • FunctionDeclaration.callees: Returns the functions which are called from this function.
  • TranslationResult.callersOf(function: FunctionDeclaration) determines which functions call the specified function.
  • The methods
    Node.followEOGEdgesUntilHit(
        collectFailedPaths: Boolean,
        findAllPossiblePaths: Boolean,
        direction: AnalysisDirection,
        vararg sensitivities: AnalysisSensitivity,
        scope: AnalysisScope,
        earlyTermination: (Node, Context) -> Boolean,
        predicate: (Node) -> Boolean
    ): FulfilledAndFailedPaths
    
    and
    Node.followDFGEdgesUntilHit(
        collectFailedPaths: Boolean,
        findAllPossiblePaths: Boolean,
        direction: AnalysisDirection,
        vararg sensitivities: AnalysisSensitivity,
        scope: AnalysisScope,
        earlyTermination: (Node, Context) -> Boolean,
        predicate: (Node) -> Boolean
    ): FulfilledAndFailedPaths
    
    enable you a fine-grained configuration of how to collect EOG or DFG paths between the node and the first node matching the predicate unless there is a node matching the earlyTermination criterion on the path.
  • The scope allows to choose between an intraprocedural and interprocedural analysis and to configure the number of steps or depth in the call chain to follow.
  • The methods can further be configured to collect failed paths (to see path exists which does not fulfill a flow requirement) or to identify all possible paths reaching the predicate.
  • By default, they are configured to run a Forward analysis, but you can equally opt for a Backward analysis by setting the direction accordingly.
  • The sensitivities allow you to filter which edges should be followed. These are partially based on typical sensitivities for dataflow analysis(ContextSensitive, FieldSensitive), but actually allow a wider range of filters, e.g. to only follow full DFG edges, reachable EOG paths, or even configure the whole analysis system to follow implicit dataflows by following the program dependence graph.
  • They return all failed and all fulfilled paths. This allows reasoning more precisely about the program's behavior.
  • The methods Node.followPrevCDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean), Node.followNextCDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean), Node.followPrevPDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean), Node.followNextPDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean), collect the CDG/PDG path between the node and the first node matching the predicate. The methods can be configured to collect failed paths (to see if a path exists which does not fulfill a requirement), to identify all possible paths reaching a predicate and partially follow calls as well. They return all failed and all fulfilled paths. This allows reasoning more precisely about the program's behavior.
  • If you're interested in all nodes reachable via one of the sub-graphs from a certain node, the methods Node.collectAllPrevFullDFGPaths(), Node.collectAllNextFullDFGPaths(), Node.collectAllPrevEOGPaths(), Node.collectAllNextEOGPaths(), Node.collectAllPrevCDGPaths(interproceduralAnalysis: Boolean), Node.collectAllNextCDGPaths(interproceduralAnalysis: Boolean), Node.collectAllPrevPDGPaths(interproceduralAnalysis: Boolean), Node.collectAllNextPDGPaths(interproceduralAnalysis: Boolean) can be used.