Shortcuts to Explore the Graph¶
When analyzing software, there are some information which are interesting to explore. To facilitate accessing the information even without in-depth knowledge about the graph and the graph model, we provide a number of shortcuts which can be used on all nodes to find the nodes you're looking for.
All you have to do to use this functionality is to add the import de.fraunhofer.aisec.cpg.graph.*
.
AST subtree traversal¶
It is often useful to find nodes which are in the AST subtree of another node. We provide the following shortcuts to gain a quick overview of relevant types of nodes:
Starting from node n
...
- ...get all function/method calls with
n.calls
- ...get all member calls (i.e., calls which are called on an object or class) with
n.mcalls
- ...get all method declarations with
n.methods
- ...get all function (and method) declarations with
n.functions
- ...get all field declarations with
n.fields
- ...get all parameters with
n.parameters
- ...get all record declarations (e.g. classes, structs) with
n.records
- ...get all namespaces with
n.namespaces
- ...get all variables with
n.variables
- ...get all literals with
n.literals
- ...get all references to variables, fields, functions, etc. with
n.refs
- ...get all assignments with
n.assignments
Filtering the results¶
The lists you get here can be quite long, and it's a good idea to filter them. To do so, we provide different operators:
- To retrieve a single element, you can use the
[]
(get) operator and specify your criterion inside the brackets. - To retrieve a single element and get an exception if there are multiple options, add the
SearchModifiers.UNIQUE
to the query. - To retrieve a list of nodes, you can use the
()
(invokes) operator to specify your criterion.
Both notations allow you to quickly filter for the name by providing the respective string or by accessing the fields and writing conditions on them.
Examples:
import de.fraunhofer.aisec.cpg.graph.*
// returns the first variable in the graph which has the name "a"
var a = result.variables["a"]
// returns the only variable with the name "a" or an exception otherwise
var theOnlyA = result.variables["a", SearchModifiers.UNIQUE]
// returns the first variable in the graph which does have an initializer
var anyWithInitializer = result.variables[{ it.initializer != null }]
// returns the only variable in the graph which does not have an initializer or throws an exception
var uniqueWithInitializer = result.variables[{ it.initializer != null }, SearchModifiers.UNIQUE]
// returns a list of all VariableDeclarations in the graph with the name "a"
var aList = result.variables("a")
// returns a list of FunctionDeclarations that have no parameter
var noArgs = result.functions { it.parameters.isEmpty() }
More information needed?¶
In some cases, the AST-based traversals won't suffice to filter the nodes that you're interested in. For this reason, there are a number of additional methods which search for other patterns in the graph. Note that these are often less stable than the information from above!
- The size of an array is evaluated using
SubscriptExpression.arraySize
. Unfortunately, this only works if the size is given in the initialization. Updates are not considered. - Control dependencies are currently available via the extensions
Node.controlledBy()
andIfStatement.controls()
. Node.eogDistanceTo(to: Node)
calculates the number of EOG edges between this node andto
.FunctionDeclaration.get(n: Int)
: Returns the n-th statement of the body of this function.FunctionDeclaration.callees
: Returns the functions which are called from this function.TranslationResult.callersOf(function: FunctionDeclaration)
determines which functions call the specified function.- The methods and
Node.followEOGEdgesUntilHit( collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, direction: AnalysisDirection, vararg sensitivities: AnalysisSensitivity, scope: AnalysisScope, earlyTermination: (Node, Context) -> Boolean, predicate: (Node) -> Boolean ): FulfilledAndFailedPaths
enable you a fine-grained configuration of how to collect EOG or DFG paths between the node and the first node matching theNode.followDFGEdgesUntilHit( collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, direction: AnalysisDirection, vararg sensitivities: AnalysisSensitivity, scope: AnalysisScope, earlyTermination: (Node, Context) -> Boolean, predicate: (Node) -> Boolean ): FulfilledAndFailedPaths
predicate
unless there is a node matching theearlyTermination
criterion on the path. - The
scope
allows to choose between an intraprocedural and interprocedural analysis and to configure the number of steps or depth in the call chain to follow. - The methods can further be configured to collect failed paths (to see path exists which does not fulfill a flow requirement) or to identify all possible paths reaching the predicate.
- By default, they are configured to run a
Forward
analysis, but you can equally opt for aBackward
analysis by setting thedirection
accordingly. - The
sensitivities
allow you to filter which edges should be followed. These are partially based on typical sensitivities for dataflow analysis(ContextSensitive
,FieldSensitive
), but actually allow a wider range of filters, e.g. to only follow full DFG edges, reachable EOG paths, or even configure the whole analysis system to follow implicit dataflows by following the program dependence graph. - They return all failed and all fulfilled paths. This allows reasoning more precisely about the program's behavior.
- The methods
Node.followPrevCDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean)
,Node.followNextCDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean)
,Node.followPrevPDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean)
,Node.followNextPDGEdgesUntilHit(collectFailedPaths: Boolean, findAllPossiblePaths: Boolean, interproceduralAnalysis: Boolean, predicate: (Node) -> Boolean)
, collect the CDG/PDG path between the node and the first node matching the predicate. The methods can be configured to collect failed paths (to see if a path exists which does not fulfill a requirement), to identify all possible paths reaching a predicate and partially follow calls as well. They return all failed and all fulfilled paths. This allows reasoning more precisely about the program's behavior. - If you're interested in all nodes reachable via one of the sub-graphs from a certain node, the methods
Node.collectAllPrevFullDFGPaths()
,Node.collectAllNextFullDFGPaths()
,Node.collectAllPrevEOGPaths()
,Node.collectAllNextEOGPaths()
,Node.collectAllPrevCDGPaths(interproceduralAnalysis: Boolean)
,Node.collectAllNextCDGPaths(interproceduralAnalysis: Boolean)
,Node.collectAllPrevPDGPaths(interproceduralAnalysis: Boolean)
,Node.collectAllNextPDGPaths(interproceduralAnalysis: Boolean)
can be used.