Specifying and Verifying Secrecy in Workflows with Arbitrarily Many Agents

Bernd Finkbeiner; Helmut Seidl; Christian Müller

doi:10.1007/978-3-319-46520-3_11

International Symposium on Automated Technology for Verification and Analysis

ATVA 2016: Automated Technology for Verification and Analysis pp 157-173


Conference paper

First Online: 22 September 2016

Part of the Lecture Notes in Computer Science book series (LNCS, volume 9938)

Abstract

Web-based workflow management systems, like EasyChair, HealthVault, Ebay, or Amazon, often deal with confidential information such as the identity of reviewers, health data, or credit card numbers. Because the number of participants in the workflow is in principle unbounded, it is difficult to describe the information flow policy of such systems in specification languages that are limited to a fixed number of agents. We introduce a first-order version of HyperLTL, which allows us to express information flow requirements in workflows with arbitrarily many agents. We present a bounded model checking technique that reduces the violation of the information flow policy to the satisfiability of a first-order formula. We furthermore identify conditions under which the resulting satisfiability problem is guaranteed to be decidable.


1 Introduction

Web-based workflow management systems allow diverse groups of users to collaborate efficiently on complex tasks. For example, conference management systems like EasyChair let authors, reviewers, and program committees collaborate on the organization of a scientific conference; health management systems like HealthVault let family members, doctors, and other health care providers collaborate on the management of a patient’s care; shopping sites like Amazon or Ebay let merchants, customers, as well as various other agents responsible for payment, customer service, and shipping, collaborate on the purchase and delivery of products.

Since the information maintained in such systems is often confidential, the workflows must carefully manage who has access to what information in a particular stage of the workflow. For example, in a conference management system, PC members must declare conflicts of interest, and they should only see reviews of papers where no conflict exists. Authors eventually get access to reviews of their papers, but only when the process has reached the official notification stage, and without identifying information about the reviewers.

It is difficult to characterize the legitimate information flow in such systems with standard notions of secrecy. Classic information flow policies are often too strong. For example, noninterference [12] requires that the PC member cannot observe any difference when classified input, such as the reviews of papers where the PC member has a conflict of interest, is removed. This strong requirement is typically violated, because another PC member might, for example, nondeterministically post a message in a discussion about a paper on which neither of them has a conflict. Weaker information flow policies, on the other hand, often turn out to be too weak. Nondeducibility [19], for example, only requires that an agent cannot deduce, i.e., conclusively determine, the classified information. The problem is that a piece of information is already considered nondeducible if, in the entire space of potential behaviors, there exists some other explanation. In reality, however, not all agents exhibit the full set of potentially possible behaviors, and an actual agent might be able to deduce far more than expected (cf. [15]).

Temporal logics for the specification of information flow [10] are an important step forward, because they make it possible to customize the secrecy properties. HyperLTL [7] is the linear-time representative of this class of logics. As an extension of linear-time temporal logic (LTL), HyperLTL can describe the precise circumstances under which a particular information flow policy must hold. Standard linear-time or branching-time logics, like LTL or CTL$^*$, can only reason about the observations on a single computation trace at a time, and thus cannot, by themselves, specify information flow. HyperLTL formulas, in contrast, use trace quantifiers and trace variables to refer to multiple traces simultaneously. For example, HyperLTL can directly express information flow properties like “for any pair of traces $\pi , \pi '$, if the low-security observer sees the same inputs on $\pi $ and $\pi '$, then the low-security observer must also see the same outputs on $\pi $ and $\pi '$”. The key limitation of HyperLTL for the specification of workflows is that it is a propositional logic. It is, hence, impossible to specify the information flow in workflows unless the number of agents is fixed a priori. In this paper, we overcome this limitation.
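One possible way to write the quoted two-trace property as a HyperLTL formula is sketched below. The sets $I$ and $O$ of low-observable input and output propositions are assumptions introduced here for illustration, as is the choice to require agreement globally (operator $\mathsf{G}$) on both sides; other formalizations of the same informal property exist.

```latex
\[
  \forall \pi .\ \forall \pi ' .\
  \mathsf{G} \Big( \bigwedge_{i \in I} (i_{\pi } \leftrightarrow i_{\pi '}) \Big)
  \;\rightarrow\;
  \mathsf{G} \Big( \bigwedge_{o \in O} (o_{\pi } \leftrightarrow o_{\pi '}) \Big)
\]
```

Here $a_{\pi }$ denotes the value of proposition $a$ on trace $\pi $. Because $I$ and $O$ are fixed finite sets of propositions, the formula cannot range over an unbounded set of agents, which is the limitation described above.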

We introduce a framework for the specification and verification of secrecy in workflows with arbitrarily many agents. Our framework consists of a workflow description language, a specification language, and a verification method. Our workflow description language gives a precise description of the behavior of workflow management systems with an arbitrary number of agents. Figure 1 shows a simple example workflow of a conference management system. The workflow manipulates several relations over the unbounded domain of agents, each of which characterizes a particular relationship between the agents: for example, a pair (x, p) in $\textit{Conf}$ indicates that PC member x has declared a conflict with paper p, and a triple (x, y, p) in $\textit{Comm}$ indicates that PC member x has received from PC member y a message about paper p. As a specification language for the information flow policies in such workflows, we introduce a first-order version of HyperLTL. We extend HyperLTL with first-order quantifiers, allowing the formulas to refer to an arbitrary number of agents. We show that the new logic can be used to specify precise assumptions on the behavior of the agents, such as causality: while a nondeterministic agent can take any action, the actions of a causal agent can only reveal information the agent has actually observed. Restricting the behaviors of the agents to the causal behavior allows us to quantify universally over the actions of the agents, as in classic notions of secrecy like noninterference, and, at the same time, eliminate the false positives of these notions. Finally, we introduce a verification method, which translates the verification problem of workflows with arbitrarily many agents and specifications in first-order HyperLTL to the satisfiability problem of first-order logic. While satisfiability of first-order logic is undecidable in general, we identify conditions under which the satisfiability problem for the particular formulas in the verification of the workflows is guaranteed to be decidable.

Fig. 1. Example workflow from a conference management system.

2 Workflows with Arbitrarily Many Agents

Symbolic Transition Systems. As the formal setting for the specification and verification of our workflows, we chose symbolic transition systems, where the states are defined as the valuations of a set of first-order predicates $\mathcal P$. The initial states and the transitions between states are described symbolically using an assertion logic over $\mathcal P$. For the purpose of describing workflows, we use first-order predicate logic (PL) with equality as the assertion language.

A symbolic transition system $\mathcal S=(\mathcal P, \varTheta , \varDelta )$ consists of a set of predicates $\mathcal P$, an initial condition $\varTheta $, and a transition relation $\varDelta $. The initial condition $\varTheta $ is given as a formula of the assertion language over the predicates $\mathcal P$. The transition relation $\varDelta (P_1,\ldots ,P_k; P_1', \ldots ,P_k')$ is given as a formula over the predicates $\mathcal P=\{P_1, \ldots , P_k\}$, which indicate the interpretation of the predicates in the present state, and the set of primed predicates $\mathcal P'=\{P_1',\ldots ,P_k'\}$, which indicate the interpretation of the predicates in the next state.

Let U be some arbitrary universe. In the case of the workflows, U is the set of agents participating in the workflow. Let $\mathcal P^n$ denote the set of predicates with arity n. A state $s: \bigcup _{n\ge 0}\mathcal P^n \times U^n \rightarrow \mathbb B$ is then an evaluation of the predicates over U. A trace is an infinite sequence of states $s_0,s_1,\dots $ such that (1) $s_0$ satisfies $\varTheta $ (initiation), and (2) for each $i\ge 0$, the transition relation $\varDelta $ is satisfied by the consecutive states $s_i$ and $s_{i+1}$, where the predicates in $\mathcal P$ are evaluated according to $s_i$ and the predicates in $\mathcal P'$ are evaluated according to $s_{i+1}$. We denote the set of all traces of a transition system $\mathcal S$ as $\textit{Traces}(\mathcal S)$.
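The definitions of state and trace can be made concrete with a small Python sketch. Everything in it is an illustrative assumption rather than part of the formal development: the two-element universe, the predicates `Req` and `Seen`, and the particular choices of $\varTheta $ and $\varDelta $ are hypothetical. Since a trace is infinite, the sketch only checks the initiation and consecution conditions on a finite prefix.

```python
# Hypothetical finite universe standing in for U (the paper allows any universe).
UNIVERSE = ["alice", "bob"]

def holds(state, pred, *args):
    """A state is modelled as the set of true facts (predicate name, argument tuple).
    Every fact not in the set is false."""
    return (pred, tuple(args)) in state

def theta(state):
    """Assumed initial condition: Seen(x) is false for every x."""
    return not any(holds(state, "Seen", x) for x in UNIVERSE)

def delta(cur, nxt):
    """Assumed transition relation: for all x, Seen'(x) <-> Seen(x) or Req(x).
    The primed predicate is read from `nxt`, the unprimed ones from `cur`.
    Req' is left unconstrained."""
    return all(
        holds(nxt, "Seen", x) == (holds(cur, "Seen", x) or holds(cur, "Req", x))
        for x in UNIVERSE
    )

def is_trace_prefix(states):
    """Check (1) initiation on the first state and (2) consecution on each
    pair of consecutive states of a finite, non-empty prefix."""
    if not states:
        return False
    return theta(states[0]) and all(
        delta(s, t) for s, t in zip(states, states[1:])
    )

# Example prefix: alice is requested first, then bob.
s0 = frozenset({("Req", ("alice",))})
s1 = frozenset({("Seen", ("alice",))})
s2 = frozenset({("Seen", ("alice",)), ("Req", ("bob",))})
s3 = frozenset({("Seen", ("alice",)), ("Seen", ("bob",))})
prefix_ok = is_trace_prefix([s0, s1, s2, s3])
```

Representing a state as a set of true facts corresponds to the Boolean-valued map $s$ restricted to a finite universe; the symbolic setting of the paper does not fix the universe in advance.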

The Workflow Language. We define a language to specify workflows. A workflow is structured into multiple blocks. Each block specifies the behavior of a group of agents. A block consists of several statements, each of which adds specific tuples to (or removes them from) a given relation, depending on a guard clause.
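The effect of a single guarded statement can be sketched in Python as follows. This is only a rough illustration under stated assumptions: it does not reproduce the concrete syntax or the full semantics of the workflow language, the relation `Assign` and the sets `PC_MEMBERS` and `PAPERS` are hypothetical, and the function `apply_statement` is a helper invented for this sketch. Only the relation $\textit{Conf}$ is taken from the text.

```python
from itertools import product

# Hypothetical participants; the workflow language itself does not bound them.
PC_MEMBERS = ["alice", "bob"]
PAPERS = ["p1"]
UNIVERSE = PC_MEMBERS + PAPERS

def apply_statement(state, relation, arity, guard, universe, add=True):
    """Return a new state in which every tuple over `universe` that satisfies
    `guard` (evaluated on the old state) is added to, or removed from, `relation`.
    A state maps relation names to frozensets of tuples."""
    target = set(state.get(relation, frozenset()))
    for tup in product(universe, repeat=arity):
        if guard(state, tup):
            if add:
                target.add(tup)
            else:
                target.discard(tup)
    new_state = dict(state)
    new_state[relation] = frozenset(target)
    return new_state

def no_conflict(state, tup):
    """Assumed guard: x is a PC member, p is a paper, and (x, p) is not in Conf."""
    x, p = tup
    return x in PC_MEMBERS and p in PAPERS and tup not in state["Conf"]

before = {"Conf": frozenset({("alice", "p1")}), "Assign": frozenset()}
# Add (x, p) to the hypothetical relation Assign wherever the guard holds.
after = apply_statement(before, "Assign", 2, no_conflict, UNIVERSE)
# Remove every pair from Assign again, using a guard that is always true.
cleared = apply_statement(after, "Assign", 2, lambda s, t: True, UNIVERSE, add=False)
```

Evaluating the guard on the old state and building a fresh state mirrors the present-state and next-state reading of the predicates in a symbolic transition system.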
