Control-guided Communication: Efficient Resource Arbitration and Allocation in Multi-hop Wireless Control Systems

Dominik Baumann¹*, Fabian Mager²*, Marco Zimmerling², and Sebastian Trimpe¹

Abstract: In future autonomous systems, wireless multi-hop communication is key to enable collaboration among distributed agents at low cost and high flexibility. When many agents need to transmit information over the same wireless network, communication becomes a shared and contested resource. Event-triggered and self-triggered control account for this by transmitting data only when needed, enabling significant energy savings. However, a solution that brings those benefits to multi-hop networks and can reallocate freed-up bandwidth to additional agents or data sources is still missing. To fill this gap, we propose control-guided communication, a novel co-design approach for distributed self-triggered control over wireless multi-hop networks. The control system informs the communication system of its transmission demands ahead of time, and the communication system allocates resources accordingly. Experiments on a cyber-physical testbed show that multiple cart-poles can be synchronized over wireless, while serving other traffic when resources are available, or saving energy. These experiments are the first to demonstrate and evaluate distributed self-triggered control over low-power multi-hop wireless networks at update rates of tens of milliseconds.

Index Terms: Wireless control systems, self-triggered control.

I. INTRODUCTION

The unparalleled flexibility and cost efficiency when closing feedback loops over wireless networks enables many cyber-physical applications. For instance, in a smart factory, plants are controlled via remote controllers, mobile robots interact with the plants, and distributed sensors provide additional measurements. Another example is drones regularly exchanging data to fly in formation.
These and other applications demand wireless multi-hop communication to cover large distances and fast update intervals of tens of milliseconds to keep up with the dynamics of the systems to be controlled [1].

Challenges. Fast feedback control over wireless multi-hop networks is challenging owing to the inherent imperfections of wireless networks, such as transmission delays and message loss. Moreover, the limited network bandwidth can lead to congestion when many agents need to communicate at the same time, and wireless radios draw considerable power, which is a major concern for embedded sensors and mobile devices that must be untethered and thus powered by batteries. For these reasons, adaptive schemes are needed where agents use the network only when necessary to save energy, and available resources are reallocated at run time to serve those in need.

* Equal contribution.
¹ Intelligent Control Systems Group, Max Planck Institute for Intelligent Systems, Stuttgart/Tübingen, Germany. Email: dbaumann@tuebingen.mpg.de, trimpe@is.mpg.de.
² Networked Embedded Systems Lab, TU Dresden, Dresden, Germany. Email: {fabian.mager, marco.zimmerling}@tu-dresden.de
This work was supported in part by the German Research Foundation within the Cluster of Excellence cfaed (grant EXC 1056), SPP 1914 (grants ZI 1635/1-1 and TR 1433/1-1), and the Emmy Noether project NextIoT (grant ZI 1635/2-1); the Cyber Valley Initiative; and the Max Planck Society.

Table I: Qualitative comparison of prior and our work on integrating STC with wireless communication, evaluated through real-world experiments.

Work | Fast update intervals | Multi-hop | Energy savings | Reallocation | Distributed implementation
[8]  | ✗ | ✗ | ✓ | ✓ | ✗
[9]  | ✗ | ✗ | ✓ | ✓ | ✗
[10] | ✗ | ✓ | ✓ | ✗ | ✗
[11] | ✓ | ✗ | ✓ | ✗ | ✗
[12] | ✓ | ✗ | ✓ | ✗ | ✗
This | ✓ | ✓ | ✓ | ✓ | ✓

To use the limited bandwidth and energy more efficiently, event-triggered control (ETC) and self-triggered control (STC) methods have been developed [2], [3].
Unlike periodic control, in ETC and STC the decision whether to communicate or not is based on events, such as an error exceeding a threshold. ETC instantaneously decides whether to communicate, leaving no time to save energy or reallocate bandwidth in case of a negative triggering decision. STC, instead, decides ahead of time about the next triggering instant. However, to utilize freed resources (e.g., to serve traffic from additional remote sensors), an integration of STC designs and wireless communication protocols is required. Moreover, such co-design approaches must be evaluated on real cyber-physical testbeds to establish trust in feedback control over wireless [4]. While a large body of work on STC exists (see [2], [3], [5]–[7] and the references therein), the integration of STC designs with wireless protocols including an experimental evaluation has rarely been considered. The few exceptions are listed in Table I and discussed next.

Prior work. Existing approaches integrating STC and wireless communication target remote control, for example, of a double-tank process [8], [9], a simulated load-positioning system [10], or a mobile robot [11]. Coordination in multi-robot systems has been studied in [12], but the control commands are computed by a central entity, so the implementation is not distributed. All works show that STC allows for solving the control task with less communication than periodic control, enabling significant energy savings. However, reallocation of freed resources has only been demonstrated in [8], [9], for single-hop networks and update intervals of a few seconds. In fact, STC over a wireless multi-hop network has only been shown in [10], with an update interval of 1 s.

Accepted final version. To appear in IEEE Control Systems Letters. ©2019 IEEE. Personal use of this material is permitted.
Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Figure 1: We consider multiple physical systems connected over a wireless multi-hop network. Each system is associated with a self trigger that computes at the current communication instant when it needs to communicate next. This information is piggybacked onto the message it sends. The network manager uses this information to compute a communication schedule respecting these demands and, if possible, reallocating bandwidth to additional data sources.

In summary, no solution exists that provides energy savings and reallocation of freed resources for the control of systems at fast update intervals over multi-hop networks. Moreover, no work has shown a distributed implementation of an STC law, where agents locally use information obtained over the network to solve a common control task. However, a complete solution is needed to enable novel applications, such as collaborative multi-robot swarms for future smart production systems.

Contribution. We present a co-design of control and communication for multi-hop wireless networks that fills this gap. Our approach arbitrates the available communication bandwidth among different types of traffic from any entity in the network, while simultaneously shutting down resources completely to save energy when neither the control system nor any other entity needs the full bandwidth.
We evaluate the approach on a three-hop cyber-physical testbed with multiple physical systems [13], demonstrating improved resource efficiency at high control performance for update intervals below 100 ms.

At the heart of our solution is the novel concept of control-guided communication: The control system informs the communication system at run time about its resource requirements, and the communication system leverages this information to dynamically allocate or shut down resources. Concretely, we consider the setup depicted in Fig. 1. Each agent uses STC to decide at the current communication instant when it will communicate next. The agent piggybacks the decision of its self trigger onto the messages it sends. The network manager uses this information as input when dynamically computing the communication schedule at run time. For example, when some agents do not need to communicate, their share of the bandwidth can be reallocated to serve other traffic (e.g., from remote sensors) or can be shut down to conserve energy. The concrete scheduling policy is an exchangeable component of our design and can be adapted to the application requirements.

In essence, we make the following two main contributions:
• We propose control-guided communication, a tight integration of STC and wireless multi-hop communication in which the control system informs the network at run time about future communication demands to enable both energy savings and reallocation of network bandwidth.
• Using experiments on a real cyber-physical testbed with five inverted pendulums, we are the first to demonstrate distributed STC over wireless multi-hop networks with update intervals below 100 ms, while showing energy savings of up to 87% compared to the periodic baseline.

II. PROBLEM SETTING

We consider N physical systems connected over a wireless multi-hop network, as shown in Fig. 1.
Each agent is modeled as a stochastic, linear, and time-invariant system

\[ x_i(k+1) = A_i x_i(k) + B_i u_i(k) + v_i(k), \tag{1} \]

with state $x_i(k) \in \mathbb{R}^n$, input $u_i(k) \in \mathbb{R}^m$, and $v_i(k) \in \mathbb{R}^n$ a Gaussian random variable with zero mean and variance $\Sigma_i$, capturing process noise. We assume each agent has a local controller that receives local observations directly, but also needs information from other agents for distributed control.

There are various methods to design distributed controllers (see, for example, [14]). In this work, we adopt an approach based on the linear quadratic regulator (LQR) [15]. Using augmented states $\tilde{x}(k) = (x_1(k), \ldots, x_N(k))^{\mathrm{T}}$ and inputs $\tilde{u}(k) = (u_1(k), \ldots, u_N(k))^{\mathrm{T}}$, we define the cost function

\[ J = \lim_{K \to \infty} \frac{1}{K}\, \mathbb{E}\Big[ \sum_{k=0}^{K-1} \tilde{x}^{\mathrm{T}}(k)\, Q\, \tilde{x}(k) + \tilde{u}^{\mathrm{T}}(k)\, R\, \tilde{u}(k) \Big], \tag{2} \]

with positive definite weight matrices $Q$ and $R$. The optimal stabilizing controller that minimizes (2) is of the form $u_i(k) = \sum_j F_{ij} x_j(k)$, where $F_{ij}$ denotes entry $(i, j)$ of the feedback matrix $F$. That is, to implement this controller, each agent needs information from all other agents, which is sent over the wireless multi-hop network.

To provide high-performance control while efficiently using limited network bandwidth and energy resources, the system must meet several requirements:
• For coordination, the agents need to exchange data; in particular, for optimal control according to (2), all agents need to communicate with one another (all-to-all).
• Wireless multi-hop communication must be reliable and fast to support feedback control of physical systems with fast dynamics; we target mechanical systems requiring update intervals on the order of tens of milliseconds [1].
• The network must arbitrate among multiple types of data traffic as determined by the communication schedule, while always giving highest priority to control traffic.
• If some fraction of the bandwidth is not allocated to any entity, this resource should be shut down to save energy.

III. CO-DESIGN APPROACH

The main goal of this paper is to facilitate high-performance distributed control across multi-hop wireless networks with highly adaptive resource arbitration and allocation to support multiple traffic types and save unused resources. Prior work failed to reach this goal because the many imperfections of wireless systems, such as time-varying end-to-end delays and limited throughput, complicate the control design and make it difficult to quickly coordinate the system-wide operation and resource usage based on the current control-traffic demands.

Figure 2: Time-triggered operation of the multi-hop low-power wireless protocol. Communication occurs in rounds with a constant period T. Each round consists of a schedule slot and up to K data slots. The schedule slot serves to inform all nodes of the number of subsequent data slots in the round and the allocation of control or other messages to the scheduled data slots.

To tackle this issue, we propose a novel co-design approach that integrates the control and communication systems in two ways. First, the design of the communication system tames network imperfections as much as possible, and the control system accounts for the emerging key properties and remaining imperfections. Second, during operation, the control system reasons about its future communication demands and informs the communication system accordingly. The communication system, on the other hand, adapts to these demands by arbitrating the available bandwidth among different types of traffic and by shutting down resources completely to save energy when neither the control system nor any other participant needs the full bandwidth.
We call this concept control-guided communication, which we detail in the following two sections.

In addition, our wireless communication system provides fast and reliable many-to-all communication among any set of agents, even when the agents are mobile and thereby causing the network topology to change continuously. This feature is a key difference to traditional wireless communication systems, such as WirelessHART, and makes our co-design approach directly applicable to solve various kinds of distributed control problems that may be stated in the form of a cost function (2).

IV. WIRELESS COMMUNICATION SYSTEM DESIGN

We first describe the design of the wireless communication system, and detail the control design based on the emerging properties in the next section. The wireless system builds on the periodic design in [16] and consists of three elements, where 2) is significantly modified and 3) is a new component:
1) a hardware platform enabling a predictable and efficient execution of all control tasks and message transfers;
2) a multi-hop wireless protocol that provides many-to-all communication with minimal, bounded end-to-end delay;
3) an online scheduler that dynamically assigns bandwidth to each agent based on its communication requirements.

Hardware platform. We use a dual-processor platform (DPP) where sensing, actuation, and control execute on an application processor (MSP432P401R, 32 bit, 48 MHz) and the wireless multi-hop protocol executes on a communication processor (CC430F5147, 16 bit, 13 MHz). The processors communicate through the Bolt interconnect [17], which provides bounded worst-case execution times for the bidirectional exchange of messages between both processors. In this way, control and communication can efficiently execute in parallel and never interfere with each other, providing timing predictability.

Multi-hop wireless protocol.
The communication processor of every DPP in the network runs a multi-hop protocol, whose design is inspired by a new breed of protocols that exploit synchronous transmission based flooding for highly reliable and efficient communication. As shown in Fig. 2, using our protocol, communication occurs in rounds of equal duration that repeat with a constant period T. Each round consists of a sequence of non-overlapping slots. In each slot, one node is allowed to initiate a Glossy flood [18] to send a message to all other nodes. Glossy achieves the theoretical minimum latency for flooding a message in a multi-hop network using half-duplex radios, and provides a reliability above 99.9% in real-world scenarios [18], [19]. In fact, Glossy's reliability can be pushed beyond 99.9999% by letting nodes transmit more often during a flood, and it time-synchronizes all nodes to within sub-microsecond accuracy at no additional cost [18].

Any node in the network can serve as the designated network manager that uses the first slot in a round to flood the schedule. The schedule informs all other nodes about the number of data slots in the round (up to K) and the allocation of nodes to these data slots. The transmitted messages carry, for example, high-priority control information from agents or lower-priority data from other nodes, such as measurements from a remote sensor or information about a node's health status (e.g., its battery's state of charge). When sending a message, a node also piggybacks information about its future communication demands; if the network manager does not receive a message, it assumes that the respective node needs to transmit in the next round. Based on all demands, the network manager computes the schedule for the next round after the last data slot.

Online scheduler.
To this end, the network manager maintains a list of unserved communication demands, and allocates up to K nodes to the data slots in the next round according to a scheduling policy. The scheduling policy can be adjusted to meet different application requirements. As an illustrative example, we design in this paper a new policy that aims to strike a balance between resource efficiency and accommodating lower-priority messages next to control traffic. Specifically, if there are free data slots after assigning all nodes with pending control messages in the next round, we allocate one of the free data slots to a node for sending some other message (sensor, status, etc.). The next node to send such a message is chosen in a round-robin fashion. Any other free slot is left empty. Since nodes turn their radios on only during allocated slots and keep them off otherwise, this example policy illustrates that our wireless communication system allows for both arbitrating bandwidth among different traffic types and not allocating resources at all to save energy, as demonstrated in Sec. VI.

Key properties. Our wireless system design provides highly reliable, efficient many-to-all communication and system-wide time synchronization, and it adapts at run time to the nodes' communication demands. Due to the time synchronization, we can schedule control and communication tasks such that the jitter on the update interval and end-to-end delay is less than ±50 µs, as formally and experimentally validated in [16].

V. SELF-TRIGGERED CONTROL DESIGN

We now detail the control design, first our approach to distributed control and then our self-triggered design.

A. Distributed Control

The wireless communication system provides a constant update interval T as the jitter is negligible for the considered scenarios. We thus set one discrete time step in (1) to T, and data that is sent over the network is delayed by one time step.
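The example scheduling policy from Sec. IV (control traffic first, then at most one slot for other traffic chosen round-robin, all remaining slots left empty) can be sketched in a few lines. This is only an illustration under stated assumptions, not the implementation running on the network manager: the function name `schedule_round`, the tuple-based slot representation, and the way unserved control demands are carried over are all hypothetical.

```python
from typing import List, Tuple


def schedule_round(
    control_nodes: List[int],
    other_nodes: List[int],
    rr_index: int,
    num_slots: int,
) -> Tuple[List[Tuple[int, str]], List[int], int]:
    """Allocate the data slots of the next round (hypothetical sketch).

    control_nodes: nodes that announced a control message for the next round
    other_nodes:   nodes with lower-priority traffic (sensor, status, ...)
    rr_index:      round-robin position into other_nodes
    num_slots:     K, the maximum number of data slots per round

    Returns (allocated slots, unserved control demands, new rr_index).
    Slots that are not returned stay empty, so all radios remain off.
    """
    # Control traffic always has highest priority.
    slots = [(node, "control") for node in control_nodes[:num_slots]]
    unserved = control_nodes[num_slots:]

    # If at least one data slot is still free, give exactly one of them
    # to the next lower-priority node in round-robin order.
    if len(slots) < num_slots and other_nodes:
        node = other_nodes[rr_index % len(other_nodes)]
        slots.append((node, "other"))
        rr_index = (rr_index + 1) % len(other_nodes)

    return slots, unserved, rr_index
```

With K = 5, two pending control messages, and three lower-priority nodes, this sketch fills three slots and leaves two empty; with five pending control messages, no slot is left for other traffic and the round-robin position does not advance.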
Moreover, the many-to-all communication scheme ensures that information can be received by all agents in the network. This greatly facilitates control design as essentially arbitrary information patterns can be implemented. For example, this allows for implementing a (centralized) optimal controller in a distributed fashion as we show in this paper. Given the high reliability of the wireless embedded system, we assume that data that are sent over the network are received by all agents.

As an example for distributed control, we consider synchronization of multiple agents through an LQR design as in (2). For ease of presentation, we outline the approach for the two-agent case, but it also extends to multiple agents as shown in Sec. VI. We choose the quadratic cost function

\[ J = \lim_{K \to \infty} \frac{1}{K}\, \mathbb{E}\Big[ \sum_{k=0}^{K-1} \sum_{i=1}^{2} \big( x_i^{\mathrm{T}}(k)\, Q_i\, x_i(k) + u_i^{\mathrm{T}}(k)\, R_i\, u_i(k) \big) + (x_1(k) - x_2(k))^{\mathrm{T}} Q_{\mathrm{sync}} (x_1(k) - x_2(k)) \Big], \tag{3} \]

that is, we penalize deviations between $x_1(k)$ and $x_2(k)$ through the positive definite weight matrix $Q_{\mathrm{sync}}$, as well as deviations from the equilibrium ($Q_i > 0$) and high control inputs ($R_i > 0$). Using augmented states as in (2), the term in the summation over $k$ becomes

\[ \tilde{x}^{\mathrm{T}}(k) \begin{pmatrix} Q_1 + Q_{\mathrm{sync}} & -Q_{\mathrm{sync}} \\ -Q_{\mathrm{sync}} & Q_2 + Q_{\mathrm{sync}} \end{pmatrix} \tilde{x}(k) + \tilde{u}^{\mathrm{T}}(k) \begin{pmatrix} R_1 & 0 \\ 0 & R_2 \end{pmatrix} \tilde{u}(k). \]

As discussed in Sec. II, solving the optimal control problem then leads to a feedback controller that has the form $u_1(k) = F_{11} x_1(k) + F_{12} x_2(k)$, that is, agent 1 needs information from agent 2. We account for this by letting agent 2 send $u_{12}(k) = F_{12} x_2(k)$ over the network. Thus, agent 1's control input consists of $u_{11}(k) = F_{11} x_1(k)$, which it can compute using its local observations, and $u_{12}(k)$, which it receives over the network.
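To make the block structure of the augmented state weight concrete, the following sketch assembles the matrix for two agents and checks numerically that the resulting quadratic form equals the per-agent state terms plus the synchronization penalty in (3). It uses plain Python with illustrative two-dimensional states; the helper names and the values of Q1, Q2, and the test states are made up for this example, and only the Q_sync pattern (first diagonal entry 20, all other entries zero) is taken from Sec. VI.

```python
def augmented_state_weight(q1, q2, q_sync):
    """Return [[Q1 + Qs, -Qs], [-Qs, Q2 + Qs]] as one 2n x 2n nested list."""
    n = len(q_sync)
    top = [
        [q1[r][c] + q_sync[r][c] for c in range(n)]
        + [-q_sync[r][c] for c in range(n)]
        for r in range(n)
    ]
    bottom = [
        [-q_sync[r][c] for c in range(n)]
        + [q2[r][c] + q_sync[r][c] for c in range(n)]
        for r in range(n)
    ]
    return top + bottom


def quad_form(x, q):
    """Compute the scalar x^T Q x for a vector x and a square matrix Q."""
    size = len(x)
    return sum(x[r] * q[r][c] * x[c] for r in range(size) for c in range(size))


# Illustrative values (assumptions), except for the Q_sync pattern.
Q1 = [[1.0, 0.0], [0.0, 1.0]]
Q2 = [[1.0, 0.0], [0.0, 1.0]]
Q_SYNC = [[20.0, 0.0], [0.0, 0.0]]
x1 = [0.1, 0.0]
x2 = [-0.1, 0.2]

# Left-hand side: quadratic form in the augmented state (x1, x2).
augmented_cost = quad_form(x1 + x2, augmented_state_weight(Q1, Q2, Q_SYNC))

# Right-hand side: per-agent terms plus the synchronization penalty.
diff = [a - b for a, b in zip(x1, x2)]
separate_cost = quad_form(x1, Q1) + quad_form(x2, Q2) + quad_form(diff, Q_SYNC)

assert abs(augmented_cost - separate_cost) < 1e-12
```

The off-diagonal blocks with negative sign are what couple the two agents: they come from expanding the cross terms of the synchronization penalty, and they are the reason the optimal feedback for agent 1 contains a term in the state of agent 2.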
We can thus define the closed-loop matrix $\tilde{A}_1 = A_1 + B_1 F_{11}$, and (1) then reads as follows:

\[ x_1(k+1) = \tilde{A}_1 x_1(k) + B_1 u_{12}(k) + v_1(k). \tag{4} \]

B. Self-triggered Approach

Different STC designs have been proposed and are conceivable to realize control-guided communication. We use a design that exploits ideas from previous work on state estimation [20]. Instead of sending states as in [20], we consider the communication of control inputs. Specifically, rather than sending its entire state, agent 2 only sends the input $u_{12}(k)$ that is needed by agent 1. In case of no communication, agent 1 keeps applying $u_{12}(k_\ell)$, where $k_\ell$ is the last time step at which the input $u_{12}(k)$ was sent. We trigger communication based on the error $e_{12}(k) := u_{12}(k) - u_{12}(k_\ell)$ as follows:

\[ \gamma_2(k) = 1 \iff (e_{12}(k))^{\mathrm{T}} e_{12}(k) > \delta. \tag{5} \]

Here, $\gamma_2(k)$ is a binary variable, denoting whether agent 2 communicates $u_{12}(k)$ ($\gamma_2(k) = 1$) or not ($\gamma_2(k) = 0$), while $\delta$ defines the designer's trade-off between saving communication (large $\delta$) and keeping the error to a minimum (small $\delta$).

If we directly implement (5), agent 2 instantaneously decides on whether to transmit $u_{12}(k)$ to agent 1. In case of a negative triggering decision, there is no possibility to reallocate bandwidth and hence freed resources remain unused. To overcome this problem, we use a self-triggered strategy. Whenever an agent communicates, it already decides when to communicate next. To this end, we predict the evolution of the error and look for the smallest $M > 1$ such that

\[ \mathbb{E}\big[ (e_{12}(k+M))^{\mathrm{T}} e_{12}(k+M) \,\big|\, \mathcal{D}_2(k) \big] > \delta \tag{6} \]

and set $\gamma_2(k+M-1) = 1$. Here, $\mathcal{D}_2(k)$ describes the data agent 2 collected until time step $k$, that is, its local states $x_2$ and the inputs $u_2$ and $u_{12}$ that it has applied and sent so far, respectively.
The rationale behind this triggering rule is as follows: Information that is sent over the network is delayed by one discrete time step. The inequality in (6) tells us that the error exceeds, in expectation, the threshold $\delta$ in $M$ time steps. We thus seek to communicate next in $M - 1$ time steps such that the new input arrives in $M$ time steps, which is exactly when we expect the error to exceed the threshold.

The exact computation of (6) is complicated by the fact that the input $u_{21}(k)$ is not available at all times at agent 2. To derive the triggering law, we assume $u_{21}(k)$ is known and then comment on how we approximate it to yield a tractable implementation. Based on this, we get the error distribution

\[ f(e_{12}(k+M) \,|\, \mathcal{D}_2(k)) = \mathcal{N}\big( \hat{e}_{12}(k+M|k),\, P_2(k+M|k) \big), \]

with mean $\hat{e}_{12}$ and variance $P_2$ given as

\[ \hat{e}_{12}(k+M|k) = F_{12}\Big( \tilde{A}_2^{M} x_2(k) + \sum_{i=0}^{M} \tilde{A}_2^{M-i} B_2 u_{21}(k+i) \Big) - u_{12}(k) \tag{7a} \]
\[ P_2(k+1|k) = F_{12}^{\mathrm{T}} \big( \tilde{A}_2^{\mathrm{T}} P_2(k|k) \tilde{A}_2 + \Sigma_2 \big) F_{12}. \tag{7b} \]

Equations (7) are standard open-loop state and covariance predictions of the system in (4), so the derivations follow from Kalman filter theory [21, p. 111]. Given this error distribution, we can now, using $\mathbb{E}[e^{\mathrm{T}} e] = \|\mathbb{E}[e]\|^2 + \mathrm{Tr}(\mathrm{Var}[e])$, solve for the triggering rule (5): At every communication instant, find the smallest $M > 1$ such that

\[ \Big\| F_{12}\Big( \tilde{A}_2^{M} x_2(k) + \sum_{i=0}^{M} \tilde{A}_2^{M-i} B_2 u_{21}(k+i) \Big) - u_{12}(k) \Big\|^2 + \mathrm{Tr}\Big( F_{12}^{\mathrm{T}} \big( \tilde{A}_2^{\mathrm{T}} P_2(k+M|k) \tilde{A}_2 + \Sigma_2 \big) F_{12} \Big) > \delta, \tag{8} \]

with Tr the trace of a matrix.

So far, we assumed that agent 2 has knowledge about the future development of $u_{21}(k+i)$, which does not hold in practice. Because agent 2 has no information about the current state of agent 1 and hence cannot infer the future development of $u_{21}(k+i)$, it approximates $u_{21}(k+i)$ as $u_{21}(k+i) = u_{21}(k)\ \forall i \in [0, M)$.
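The search for the next triggering instant can be illustrated for a scalar system. The sketch below is a simplification under several assumptions that are not taken from the paper: all quantities are scalars, agent 2 knows its own state exactly at time k (so the prediction starts with zero variance), the mean and variance are propagated step by step with the textbook open-loop recursion rather than by transcribing the sums in (7a) and (8), and the search is capped at a hypothetical `max_horizon`. The remote input is held constant over the horizon, as in the approximation above. The function and parameter names are illustrative.

```python
def next_trigger_horizon(x2, u21_hold, a_cl, b, f12, noise_var, delta,
                         max_horizon=50):
    """Return the smallest M > 1 whose expected squared error exceeds delta.

    Scalar sketch of the self-trigger of agent 2:
      x2        current local state of agent 2
      u21_hold  last input received from agent 1, held constant
      a_cl, b   closed-loop dynamics x2(k+1) = a_cl*x2(k) + b*u21(k) + v2(k)
      f12       feedback gain, so that u12(k) = f12 * x2(k)
      noise_var variance of the process noise v2
      delta     triggering threshold
    If the threshold is not reached within max_horizon steps, max_horizon
    is returned (an assumption of this sketch, not part of the design).
    """
    u12_sent = f12 * x2       # value transmitted now and held by agent 1
    x_pred, p_pred = x2, 0.0  # state assumed to be known exactly at time k

    for m in range(1, max_horizon + 1):
        # Open-loop mean and variance prediction, one step at a time.
        x_pred = a_cl * x_pred + b * u21_hold
        p_pred = a_cl * p_pred * a_cl + noise_var

        # E[e^2] = (E[e])^2 + Var[e] for the error e = u12(k+m) - u12(k).
        mean_err = f12 * x_pred - u12_sent
        expected_sq_err = mean_err ** 2 + f12 * p_pred * f12

        if m > 1 and expected_sq_err > delta:
            return m
    return max_horizon


# The agent would then announce its next transmission for time step k + M - 1,
# so that the new input arrives at k + M after the one-step network delay.
M = next_trigger_horizon(x2=1.0, u21_hold=0.0, a_cl=0.9, b=0.0,
                         f12=1.0, noise_var=0.0, delta=0.1)
```

In this sketch a larger threshold or a smaller noise variance pushes the returned horizon further out, which mirrors the trade-off between communication savings and error that the threshold expresses.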
With this, the input $u_{21}(k+i)$ in (7a) and (8) effectively becomes a constant.

We note that one way to let agent 1 reason about agent 2's state would be to send the entire state $x_2(k)$ instead of the control input $u_{12}(k)$. Agent 1 could use this state to compute $u_{12}(k)$ and to predict the evolution of agent 2's state. This, however, incurs higher communication demands at each instant as the state is typically of higher dimension than the input.

Figure 3: Cyber-physical testbed with 15 wireless DPP nodes and five cart-pole systems (A and B are real systems; C, D, and E are simulated systems). The network has a diameter of three hops. Node 10 is the network manager.

VI. EXPERIMENTAL EVALUATION

We evaluate our approach using experiments on a real cyber-physical testbed [13], [16] shown in Fig. 3. It consists of 15 wireless DPP nodes and five cart-pole systems (or pendulums), where A and B are real systems and C, D, and E are simulated systems. The nodes are distributed in an office space of about 15 m by 20 m, and transmit at −6 dBm and 250 kbit/s in the 868 MHz band, forming a three-hop wireless network.

A. Scenario and Metrics

Scenario. The control task of each pendulum is to locally stabilize itself and to synchronize its cart position with all others. Since each system has access to its local state $x_i(k)$, we can run the local feedback loop at a faster update interval than communication over the network occurs. Here, we choose an update interval of 10 ms for the local loop. Control inputs $u_{ij}(k)$ of the other agents are communicated over the wireless multi-hop network, where the exchange of all control inputs takes 50 ms (i.e., one communication round with up to K = 5 data slots and 4 byte per agent). We use the scheduling policy outlined in Sec. IV.
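The interplay between the 10 ms local loop and the 50 ms communication rounds can be pictured as a zero-order hold on the remote inputs: the local term is recomputed in every local step, while each remote contribution keeps its last received value until a new message arrives. The sketch below illustrates this for scalar quantities. The class name, its methods, and the initial held value of zero are assumptions made for illustration and are not taken from the testbed software.

```python
LOCAL_PERIOD_MS = 10   # local feedback loop (from the scenario description)
ROUND_PERIOD_MS = 50   # one communication round (from the scenario description)
LOCAL_STEPS_PER_ROUND = ROUND_PERIOD_MS // LOCAL_PERIOD_MS


class RemoteInputHold:
    """Scalar sketch: u_i = F_ii * x_i + sum of held remote inputs u_ij."""

    def __init__(self, f_local, neighbors):
        self.f_local = f_local
        # Assumption: before the first message arrives, remote inputs are 0.
        self.held = {j: 0.0 for j in neighbors}

    def on_round(self, received):
        """Update held inputs with the messages received in this round.

        Agents that did not transmit are simply missing from `received`,
        so their previously sent input keeps being applied.
        """
        self.held.update(received)

    def local_input(self, x_local):
        """Control input for one 10 ms step of the local loop."""
        return self.f_local * x_local + sum(self.held.values())


agent = RemoteInputHold(f_local=-2.0, neighbors=["B", "C"])
agent.on_round({"B": 0.5})          # only B transmitted in this round
inputs = [agent.local_input(1.0) for _ in range(LOCAL_STEPS_PER_ROUND)]
```

Between two rounds, the five local steps in this example all reuse the same remote contribution, while the local term would change with the measured state in a real loop.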
To challenge the synchronization of the cart positions, we apply a sine distortion signal (3.6 s period with an amplitude of ±5 V) to the control input of pendulum B.

The controllers are designed as described in Sec. V. We use the same model for the cart-pole system as in [16] and also adopt the $Q_i$ matrices used for periodic synchronization. For $Q_{\mathrm{sync}}$, we set the first diagonal entry to 20 and all other entries to zero to express our desire to synchronize the cart positions. Further, we choose $R_i = 0.01$ for all systems.

Metrics. Our evaluation uses the following metrics:
• root mean square of the synchronization error (RMSE) computed based on the cart positions of all pendulums in an experiment as a measure of control performance;
• utilization of the available data slots during each round, broken down into free slots (radio off), slots used for control traffic, and slots used for additional (other) traffic;
• radio duty cycle, the fraction of time a node has its radio on, which is a widely used metric in the low-power wireless networking literature (see, e.g., [19], [22]) for quantifying communication energy cost.

In the following, we first illustrate the run time operation of our co-designed wireless control system in a real experiment, and then evaluate the trade-off among control performance, communication energy cost, and serving additional traffic as a function of the triggering threshold.

B. Efficient Resource Arbitration and Allocation

Fig. 4 shows a real trace of the control performance (top) and the slot utilization in each communication round (bottom) over time for a triggering threshold of $\delta = 0.03$. Looking at the utilization, we see that, on average, less than one third of the available bandwidth is needed for control traffic. Our co-design approach effectively uses the freed bandwidth to schedule additional traffic (here at most one slot per round according to the example scheduling policy from Sec.
IV) and to shut down the remaining bandwidth completely. During the many free slots all nodes have their radios turned off, which saves significant amounts of energy. Due to the sine distortion signal, the RMSE at the top exhibits a similar shape.

C. Control Performance vs. Efficiency vs. Flexibility

The triggering threshold $\delta$ allows a user to trade control performance for communication energy efficiency and flexibility in serving other traffic. To evaluate this trade-off, we consider six different thresholds and perform for each threshold three 2-minute experiments. In addition, we perform experiments with $\delta = 0$ to obtain results for periodic control, where all agents communicate in every time step requiring all bandwidth for control traffic. For each threshold, we report the median and 25th/75th percentiles across the three experiments.

Fig. 5 shows RMSE, radio duty cycle for control traffic, and fraction of bandwidth available for other traffic against the fraction of bandwidth used for control traffic. We use this intuitive unit for the x-axis instead of the triggering threshold $\delta$ because our measurements reveal that each $\delta$ corresponds to a certain fraction of bandwidth used for control traffic with negligible variance across experiments with the same $\delta$.

Looking at Fig. 5, we observe that the more bandwidth is used for control traffic, the better the control performance and the less bandwidth is available for other traffic. As expected, higher bandwidth demands result in a higher radio duty cycle. Using 25% of the available bandwidth for control traffic, the control performance is still comparable to the periodic baseline. Further bandwidth reductions lead to a noticeable decrease in control performance compared with the periodic baseline of up to 22% when only 11% of the available bandwidth is used for control traffic.
At the same time, up to 87 % of communication energy can be saved, while the vast majority of the bandwidth is available for other traffic. Overall, these experimental results demonstrate that our control-guided communication approach makes it possible to exploit this trade-off to meet a wide range of requirements of emerging cyber-physical applications.

Figure 4: Control performance and bandwidth utilization over time, recorded during one of our experiments. (Top panel: RMSE [cm] over time [s]. Bottom panel: usage of slots 1 to 5 over time [s], marked as control traffic, other traffic, or free slot.) The scheduling policy described in Sec. IV is used, but applications can also specify any other policy. Each vertical line in the lower figure represents a communication round. The control traffic demands vary over time between 0 and 4 slots. One slot is always used for other traffic, and the remaining free slots are shut down to save communication energy.

Figure 5: Trade-off between control performance, communication energy efficiency, and flexibility in serving other traffic for different fractions of control traffic, reported in terms of the median and 25th/75th percentiles. (Panels, each plotted against the fraction of bandwidth used for control traffic for self-triggered and periodic control: (a) Control performance, RMSE [cm]; (b) Radio duty cycle for control traffic [%]; (c) Fraction of bandwidth available for other traffic.) Control performance decreases when less bandwidth is used for control traffic. Conversely, freed resources that are not needed for control traffic result in considerable communication energy savings or can be used to serve other traffic (e.g., status, sensors).

VII. CONCLUSIONS

We have demonstrated for the first time distributed, self-triggered control over wireless multi-hop networks with energy savings and reallocation of resources at fast update intervals of tens of milliseconds. At the heart of our solution is control-guided communication, a new co-design approach where the control system predicts future resource demands and informs the communication system about them. Using this information, bandwidth and energy are either saved or used efficiently for different kinds of traffic. Experiments on a real cyber-physical testbed show the effectiveness of our approach. In future work, we will address a variety of theoretical questions, for example, regarding the closed-loop stability of the overall system, especially in the presence of message loss.

ACKNOWLEDGEMENTS

We thank Harsoveet Singh and Felix Grimminger for help with the testbed, and the TEC group at ETH Zurich for making the design of the DPP platform available to the public.
