Robust Hierarchical MPC for Handling Long Horizon Demand Forecast Uncertainty with Application to Automotive Thermal Management

This paper presents a robust hierarchical MPC (H-MPC) for dynamic systems with slow states subject to demand forecast uncertainty. The H-MPC has two layers: (i) the scheduling MPC at the upper layer with a relatively long prediction/planning horizon …

Authors: Mohammad Reza Amini, Ilya Kolmanovsky, Jing Sun

Robust Hierarchical MPC for Handling Long Horizon Demand Forecast   Uncertainty with Application to Automotive Thermal Management
58th Conference on Decision and Control (CDC) Decemb er 11–13, 2019, Nice, F rance Robust Hierarchical MPC f or Handling Long Horizon Demand For ecast Uncertainty with Application to A utomoti ve Thermal Management ∗ Mohammad Reza Amini 1 , Ilya K olmano vsky 2 , and Jing Sun 1 Abstract — This paper presents a robust hierarchical MPC (H-MPC) for dynamic systems with slo w states subject to demand f orecast uncertainty . The H-MPC has tw o layers: (i) the scheduling MPC at the upper layer with a relati vely long predic- tion/planning horizon and slow update rate, and (ii) the piloting MPC at the lower layer over a shorter prediction horizon with a faster update rate. The scheduling layer MPC calculates the optimal slow states, which will be tracked by the piloting MPC, while enf orcing the system constraints according to a long-range and approximate prediction of the future demand/load, e.g., traction power demand for driving a vehicle. In this paper , to enhance the H-MPC rob ustness against the long-term demand for ecast uncertainty , we propose to use the high-quality preview information enabled by the connectivity technology over the short horizon to modify the planned trajectories via a constraint tightening approach at the scheduling layer . Simulation results are presented f or a simplified vehicle model to confirm the effectiveness of the pr oposed robust H-MPC framework in handling demand forecast uncertainty . I . I N T RO D U C T I O N For dynamic systems with multiple time scales, centralized optimization could lead to non-uniform performance as a centralized controller inherently assumes that all the system outputs are equally reactiv e [1]. Howe v er , the outputs of such dynamic systems tend to react and respond at different rates. This is a common observ ation for dynamic systems with slo w and storage states, e.g., microgrids [2], vehicle [3]–[6] and aircraft [7] thermal management systems. One approach to address the aforementioned issue in a centralized scheme is to select a relatively long pre- diction horizon to ensure satisfactory performance of the slow states. While this approach could fulfill the ov erall performance requirements, it has sev eral significant issues for practical applications: • Firstly , it will significantly increase the computational complexity . For fast states, the computational complex- ity is often not justifiable as the information downstream of the horizon has minimal influence on the optimal ev olution of the fast state trajectories. • Secondly , for most systems operating in a dynamic en vironment, accurate prediction of the demand/load profile ov er a long prediction horizon is not feasible. For example, vehicle speed cannot be predicted accurately *This paper is based upon the work supported by the United States Department of Energy (DOE) under award No. de-ar0000797. 1 M.R. Amini and J. Sun are with the Department of Naval Architecture & Marine Engineering, Univ ersity of Michigan, Ann Arbor, MI 48109 USA. Emails: { mamini,jingsun } @umich.edu 2 I. K olmano vsky is with the Dept. of Aerospace Engineering, Univ ersity of Michigan, Ann Arbor, MI 48109 USA. Email: ilya@umich.edu ov er a long prediction horizon based on v ehicle-to- vehicle (V2V) and vehicle-to-infrastructure (V2I) infor- mation [4], [5]. The common approach exploited in the literature [6], [8], [9] for systems with multi-timescales is the hierarchical control with multiple layers. The hierarchical framework allo ws for optimization over different planning horizons at each layer based on the requirements of different system outputs. It also helps to significantly reduce the computational cost compared to its centralized counterpart [4], [6], [10]. Whether centralized or hierarchical, the uncertainty as- sociated with the long-term prediction of the demand/load is the common problem for systems with slow dynamics. As the optimization often pushes the system to w ork at its limit for best efficienc y , it is important to be able to enforce the constraints robustly in the presence of demand forecast uncertainty . This is because for slo w states, once the violation of the constraint occurs, the controller has to put more efforts to compensate for this violation, thereby reducing the ov erall system efficienc y . Unlike the most common techniques in the literature on the robust design of hierarchical and distributed MPC [6], [9], [11] which mainly focus on robustifying the controller under the presence of e xternal unknown dis- turbances, the robustness to long horizon demand forecast uncertainty has not been fully studied. From a new perspecti v e, in this paper , we consider the long horizon demand forecast uncertainty challenge which plays a critical role in planning and optimization of slow/storage states with wide application in v ehicle and aircraft ther- mal management, building HV A C control, and power grid optimization. T o address this challenge, we exploit the H- MPC optimization framework as the hierarchical architecture provides a unique capability of using the demand forecast with dif ferent accuracies ov er different prediction horizons. Our proposed H-MPC has two layers as scheduling and piloting layers. The scheduling layer at the top computes the optimal control input at a slow update rate ov er a relativ ely long prediction horizon. The output of the scheduling layer is the optimal trajectories of the “slow” states of interest according to an approximate prediction of the demand over the long planning horizon. In the lower layer , piloting layer, ov er a much shorter prediction horizon and using a faster update rate, an MPC is designed to follow the scheduled slow state trajectories coming from the upper layer while fulfilling the overall system performance requirements. It is en visioned that ov er a relati vely short prediction horizon, the demand/load profile can be accurately predicted. As an example for connected and automated vehicles, the 1 V2V/V2I communications can be used to predict the future vehicle speed and the associated traction power demand over a short time windo w accurately [12]. W e propose to use this short horizon accurate demand pre view information to predict the system state ev olution ov er the prediction horizon of the piloting layer MPC while tracking the planned tra- jectory coming from the scheduling layer MPC. Computing the ev olution of the system state with a faster update rate when the accurate demand previe w is accessible allows for detecting the constraint violation proactively over the short prediction horizon. Particularly , we can predict the deviation of the slow state from the planned trajectory and realize the possibility of constraint violation over the short prediction horizon. It is noted that, assuming a perfect model, the deviation in the reference tracking and constraint violation at the piloting layer are mainly attrib uted to the mismatch between the actual demand and the approximate pre view incorporated for planning at the scheduling layer . Once these deviations and possible constraint violation are predicted ov er the shorter prediction horizon, the constraint set at the scheduling layer is re-computed and tightened to make the H-MPC prepared for the upcoming situation during which the chances of constraint violation are high. The main contribution of this paper is to develop a robust H-MPC for a class of dynamic systems with slow states subject to demand previe w uncertainty . W e propose to utilize short and long-range demand previe w information to compute robust constraint set to enhance the con ventional H-MPC robustness against the demand previe w uncertainty while improving the overall system efficienc y . The perfor- mance of the proposed rob ust H-MPC is studied through simulations for a vehicle thermal management case study . I I . H I E R A R C H I C A L M P C F O R M U L A T I O N W e consider the follo wing discrete-time linear time- in variant system, x k +1 = A x k + B 1 ˜ u k + B 2 ˆ u k , (1) where x = [ x 1 , · · · , x n ] | ∈ R n , ˜ u ∈ R m , and ˆ u ∈ R h are the system state, control input, and demand/load on the system, respectively , and A ∈ R n × n , B 1 ∈ R n × m , and B 2 ∈ R n × h . The integer k denotes the time step and the sampling period of (1) is assumed to be T . W e further assume that dif ferent states of the system ( x i x , i x = 1 , · · · , n ) respond at dif ferent rates. Depend- ing on the relativ e time scales of different states, they can be categorized into “fast” and “slow” states: x = [( x fast ) | , ( x slow ) | ] | . Note that ˜ u contains the physical con- trol inputs to the system that are optimized to regulate the ov erall performance. On the other hand, ˆ u is the v ector of non-adjustable inputs to the system representing measured external loads. Since ˆ u is a measured input to the system, it af fects the system states, and the kno wledge about ˆ u can be incorporated as a previe w for optimization of ˜ u . In automotiv e applications, traction power demand ( P trac ) for driving a vehicle is an example of ˆ u . The state and control input of the linear system in (1) are constrained: x ∈ X , ˜ u ∈ U , where X = { x k | P x x k ≤ q x } and U = { ˜ u k | P u ˜ u k ≤ q u } are polyhedral sets. A. Single-Layer MPC (S-MPC) F ormulation For the system (1), we first consider a centralized single- layer MPC (S-MPC) defined according to the follo wing finite-horizon optimization problem: min ˜ U k N X j =0 ` S-MPC ( k + j | k ) , s.t. x ( k + j + 1 | k ) = A x ( k + j | k ) + B 1 ˜ u ( k + j | k ) + B 2 ˆ u ( k + j | k ) , x ( k + j | k ) ∈ X , ˜ u ( k + j | k ) ∈ U , x ( k | k ) = x k , (2) where ` S-MPC is the S-MPC stage cost, ˜ U k = [ ˜ u ( k | k ) | , · · · , ˜ u ( k + N − 1 | k ) | ] | is the control input sequence, and ˆ U k = [ ˆ u ( k | k ) | , · · · , ˆ u ( k + N − 1 | k ) | ] | is the sequence of the “predicted” demand pre view . The cost function ` S-MPC is defined as [13]: ` S-MPC =   r ( k + j | k ) − r d ( k + j )   2 Λ (3) where r k = E x k + F ˜ u k , and r d ∈ R n r represents the desired operation of the system. When using the solution of the optimization problem (2) to control the system (1), the MPC feedback law defined by ˜ u ( k | k ) inherently assumes that all the system outputs are equally reactive [1], howe ver , in the case considered here, the outputs of the system respond ov er different time scales. B. Hierar chical MPC (H-MPC) F ormulation The H-MPC considered in this paper has two layers [9]: • scheduling layer at the top with an MPC that com- putes the optimal control inputs and the “slow” state trajectories at a slo wer update rate of T s = ν T , where ν is a positiv e integer , ov er a relatively long prediction horizon ( H s ). An approximate prediction of the demand sequence is incorporated at this layer . • piloting layer with an MPC implemented over a much shorter prediction horizon ( H p ) and using a faster update rate ( T ) to track the planned trajectories of the slow states from the scheduling layer . The piloting layer MPC has access to a more accurate prediction of the demand profile over H p . The scheduling layer MPC is based on the following finite- horizon optimization problem: min ˜ U s k s H s X j =0 ` scheduling ( k s + j | k s ) , s.t. x s ( k s + j + 1 | k s ) = A s x s ( k s + j | k s ) + B s 1 ˜ u s ( k s + j | k s ) + B s 2 ˆ u s ( k s + j | k s ) , x s ( k s + j | k s ) ∈ X , ˜ u s ( k s + j | k s ) ∈ U , x s ( k s | k s ) = x s k s , (4) where x s and ˜ U s k s = [ ˜ u s ( k s | k s ) | , · · · , ˜ u s ( k s + H s − 1 | k s ) | ] | are the state vector and control input sequence updated at the slower sampling time of T s = ν T , which is index ed by 2 k s . The sequence of the approximate demand profile ˆ U s k s = [ ˆ u s ( k s | k s ) | , · · · , ˆ u s ( k s + H s − 1 | k s ) | ] | is also incorporated at the scheduling layer as the pre view . The stage cost function ( ` scheduling ) of (4) is the same as the one considered in (3) while it is updated e very T s second. The prediction model in (4) is based on the do wn-sampled version of (1) with slower sampling rate of T s as follows: A s = ( A ) ν , B s 1 = ν − 1 X j =0 ( A ) j B 1 , B s 2 = ν − 1 X j =0 ( A ) j B 2 , (5) Once the scheduling layer optimization problem is solved, based on the computed control sequence U s k s , the sequence of the scheduled slow states at the subsequent time steps ov er the horizon of the piloting layer MPC ( H p ) can be calculated: ∗ x slow ( k s + 1 | k s ) , · · · , ∗ x slow ( k s + H p | k s ) , where ∗ x slow is the vector of scheduled slow states of interest. Note that unlike the approach presented in [9], [13], the horizon of the second layer , piloting layer ( H p ), can be longer than H p > ν . In the next step, the sequence of ∗ x slow is passed to the piloting layer as a piece wise constant function to account for the faster sampling rate of the lower layer controller ( T ): ∗ X slow ( k | k s ) = [ ∗ x slow ( k s | k s ) | , ∗ x slow ( k + 1 | k s ) | , · · · , ∗ x slow ( k + ν − 1 | k s ) | , ∗ x slow ( k s + 1 | k s ) | , · · · , ∗ x slow ( k + 2 ν − 1 | k s ) | , · · · , ∗ x slow ( k + H p − 1 | k s ) | ] | (6) where ∗ X slow is the sequence of the scheduled slo w states to be tracked at the piloting layer . Note that the sequence of ∗ X slow between the slow updates are assumed to be constant, e.g., ∗ x slow ( k s | k s ) = ∗ x slow ( k + 1 | k s ) = · · · = ∗ x slow ( k + ν − 1 | k s ) . The piloting layer MPC is based on the solution of the following optimal control problem: min ˜ U k H p X j =0 ` piloting ( k + j | k ) , s.t. x ( k + j + 1 | k ) = A x ( k + j | k ) + B 1 ˜ u ( k + j | k ) + B 2 ˆ u ( k + j | k ) , x ( k + j | k ) ∈ X , ˜ u ( k + j | k ) ∈ U , x ( k | k ) = x k . (7) where the stage cost ` piloting is a combination of the original cost in (3) and extra tracking terms to enforce the slo w states ( x slow ) to track the scheduled slow state trajectories ( ∗ x slow ) presented in (6). ˜ U k = [ ˜ u ( k | k ) | , · · · , ˜ u ( k + H p − 1 | k ) | ] | is the control input sequence of the piloting layer MPC. It is assumed that the incorporated demand previe w at the piloting layer ( ˆ U k = [ ˆ u ( k | k ) | , · · · , ˆ u ( k + H p − 1 | k ) | ] | ) is more accurate than the demand previe w used at the scheduling layer ( ˆ U s k s ). When implementing the H-MPC, at the end of each optimization iteration, ˜ u ( k | k ) is commanded to the system and the horizon is shifted by one time step ( T ). I I I . R O B U S T H I E R A R C H I C A L M P C An intuitiv e approach for improving the constraint sat- isfaction at the piloting layer is to tighten the constraints at the scheduling layer . If the constraint tightening is done conservati vely , the overall efficienc y of the system may be degraded. T o avoid unnecessary conservatism, in this paper , we propose to compute a robust constraint set (denoted by X rob ust ) at each time step k . W ith a faster update rate and more accurate incorporated demand previe w , the ev olution of the slo w state trajectories ov er the short horizon of the pilot- ing layer MPC can be predicted while tracking the planned trajectories from the scheduling layer . This prediction of the slow state ev olution is then used to compute X rob ust via estimating the deviation in tracking the planned slow state trajectory and potential violation of the constraint over H p . This concept is shown in Fig. 1. Fig. 1. Schematic of the constraint violation prediction to compute X robust . At time step k , the scheduling layer MPC plans the slo w state trajectories (b) based on an approximate kno wledge of the demand previe w (a). The piloting layer MPC is designed to not only regulate the fast states but track the planned slo w state trajectory while accessing the accurate demand pre view at a faster update rate. The evolution of the slow states ov er the short prediction horizon is computed (d). If any deviation or constraint violation is predicted over H p , the scheduling layer constraint set is tightened proactively . According to the accurate demand previe w ( ˆ U k = [ ˆ u ( k | k ) | , · · · , ˆ u ( k + H p − 1 | k ) | ] | ), the ev olution of the system states x over the piloting layer horizon H p can be predicted based on (1): x ( k + j + 1 | k ) = A x ( k + j | k ) + B 1 ˜ u ∗ ( k + j | k ) + B 2 ˆ u ( k + j | k ) (8) where j = 0 , · · · , H p − 1 and ˜ u ∗ ( k + j | k ) is the vector of the control inputs at k + j computed based on the piloting layer optimal control problem (7) solution at time step k . Based on the state predictions (8), the constraint violation of the slow states of interest can also be predicted at subsequent time steps, which will be used to tighten the state constraint set for the optimization problem at the scheduling layer . W e denote the tightened constraint set ov er the piloting layer horizon at k + j + 1 , j = 0 , · · · , H p − 1 , by X j , which is computed as follows: X j = { x ( k | k ) | (9) P x x ( k + j + 1 | k ) ≤ ( q x − ¯ q x ( k + j + 1 | k )) } , where q x = [ q 1 x , · · · , q n q x ] | ∈ R n q represents the state constraints, and is used in the definition of the original state constraint set X = { x k | P x x k ≤ q x } . Note that if we assume q x represents the upper and lower bounds on each state and each row of P x corresponds to one constraint, then non- 3 zero elements of P x are either 1 (upper limit) or − 1 (lower limit). W e then define ¯ q x ( k + j + 1 | k ) in (9) as the vector of predicted constraint violation at k + j + 1 based on the state prediction made at k (8). If q i q x ( i q = 1 , · · · , n q ) corresponds to x i x ( i x = 1 , · · · , n ), then the i th q element of ¯ q x at k + j + 1 , i.e., ¯ q i q x ( k + j + 1 | k ) , is calculated as follows: ¯ q i q x ( k + j + 1 | k ) = (10)            0 if x i x is fast 0 if x i x ( k + j + 1 | k ) < q i q x & x i x is slow x i x ( k + j + 1 | k ) − q i q x if x i x ( k + j + 1 | k ) ≥ q i q x & x i x is slow Note that since we are interested in the slow states constraint enforcement, the elements of ¯ q x corresponding to fast states are set to zero in (10). Eventually and based on the computed X j s in (9), the robust constraint set ( X rob ust ) to be implemented at the next time step k + 1 at the scheduling layer is computed as: X rob ust = X ∩ X 1 ∩ · · · ∩ X k + H p − 1 , (11) based on which, the scheduling layer MPC of the robust H- MPC is re-formulated as follows: min ˜ U s k s H s X j =0 ` scheduling ( k s + j | k s ) , s.t. x s ( k s + j + 1 | k s ) = A s x s ( k s + j | k s ) + B s 1 ˜ u s ( k s + j | k s ) + B s 2 ˆ u s ( k s + j | k s ) , x s ( k s + j | k s ) ∈ X rob ust , ˜ u s ( k s + j | k s ) ∈ U , x s ( k s | k s ) = x s k s , (12) Remark : Note that one can propose to tighten the constraint whenev er a violation of the constraint is detected at the piloting layer at time step k . This event-triggered “passiv e” constraint tightening approach can be incorporated in the H- MPC at the scheduling layer (4) by replacing the original constraint set X with X passive , which is computed as follows: X passive = { x ( k | k ) | P x x ( k | k ) ≤ ( q x − ¯ q x ( k | k )) } , (13) where ¯ q x ( k | k ) is the v ector of measured (detected) constraint violation at each time step k . While X passive is easy to compute, it leads to marginal improv ement in the H-MPC robustness, see [4] for an example. This can be explained with respect to the slow dynamic characteristics which call for a long prediction horizon with enough lead time to ensure an effecti ve constraint enforcement. In other words, if the H-MPC constraint set is tightened based on (13) after the occurrence of the violation, compensating for the occurred violation in the subsequent time steps may not be effecti ve as it takes extra effort from the controller , potentially reducing the system efficienc y . I V . C A S E S T U D Y : V E H I C L E T H E R M A L M A N A G E M E N T A simplified model of a vehicle is considered in this section to demonstrate the effecti veness of the proposed robust H-MPC. The main structure of the model is adopted from [13] and a ne w state has been added to the model to represent the slo w state. The model has four states, including the position of the v ehicle ( x 1 ), vehicle speed ( x 2 ), on- board stored energy ( x 3 ), e.g., battery state of char ge, and thermal index of the vehicle energy storage ( x 4 ), e.g., battery temperature. Compared to the first three states, the fourth state has a relatively slower dynamics. The control inputs to the model are acceleration ( ˜ u 1 ), deceleration ( ˜ u 2 ), and the power consumed for thermal management of the energy storage ( ˜ u 3 ). The power to an external load ( ˆ u ) should also be delivered by the energy storage. The power demand is assumed to be non-adjustable, e.g., an auxiliary load on the battery to fulfill a demand within the vehicle. The simplified vehicle model has the following structure: x k +1 =    1 1 0 0 0 1 0 0 0 0 1 0 0 0 0 1    x k (14) +    1 1 0 1 − 1 0 − 0 . 8 0 . 8 − 0 . 15 1 1 − 0 . 85    ˜ u k +    0 0 − 0 . 25 1    ˆ u k where x = [ x 1 , x 2 , x 3 , x 4 ] | and ˜ u = [ ˜ u 1 , ˜ u 2 , ˜ u 3 ] | . The controller design objective is to track a given desired po- sition ( x d 1 ) while enforcing the state and input constraints. Specifically , the thermal index of the energy storage ( x 4 ) is preferred to be maintained below its upper optimum operation limit x U L 4 . It is noted that while the vehicle tracks the reference position, the energy storage needs to deliver power to satisfy the external load ( ˆ u ). Moreover , operating the vehicle by adjusting ˜ u and responding to ˆ u consumes the on-board vehicle energy . The operation of the vehicle is required to be optimized such that the on-board energy lasts until the end of the vehicle mission. The state and input constraints are listed below: [ − 1 , − 20 , 0 , 0] | ≤ x k ≤ [100 , 20 , 100 , 30] | (15) [ − 1 , − 1 , − 1] | ≤ ˜ u k ≤ [1 , 1 , 1] | . (16) In practice, the exact demand previe w is assumed to be unknown over the entire vehicle mission, ho we ver , an approximate knowledge of the demand is av ailable to the vehicle. Moreover , we assume the vehicle is connected to a server which can provide short-range and accurate demand predictions to the vehicle. The actual and approximate pro- files of the po wer demand as the load ( ˆ u ) on the on-board energy storage are shown in Fig. 1-(a,c). In order to ev aluate the H-MPC performance for the vehicle model (14), first a centralized single-layer MPC (S- MPC) is designed. T o this end, we consider an S-MPC formulated over a finite-horizon ( N ) with ˜ u 1 , ˜ u 2 , ˜ u 3 being the optimization v ariables according to (2). The stage cost of S-MPC, ` S-MPC , is defined as follows: ` S-MPC = λ 1 ˜ u 2 1 + λ 2 ˜ u 2 2 + λ 3 ( x 1 − x d 1 ) 2 (17) where x d 1 is the desired and known position trajectory . X and U are the conv ex sets properly defined according to (15) and (16), respectiv ely . The S-MPC optimization problem is solved at ev ery time step, then the horizon is shifted by one 4 step ( T = 1 sec ), and only the current control is commanded to the system. The S-MPC simulation is carried out on a desktop computer , with an Intel ® Core i7@2.60 GHz pro- cessor , in MA TLAB ® /SIMULINK ® using Y ALMIP [14] for formulating the optimization problem, and IPOPT [15] for solving the optimization problem numerically . Additionally , in order to ensure the feasibility of the S-MPC solution, slack optimization variables are used to soften the constraints on the position ( x 1 ) and thermal index ( x 4 ). In all simulated cases, a very large weighting factor (e.g., 10 6 ) has been considered for the slack variable term in the MPC stage cost to ensure a comparable performance. Fig. 2 shows the performing of the S-MPC with short ( N = 10 (10 sec ) ) and long ( N = 100 ) prediction horizons. Under the unrealistic assumption that the controller has the exact knowledge of the demand profile shown in Fig. 1-(c), Fig. 2-(a) shows that the S-MPC with long horizon tracks the desired position accurately with no x 1 constraint violation while enforcing the constraint on the thermal index state x 4 . When the shorter horizon is used, the S-MPC cannot mitigate the violations of x 4 constraint; thus, it compromises x 1 track- ing (Fig. 2-(b)) to avoid further increase in the thermal index. The reason for this deviation in the S-MPC performance with shorter prediction horizon is that the controller does not have enough lead time to take proactiv e actions in response to the external power demand early on. Moreover , compared to the long horizon S-MPC, the short horizon controller consumes more energy (Fig. 2-(c)). Fig. 2. Performance of the S-MPC with different prediction horizon lengths: (a) tracking the desired position trajectory , (b) thermal index, and (c) on-board energy . It was shown that extending the prediction horizon im- prov es S-MPC tracking and constraint satisfaction perfor- mance. While this observation is not a surprise, as discussed in Sec II, long horizon optimization for slow states is asso- ciated with demand forecast uncertainty and computational complexity . Fig. 2 also shows the S-MPC performance with N = 100 when the incorporated pre view is based on the approximate kno wledge of the demand profile (Fig. 1-(a)). As can be seen, the uncertainty in the demand previe w significantly alters the long horizon S-MPC performance. The desired position is not being tracked preferably and the thermal index constraint violation occurs more often. Addi- tionally , according to Fig. 2-(c), the S-MPC with the av erage demand previe w has the highest energy consumption. T o address the demand forecast uncertainty impact on the S-MPC performance, a robust H-MPC is designed for the vehicle model (14) according to the proposed framew ork in Secs. II and III. The scheduling layer MPC is based on the optimization problem formulated in (4) with the same stage cost defined in (17), and with a slo wer update rate. The long prediction horizon of the scheduling layer MPC is set to H s = 20 (100 sec ) . The sampling time of the scheduling layer MPC is T s = 5 sec , thus ν = 5 . The robust constraint set X rob ust is calculated according to the procedure (9)-(11) proposed in Sec. III. The scheduling layer MPC optimization problem is solved according to the a verage demand pre view shown in Fig. 1-(a). This solution is used to calculate the optimal thermal index trajectory ( ∗ x slow 4 ). This scheduled ∗ x slow 4 trajectory is then incorporated in the stage cost of the piloting layer, which is formulated according to (7) as a finite-horizon optimization problem ov er a relatively shorter prediction horizon ( H p = 20 (20 sec ) ) and with a faster sampling period of T p = T = 1 sec . Note that it is assumed the e xact kno wledge of the demand previe w is av ailable to the piloting-layer MPC over the receding horizon H p = 20 (20 sec ) . The stage cost of the piloting layer MPC ( ` piloting ) is defined as: ` piloting = λ 1 ˜ u 2 1 + λ 2 ˜ u 2 2 + λ 3 ( x 1 − x d 1 ) 2 + λ 4 ( x 4 − ∗ x slow 4 ) 2 . (18) Since T s > T p , the scheduled ∗ x slow 4 trajectory is passed on as a piecewise constant function according to (6). Fig. 3 shows the comparison between the S-MPC and the baseline H-MPC with X as the constraint set. The H-MPC, unlike the S-MPC with av erage demand previe w , manages to deliv er an acceptable x 1 tracking performance (Fig. 3-(a)). Since the robust constraint set X rob ust is not being incorporated in the baseline H-MPC (4), often violation of thermal in- dex state x 4 is still observed in Fig. 3-(b). Additionally , compared to the S-MPC with long prediction horizon and av erage demand previe w , the H-MPC sho ws less energy consumption (Fig. 3-(c)) Overall, the H-MPC slightly relaxes the accurate x 1 tracking requirement by putting part of the effort into tracking the scheduled ∗ x slow 4 . Howe ver , due to the uncertainty in the long-range demand forecast used at the scheduling layer , x 4 constraint enforcement is not achie ved. Note that operating the vehicles while the thermal index is maintained close to the upper limit x U L 4 is equiv alent to less energy consumption. Since the demand pre view is uncertain, steering x 4 to x U L 4 (e.g., 30 o C ) increases the chances of constraint violation once the vehicle faces the actual power demand. Upon incorporation of X rob ust in (4), the robust H-MPC (12) becomes able to take advantage of the short horizon and more accurate demand previe w over the piloting layer hori- zon. As shown in Fig. 4, compared to the baseline H-MPC 5 Fig. 3. Comparison between performances of S-MPC and H-MPC: (a) tracking the desired position trajectory , (b) thermal index, and (c) on-board energy ( H p = 20 sec , H s = N = 100 sec ). (4) with X as the constraint set at the scheduling layer , the robust H-MPC (12) with X rob ust shows significantly better x 4 constraint enforcement under the uncertainty associated with long horizon demand forecast. As shown in Fig. 4, the robust H-MPC starts to tighten the x U L 4 constraint early on to ensure its enforcement during the period that the demanded load on the battery is higher and any uncertainty in the demand previe w could lead to violation of the thermal index limit. Fig. 4. Comparison between performances of H-MPC and robust H-MPC in regulating the energy storage thermal index . The performance of the robust H-MPC is also compared with the H-MPC with X passive , which is computed accord- ing to (13). Unlike X rob ust , in the passive approach X is tightened whenev er the violation of x U L 4 is detected by the piloting layer MPC. This means the scheduling layer MPC is informed about the constraint violation only after its occurrence, then it manages to update the scheduled trajectories to reduce the chances of constraint violation in subsequent time steps. Fig. 4 shows that compared to the baseline H-MPC, the H-MPC with X passive can only slightly decrease the x 4 constraint violation. This can be explained with respect to the slower dynamics of x 4 which call for a proacti ve constraint tightening algorithm to effecti vely Fig. 5. Comparison between performances of H-MPC and robust H-MPC: (a) tracking the desired position trajectory , and (b) on-board stored energy ( H p = 20 sec , H s = N = 100 sec ) improv e the H-MPC robustness. This requirement has been addressed by the proposed robust H-MPC frame work in this paper . Note that with baseline and passive H-MPCs, enforcement of the original x U L 4 constraint is infeasible. As a results, the computed X and X passive shown in Fig. 4 are based on the softened x 4 constraint using slack variables. The other interesting observ ation about the robust H-MPC is its performance in tracking x d 1 (Fig. 5-(a)) while enforcing x U L 4 constraint. Gi ven the limited energy stored on-board of the vehicle, the robust H-MPC slightly relaxes the x 1 trajectory tracking objective (while it enforces x 1 constraint) during the period it puts more ef fort to mitigate x U L 4 violation by decreasing (cooling) the ener gy storage thermal index from around t = 10 sec to t = 60 sec . After this time period, since the robust H-MPC does not see any other major external power demand over the long prediction horizon, it increases the position tracking priority , see Fig. 5. The energy consumption results of different studied H- MPCs are sho wn in Fig. 5-(b). It can be seen that the rob ust H-MPC has the lowest energy consumption, as compared to the baseline and passi ve H-MPCs. This is an interesting observation as it shows the robust H-MPC framew ork does not necessarily lead to a conservati ve controller design which is usually expected when designing a rob ust MPC under the influence of unknown external disturbances. W e recall that for the specific class of dynamic systems with slow states considered in this paper , steering the slow state to its limit usually leads to lo wer energy consumption. The constraint violation often occurs when the external power demand on the energy storage is higher, meaning during the time periods that the energy storage is required to deliv er po wer for operation of the vehicle, it also has to provide extra power for regulating the thermal state. The robust H-MPC, on the other hand, effecti vely shifts the thermal management load to those periods during which the po wer demand is relatively lower . Thanks to this intelligent thermal management load shift by the robust H-MPC, not only less often thermal 6 constraint violations are observed, but the overall efficienc y (fuel economy) of the vehicle is also higher, compared to the baseline H-MPC. V . S U M M A RY A N D C O N C L U S I O N S The problem of robust hierarchical MPC (H-MPC) design for constrained systems with multi-timescales and exposed to external demands was considered in this paper . From a new prospectiv e, we proposed to use demand previe w information with different accuracies at different layers of an H-MPC to proactiv ely predict the (chances of) constraint violations and compute a robust constrain set to enhance the ov erall robustness of the H-MPC against the uncertainty associated with long horizon demand forecast. The computation of the robust constraint set is built upon the information that is av ailable at each layer of an H-MPC, but they are not often being communicated between the layers. By lev eraging this information and accessing to demand previe w information with dif ferent accuracies, we showed that the rob ustness of conv entional H-MPC can be effecti vely improv ed. W e demonstrated the application of the proposed robust H-MPC for a simplified vehicle thermal management case study . The simulation results confirmed the effecti veness of the proposed approach in regulating the thermal state of the vehicle based on an uncertain knowledge of the demand previe w while improving the ov erall system ef ficiency , as compared to a con ventional H-MPC framework. R E F E R E N C E S [1] E.D.M. Medagoda and P .W . Gibbens. Multiple Horizon Model Pre- dictiv e Flight Control. Journal of Guidance, Contr ol, and Dynamics , 37(3):946–951, 2014. [2] W .C. Clarke, C. Manzie, and M.J. Brear . Hierarchical Economic MPC for Systems with Storage States. Automatica , 94:138–150, 2018. [3] M.R. Amini, X. Gong, H. W ang, Y . Feng, I. K olmanovsky , and J. Sun. Sequential Optimization of Speed, Thermal Load, and Power Split in Connected HEVs. In ACC , 2019. Philadelphia, P A, USA. [4] M.R. Amini, I. Kolmanovsk y , and J. Sun. T wo-Layer Model Predictive Battery Thermal and Energy Management Optimization for Connected and Automated Electric V ehicles. In 57th IEEE CDC , 2018. Miami Beach, FL, USA. [5] M.R. Amini, H. W ang, X. Gong, D. Liao-McPherson, I. Kolmano vsky , and J. Sun. Cabin and Battery Thermal Management of Connected and Automated HEVs for Improved Energy Efficiency Using Hierarchical Model Predictive Control. IEEE Tr ansactions on Control Systems T echnology , 2019. [6] J.P . Koeln and A.G. Alleyne. Robust Hierarchical Model Predictiv e Control of Graph-Based Power Flo w Systems. Automatica , 96:127– 133, 2018. [7] J. K oeln, H. Pangborn, M. Williams, M. Kawamura, and A. Alleyne. Hierarchical Control of Aircraft Electro-Thermal Systems. IEEE T ransactions on Contr ol Systems T echnology , 2019. [8] D. Barcelliy , A. Bemporad, and G. Ripaccioliy . Hierarchical Multi- Rate Control Design for Constrained Linear Systems. In 49th CDC , 2010. Atlanta, GA, USA. [9] R. Scattolini and P . Colaneri. Hierarchical Model Predicti ve Control. In 46th IEEE CDC , 2007. New Orleans, LA, USA. [10] R. Scattolini. Architectures for Distributed and Hierarchical Model Predictiv e Control–A Revie w. Process Contr ol , 19(5):723–731, 2009. [11] B. Picasso, D. De V ito, R. Scattolini, and P . Colaneri. An MPC Approach to the Design of T wo-Layer Hierarchical Control Systems. Automatica , 46(5):823–831, 2010. [12] Z. Y ang, Y . Feng, X. Gong, D. Zhao, and J. Sun. Eco-trajectory Planning with Consideration of Queue Along Congested Corridor for Hybrid Electric V ehicles. In TRB , 2019. W ashington, D.C., USA. [13] J.P . K oeln and A.G. Alleyne. T wo-Lev el Hierarchical Mission-Based Model Predictive Control. In ACC , 2018. Milwauk ee, WI, USA. [14] J. Lofberg. Y ALMIP: A T oolbox for Modeling and Optimization in MA TLAB. In IEEE CACSD , 2004. New Orleans, LA, USA. [15] A. W ¨ achter and L. Biegler . On the Implementation of an Interior-Point Filter Line-Search Algorithm for Lar ge-Scale Nonlinear Programming. Mathematical Pro gramming , 106(1):25–57, 2006. 7

Original Paper

Loading high-quality paper...

Comments & Academic Discussion

Loading comments...

Leave a Comment