Zhiqiang Geng ,Ke Yang ,Yongming Han ,Xiangbai Gu ,2,*
1 College of Information Science and Technology,Beijing University of Chemical Technology,Beijing 100029,China
2 Sinopec Engineering(Group)Co.,Ltd.,Beijing 100101,China
Keywords:High order statistics Nonlinear characteristics diagnosis Interpretative structural model TE process
ABSTRACT Nonlinear characteristic fault detection and diagnosis method based on higher-order statistical(HOS)is an effective data-driven method,but the calculation costs much for a large-scale process control system.An HOS-ISM fault diagnosis framework combining interpretative structural model(ISM)and HOS is proposed:(1)the adjacency matrix is determined by partial correlation coefficient;(2)the modified adjacency matrix is defined by directed graph with prior knowledge of process piping and instrument diagram;(3)interpretative structural for large-scale process control system is built by this ISM method;and(4)non-Gaussianity index,nonlinearity index,and total nonlinearity index are calculated dynamically based on interpretative structural to effectively eliminate uncertainty of the nonlinear characteristic diagnostic method with reasonable sampling period and data window.The proposed HOS-ISM fault diagnosis framework is verified by the Tennessee Eastman process and presents improvement for highly non-linear characteristic for selected fault cases.
With the development of industrial automation and integration,improvement of fault diagnosis technology has a great significance for industry.Recently,the development of computer and signal processing technologies promotes the development of fault diagnosis technology.Fault diagnosis methods can be divided into knowledge approach,analytical model approach and signal processing approach.Venkatasubramanian et al.[1-3]have divided diagnostic methods into quantitative model method,qualitative model method and historical data method.Some methods have been proposed based on data-driven diagnostic,such as principal component analysis(PCA),kernel principal component analysis(KPCA)[4,5], fisher discriminant analysis[6],and others using AI in fault diagnosis[7-9].New theories are introduced to fault diagnosis,such as model reasoning methods based on graph theory and methods with evolutionary algorithm[10].
HOS is one of the data-driven methods.Third-order and fourth order moments or higher order statistics are used for nonlinear analysis on communication signal,radar signal,nonlinear seismic wave and audio signal and monitoring mechanical condition[11-13].Choudhury et al.[14] first proposed non-Gaussianity index(NGI)and nonlinearity index(NLI),and applied the index to nonlinear characteristic diagnostic of control loops successfully.Choudhury[15]modified NGI and NLI and proposed a new index,total nonlinearity index(TNLI),to measure the degree of nonlinearity,with the diagnostic rate of false positives reduced.These indices have been used to detect the valve adhesion and poor controller performance.However,the difference of data selection causes inconclusive results,lowering the efficiency of nonlinear characteristics of the diagnosis.Besides,bicoherence is so sensitive that the indexes will change a lot if the data have any change.For complex and multi-variable systems,it is necessary to improve the method due to unclear goals of higher-order statistics.
In 1973,ISM was developed by J.War field to analyze complex systems.Recently,Peng applied it on the outline design stage[16].Wang et al.applied it to analyze the cause of accident level[17].Yang et al.-investigated the effect of workplace risk by combining ISM and hierarchical approach[18].Singh and Kant made use of ISM to determine the relationship among the knowledge management barriers[19].However,the adjacency matrix is mostly established with the knowledge of expert system,which has the disadvantage of great subjectivity and inconsistency.Data-driven methods without prior knowledge also have the disadvantage of great blindness.Hence,establishment of adjacency matrix of ISM needs to combine prior knowledge and a datadriven method based on partial correlation coefficient.In other words,both methods can authenticate and compensate each other.Yan[20]established online contacts between two variables with comparing partial correlation coefficient and correlation coefficient.Wang et al.[21]considered that partial correlation coefficient is more effective with independent variables.Kim e al.[22]studied Copula function modeling using partial correlation coefficient.Yang et al.used correlation coefficient method to verify the efficiency of signed directed graph model[23].
ISM can be used to improve the efficiency of the method and diagnostic purposes in this study.This paper describes the relation between variables using partial correlation coefficients for defining the adjacency matrix,modifies this adjacency matrix based on priori knowledge of process piping and instrument diagram,and builds ISM of control loop system.Then above structural information is integrated with nonlinear characteristic diagnostic method,which is based on higher-order statistical,for more effective diagnosis of nonlinear characteristic of process.
The correlation coefficient describes the relationship among variables and it is seldom used to infer the inner relationship of variables.Whether the relationship is related with other factors,it should be considered.The partial correlation coefficient measures the ceteris paribus effect among variables and reflects the real linkage of variables,and as the strength of the relationship between variables increases,the absolute value of partial correlation coefficient also increases.Because of the influence of ceteris paribus,the correlation between dependent and independent variables is a dimensionless number between?1 and 1.We usually use its absolute value.
The normal data of all control loops are chosen and normalized as:

where

t is the number of control loops and m is the number of samples with value of500 in this paper.According to experimental verification,Outputs of control loops are used,which describe the relationship of control loops better.
Let xi(i=1,2,…,m)be the i-th value of variable x,then the correlation coefficient of xiand yiis

From the Eq.(1),we can get the correlation coefficient matrix

The correlation coefficient matrix is computed and the inverse matrix is obtained,which is used to obtain the partial correlation coefficient matrix.
The inverse matrix of r is

Then the partial correlation coefficient of the two variables is

where i=1,2,…,m and j=1,2,…,n.
And

There are different definitions in different industries whether the partial correlation coefficient is related.Table 1 shows the general scope of partial correlation coefficient.

Table 1 The scope of partial correlation coefficient
If Rijis a positive number and greater than the threshold value,then aij=1 and aij=0 where aijis the adjacency value of xito xj,otherwise,aij=0 and aij=1.The adjacency matrix is

where aij=1or aij=0(i=1,2,…,n;j=1,2,…,n).
Modifying adjacency matrix A′,we can build the control loop directed graph.The relationship of control loops is described by priori knowledge of piping and instrument diagram(PID)[24]in process flow chart.Adjacency matrix A is modified by the control loop directed graph.For instance,if node-i has an effect on node-j or node-i→node-j in process flow chart,the value of aijis set to 1,or aij=0 in adjacency matrix A.Both methods can authenticate and compensate each other.Finally,adjacency matrix A′is finished.
With the n×n identity matrix

we have

We can get formula RE=(A'+E)n?1,which is the reachability matrix of adjacency matrix A'

Definition 1.In the i-th row REi(i=1,2,…,n)of reachability matrix RE,if REij=1(j=1,2,…,n),then element REijis added to the reachable set,which is expressed as Si.
Definition 2.In the j-th column REj(j=1,2,…,n.)of reachability matrix RE,if REij=1(i=1,2,…,n),then element REijis added to the first set,which is expressed as Bj.
The influencing factors can be stratified based on Si∩Bj=Si,and then the highest level of factors L1is identified.The column and the row corresponding with L1are removed from the reachability matrix RE.By the same decision rules,L2,L3…,Lkare definite.The last step is to model the hierarchy of ISM using each level of L.Specific examples are shown in Section 4.
The detail steps for this algorithm are described in the ISM framework in Section 3.2.According to the ISM,a control loop system is divided into several small sets of control loops,which are used to systematic fault detection and diagnosis.This method avoids the blindness of diagnosis and increases diagnostic efficiency.
In practice,many industrial processes include not only Gaussian signals but also non-Gaussian and nonlinear signals,which cannot be identified by the first order and second order signals.Higher order statistical theory is a good choice for identifying these signals.The method of non-linearity detection uses HOS because it is sensitive to certain types of phase coupling between different frequency bands[25].
Non-Gaussianity index and nonlinearity index are based on the statistical hypothesis test[14].The principal domain of the bicoherence is 0<f1<0.5,f2<f1and 2f1+f2<1.Then they are modified and TNLI is proposed[15].
Non-Gaussianity index is defined as:

Nonlinearity Index is defined as

Total nonlinearity index is defined as

It is an important index to compare the extent of nonlinearities.Fig.1 is the rule based decision flow diagram by nonlinearity indexes[14],where α=0.05.After the fault occurs,it can be used to feature extraction of fault.The calculation is based on data length of 4096,segment length of 64,50%overlap,Hanning window with length of 64,and DFT length of 128 for one index of HOS[14].

Fig.1.Rule based decision flow diagram.
The indexes are applied in a new way with moving window in this paper.Since the calculation of these indexes is based on data,different data will lead to uncertain results of diagnosis.There are a lot of day-today monitoring and operation data in industrial processes.Section 3.1 only shows a handful of index calculation without the variation of indexes.The dynamic calculation for these indexes is needed.The proposed fault detection and diagnosis framework is showed in Fig.2.
The framework includes online and offline parts.The goal of offline is to build ISM of control loop system.For online part,after data collection,choose a small loop set by the graph based on ISM,then extract nonlinear feature and judge the fault.Specific example will be given in Section 4.
This framework process is as follows.
Step 1 Build interpretative structural model(Section 2).
Step 2 Obtain data collection from process with a fault.
Step 3 Choose data in small loop set(level i)using ISM of control system.When the fault is not detected,the small control loop set in lower level(i?1)is selected.The level is chosen from the bottom level to the upper level.These sets are good for fault detection and diagnosis order in this framework.
Step 4 Dynamically extract nonlinear feature by calculating TNLI,NGI and NLI.
We selected data by moving window with a length of 4096 at a step of 64 and searching the index variation.Meanwhile,moving window is used to implement indexes of HOS in on-line manner,where the window length is maintained by introducing the most recent measurement and discarding the oldest one.In other words,one index calculation needs 4096 data length and dynamical calculation needs a moving window with a length of4096.PV-SP of every control loop is used to calculate three indexes[15].NGI and NLI are calculated as shown in Fig.1 and TNLI can be calculated.
Step 5 Faultis detected according to nonlinear feature from Step 4.After comparison to the nonlinear feature of control loops,if fault is detected,this process will be recovered.Otherwise,go back to Step 3.

Fig.2.The proposed fault detection and diagnosis framework.
The Tennessee Eastman(TE)process is a classic chemical process simulation model proposed by Downs and Vogel[26].It is usually used to multivariable control,nonlinear control,optimization,model predictive control and fault diagnosis[27-29].TE process includes reactor,condenser,compressor,vapor/liquid separator and stripper,with 41 measurements containing 22 continuous variables and 19 chromatography component measurement values and with 12 manipulated variables and 20 disturbances which can be set[30].
Larsson[31]has proposed a self-optimizing control of TE process and simulated it with Matlab.Here we paper use a model named MultiLoop_Skoge_mode1 in Matlab for getting data of TE process.Control loops of TE are analyzed in Table 2 where r1,r2,r3,r4,r5,r6,r7,FP,SP17,PctGsp,and Eadj are intermediate variables of TE process,which are useful for calculation.For getting real-time data of these intermediate variables,TE simulation is modified without affecting the operation.A control loop digraph can be built by a priori knowledge,as shown in Fig.3.
The simulation data of TE process without disturbances are obtained.According to the ISM framework in Fig.2,adjacency matrix(Table 3),modified adjacency matrix(Table 4),and reachability matrix(Table 5)can be finished,and ISM is built(Table 6).Control loops of Level 7 present the highest influence on control system and the impact of the front level is weak.The small control loop sets in Table 6 are good for an effective fault detection and diagnosis of HOS.
The above description shows the importance and relation of control loops in TE process.It is helpful to the next nonlinear characteristicanalysis.A dynamic calculation is introduced into NLI,TNLI and NGI in the following four cases,and the dynamic figure is obtained,with which the location and point of failure can be identified.This study concentrates on those faults that produce large impact and occur at control loops.Three cases IDV(4),IDV(11)and IDV(14),with different interfering signals and same location,are chosen for studying characters of these indexes.Case IDV(12)occurs at other loop.

Table 2 Control loops of TE control process

Fig.3.Control loop digraph of TE process.
4.3.1.Case 1—IDV(4)
After running TE simulation model,IDV(4)(pulse),associated with step change in the reactor cooling water inlet temperature,is set topulse interference at 71 h.PV-SP of each loop is acquired,values of NLI,TNLI and NGI are calculated with 64 steps interval.With small control loop set(Table 6),these indexes can be calculated.The fault is not detected until control loops in Level 2 are chosen.

Table 3 Adjacency matrix A of TE process based on partial correlation coefficient
Fig.4 shows that the curves of loop 7 are very similar and the process experiences steady state,sudden rise,and fall to ultimate stability,due to pulse signal.
For NLI,it has step change when the fault signal enters into the window of 4096,so its value increases suddenly;it keeps relatively stable as the fault signal moves along the window of 4096 and falls suddenly when the fault signal moves out of the window.It is easy to determinethe fault point.The abscissa of the first crest is 48,which can be converted to time point 48×64×0.01=30.72 h.Because the length of window is 4096,the fault point can be calculated as 30.72+4096×0.01=71.68 h.This method can detect not only the fault loop 7 but also the fault point.The proposed method shows higher output level and higher recognition rate.In other words,the method is more sensitive in detecting small changes in control loops and demonstrates persistent fault detection.

Table 4 Modified adjacency matrix A′of TE process

Table 5 Reachability matrix RE

Table 6 ISM of TE process
4.3.2.Case 2—IDV(11)
IDV(11)(random signal),the inlet temperature of cooling water is interfered at 55 h and eliminated at 151 h.
As shown by loop 7 in Fig.5,the fault point is located with the first crest moving 64 along the abscissa direction.The fault eliminate point is at the second crest.
For NLI,it has a step change when the fault signal enters the window of 4096,and its value increases suddenly;it falls to ultimate stability as the fault signal moves along the window and the proportion of fault data increases in the window,but it is larger than that in normal situation;it rises as the proportion of fault data reduces,and its value reaches the maximum with a small amount of data;it returns to a steady state gradually with normal signal only because the fault signal moves out of the window.
The abscissa of the first crest is 26,which is converted to time point as 26×64×0.01=16.64 h,and the fault point is 16.64+64×64×0.01=57.6 h.The abscissa of the second crest is 237,fault elimination point is at 237×64×0.01=151.68 h.It is consistent with that under the actual situation.This method can detect both the location of fault and the fault point and elimination point.
4.3.3.Case 3—IDV(14)
IDV(14),sticking of reactor cooling water valve,is interfered at 55 h and eliminated at 130 h.
As shown by loop 7 in Fig.6,the fault point is in the location of the first crest moving 64 along abscissa direction.The fault elimination point is in the second crest.This figure is similar to Fig.5.The abscissa of the first crest is 22 and is converted to time point 22×64×0.01=14.08 h,and the fault point is at 14.08+4096×0.01=55.04 h.The abscissa of the second crest is 202,and the fault elimination point is at 202×64×0.01=129.28 h.
4.3.4.Case 4—IDV(12)
IDV(12),condenser cooling water inlet temperature,is interfered at 56 h and eliminated at 143 h.After this small control loop set(loop 8)of Level 6(Table 6)is used to index calculation,this fault is detected.Only indexes of loops 5 and 8 need to be calculated.The result is showed in Fig.7.For NLI,the abscissa of the first crest is 25,with time point at 25×64×0.01=16 h and the fault point at 16+64×64×0.01=56.96 h.The abscissa of the second crest is 223,and the fault elimination point is at223×64×0.01=142.72 h.We can also obtain the fault point and eliminate the fault,but this case is not as clear as the previous three cases,because of lower changes of the control loop variables after the fault occurs.
The faults in the four cases can be detected and their locations are determined.The proposed method reduces the number of control loops or variables with interpretative structural for large-scale process control system and cuts off the amount of calculation when the process is a large-scale one such as TE.It should be noted that PCA-based method[32],such as HOS,is a data-driven method and belongs to those based on signal processing.Both methods are applicable to fault detection of linear and nonlinear systems.However,the methods for fault detection and fault isolation are not sat is factory.A combination with other theoretical methods is needed in order to improve the performance of fault detection and diagnosis[33].

Fig.4.TNLI,NGI and NLI of loops 2,4 and 7 with IDV(4).

Fig.5.TNLI,NGI and NLI of loops 2,4 and 7 with IDV(11).

Fig.6.TNLI,NGI,NLI of loops 2,4 and 7 with IDV(14).

Fig.7.TNLI,NGI,NLI of loops 5 and 8 with IDV(12).

Table 7 The features of the methods cited in this paper
Table 7 shows the features of the methods introduced in this paper as well as the traditional method for large-scale control system.HOS is usually used in the diagnosis and detection of large-scale control systems,with large amount of calculation and higher cost.In order to improve the efficiency of diagnosis,ISM is introduced and modified to a new method,ISM based on partial correlation coefficient.In addition,we combine it with a priori knowledge to improve the accuracy of traditional ISM,especially for a large-scale control system.The combination of HOS and ISM improves the efficiency and avoids the blindness of HOS detection.
The presented HOS-ISM based fault diagnosis framework integrates HOS and ISM and combines a priori process with data-driven methods to determine control loop,increase the accuracy of data selection,and cover the shortage of uncertainty in diagnosis result of HOS due to the selection of different data sets.The calculation of nonlinear index online,its dynamic changes,intuitive identification of the fault loop,determination and elimination of the fault point all have important practical significance.However,with out a significant change of nonlinear characteristics of the control loop,detection of fault is difficult,which will be improved in the further work.
Chinese Journal of Chemical Engineering2015年1期