,,*,
1.College of Computer Science and T echnology,Civil Aviation University of China,Tianjin 300300,P.R.China;
2.Key Laboratory of Intelligent Application Technology for Civil Aviation Passenger Service,Beijing101318,P.R.China
Abstract:T he homogeneity analysis of multi-airport system can provide important decision-making support for the route layout and cooperative operation.Existing research seldom analyzes the homogeneity of multi-airport system from the perspective of route network analysis,and the attribute information of airport nodes in the airport route network is not appropriately integrated into the airport network.In order to solve this problem,a multi-airport system homogeneity analysis method based on airport attribute network representation learning is proposed.Firstly,the route network of a multi-airport system with attribute information is constructed.If there are flights between airports,an edge is added between airports,and regional attribute information is added for each airport node.Secondly,the airport attributes and the airport network vector are represented respectively.The airport attributes and the airport network vector are embedded into the unified airport representation vector space by the network representation learning method,and then the airport vector integrating the airport attributes and the airport network characteristics is obtained.By calculating the similarity of the airport vectors,it is convenient to calculate the degree of homogeneity between airports and the homogeneity of the multi-airport system.The experimental results on the Beijing-Tianjin-Hebei multi-airport system show that,compared with other existing algorithms,the homogeneity analysis method based on attributed network representation learning can get more consistent results with the current situation of Beijing-Tianjin-Hebeimulti-airport system.
Key words:air transportation;multi-airport system;homogeneity analysis;network representation learning;airport attribute network
Multi-airport system means multiple airports serving the same region,and the analysis of multiairport system has become a hot research topic.The research on multi-airport system mainly focuses on three aspects:Analyzing the development of multiairport system from multiple perspectives[1-3],airport selection of passengers in multi airport system[4-6],and the homogeneity of multi-airport system[7-10].
The homogeneity of multi-airport system refers to the phenomenon that the airports in the multi-airport system lose their unique characteristics and gradually converge,which is mainly manifested in the large overlap of route layout.Research on the homogeneity of multi-airport systems is of great significance to the synergetic development of multi-airport system,and it can help prevent multi-airport system from losing its unique characteristics,wasting resources,and restricting the development of multi-airport system.
Since there is no uniform standard for the homogen-eity of multi-airport system,researchers can only indirectly evaluate the homogeneity of multi-airport system from the perspective of calculating similarity.Ref.[7]obtained the convergence coefficient of each airport in the Pearl-River-Delta multiairport system via a comprehensive calculation of the proportion and growth rate of the passenger transportation capacity.Jiang et al.[8-9]analyzed the degree of homogeneity of the Yangtze-River-Delta multi-airport system by calculating the difference between the flight frequency,passenger throughput,and cargo and mail throughput of each airport in the proportion of the data in the entire region.These methods treated all airports in the multi-airport system as discrete individuals.They all lacked the consideration of the network formed by the multi-airport system and its navigable airports.
Ref.[10]considered the homogeneity of multiairport system from two aspects:Airport attribute similarity and airport network similarity.Airport attributes include the number of infrastructure,passenger throughput,etc.Airport network is constrcuted by the airports in the multi-airport system and their navigable airports.However,some airport attributes are less related to the homogeneity of multi-airport system,which seriously affect the accuracy of the homogeneity calculation.In addition,the attribute information of airports and the airport network are calculated separately and then weighted fused directly,which leads to the loss of the characteristics of the two parts of the algorithm.In order to better integrate airport attributes and airport network information,the airport network and airport attributes should be calculated synchronously under the same framework to form an organic integration of these two parts.
In order to solve the above problems,this paper proposes a multi-airport system homogeneity analysis algorithm based on the airport attributed network representation learning.The airport attribute information has also been changed from the number of runways and other infrastructures to the route region information,which is believed to be able to better calculate the degree of homogeneity of the multi-airport system.Airport attribute information and airport network information are no longer calculated separately.The airport network node vector and the attribute node vector are uniformly and synchronously embedded into a unified low dimensional space through network representation learning.The homogeneity of multi-airport system is calculated by averaging the similarity between the airport vectors obtained by the airport attributed network representation learning.Finally,the rationality of the algorithm is evaluated based on the data of Beijing-Tianjin-Hebei airport multi-airport system.
The homogeneity of multi-airport system is calculated by analyzing the route layout.Airports and their navigable airports naturally form an airport graph,as shown in Fig.1.In Fig.1,the solid circles represent airports in the multi-airport system,and the hollow circles represent navigable airport nodes.The connection line represents the connection between airports based on the route.Each airport node has attribute information,which is represented by a rectangular box.

Fig.1 Airport attribute network diagram
Through graph representation learning,the node vector representation of airports considering the global route layout of multi-airport system can be obtained,and the homogeneity degree between airports can be calculated via the node vector.Therefore,the homogeneity algorithm of multi-airport system is expressed as a problem of airport network representation learning.In order to better learn the vector representation of airport nodes,the route region attribute is introduced into the airport network.So it is formulated as an attributed networkrepresentation learning problem.In order to get the unified representation of airport network and airport attribute vector,it is necessary to represent the airport structure vector and the airport attribute vector respectively,and optimize their fusion embedding representation through a unified loss function.
N={G,A}is an airport attribute network withnairport nodes,G∈Rn×nrepresents the airport network structure,which is a adjacency matrix with weight.The element(i,j)of the matrix means that there is a route between the airportiand the airportj.If there is no route between them,the value is set to 0.A∈Rn×mis a matrix representing the attribute information of the airports in the network.The data in rowiof matrixArepresents them-D attributes of the airporti.
The algorithm process diagram is shown in Fig.2.The model includes two branches:One constructs a learning modelH(G)based on the node similarity of the node network structure to obtain the potential vector space representation of node structure,the other constructs a modelH(A)to learn the potential vector space representation of airport node attribute information.The two learned vector representations are fused into a unified low dimensional space vector of airport features,represented asH.

Fig.2 Network representation learning of airport attribute
The network structure similarity matrixS(G)is constructed by the structural similarity between the airport nodes,and the structural similarity of the nodes is calculated by the cosine similarity of the nodes in the airport network structure matrixG,where elements(ijG)represents the similarity between nodesiandj.If the two airport nodes are similar,the hidden vectors should be similar in the network node vector space obtained by network representation learning[11].Then,the vector representation learning model of airport network structure is

wherenis the total number of airport network nodes.h(iG)andh(jG)are the hidden vector representations in network representation space,anddi(G)anddj(G)the normalizations forh(iG)andh(jG),which are the sums of rowiand rowjdata in matrixS(G),respectively.
The Laplacian matrix and trace of the matrix are introduced to transform Eq.(1)by referring to the related knowledge of spectral clustering[12].The transformed model and the constraint condition are

whereD(G)is a diagonal matrix,the element in the rowiand columni,andEthe identity matrix.By solving the objective functionZ G,the representation vectorH(G)of airport node structure representation is obtained.
Similar to the representation learning process of the airport node network structure,the airport node attribute information matrixAis obtained via the cosine similarity calculation of the node attribute informationS(A).In order to minimize the divergence degree between the node similarity calculated by Euclidean distance and the node similarity in similarity matrix,the objective function is constructed as

whereD(A)is a diagonal matrix,and each element on its diagonal is the sum of all elements in the corresponding row of the matrixS(A).By solving the objective functionZ A,the representation vectorH(A)with node attribute characteristics is obtained.
Neither network vector nor attribute information vector of airport node can individually express the characteristics and contents of the entire airport attribute network.These two need to be merged into a unified vector space.
In order to make the node attribute information representation vectorH(A)and node network structure feature vectorH(G)better complement with each other in the whole node vector representation,the principal component analysis method is used to express the correlation between them,as shown in Eq(.8).Similarly,in order to make the features and contents expressed byH(G)andH(A)embodied inH,the correlations betweenH(G)andH,H(A)andHshould also be reflected in the objective function

On this basis,the sum ofZ G,Z Aand their correlation are obtained.The vector spacesH(G)andH(A)are fused by maximizing the solution.However,the direct summation operation will lead to a large proportion of node network structure characteristics or node attribute information in the overall node representation.Therefore,two weight adjustment parametersλ1andλ2are introduced to tradeoff the attribute information representationH(A)and the association betweenH(A)andH(G)in the objective function.The objective function is solved as in Eq.(11),and the constraint conditions are as in Eq.(12).The overall node vector spaceHis obtained by solving the objective functionZin the entire network.

If Eq.(11)is directly solved in the solution space of the whole network,it has high requirements on the computing performance and time of the computer,and it may not get the expected solution.In order to solve this problem,we change the global optimal solution to the local optimal solution,and solve the equations by Lagrange extremum.The specific optimization method is as follows.
Firstly,calculating the second-order partial derivative ofZin Eq(.11),we get

Then,let the second derivative be equal to 0,we have

whereα1,α2,α3are the eigenvalues.Eqs.(12,13)are updated iteratively until the loss difference of Eq.(11)is less than the set threshold.Finally,the firstDeigenvectors in the result are taken as the final vector space solutionH.
After obtaining airport node representation vectorH,the homogeneity degree within the multi-airport system is calculated.Taking any two airportsiandjas an example,if the representation vectors of airportsiandjareh i=[x i1,x i2,…,x id]andh j=[y j1,yj2,…,y jd],respectively,the homogeneity degree between airportsiandjcan be expressed as

Based on the above-mentioned homogeneity degree between airports,the homogeneity degree of one airport in the multi-airport system can be calculated,which is the average value of the homogeneity degree between other airports and this airport,as shown in Eq.(16).The degree of homogeneity of the multi-airport system is to re-average the average homogeneity degree of a single airport in the multiairport system

whereT ijis the element in the matrixT,1≤i,j≤n0,which represents the homogeneity degreeamongnairports;the average homogeneity de0gree of a single airport in the multi-airport system;andT A_Gthe homogeneity degree of the multi-airport system.
The experiment uses the data of Beijing-Tianjin-Hebei multi-airport system,including two parts:(1)The airport network constructed by the airports in the multi-airport system and their navigable airports.The data are derived from the flight data of Beijing-Tianjin-Hebei multi-airport system in 201X.The data example is shown in Table 1.The first column represents the airports within the the multi-airport system,and the second column represents the airports with flights within the multi-airport system.The airport is represented by three character code:HDG,Handan Airport;NAY,Beijing Nanyuan airport;PEK,Beijing Capital Airport;SJW,Shijiazhuang Airport;TSN,Tianjin Airport;CAN,Guangzhou Baiyun Airport;BAV,Baotou Airport;CGD,Changde Airport;SHA,Shanghai Hongqiao Airport;HAK,Haikou Airport,etc.(2)The attributes of airport nodes in the airport network include the province and location information of each airport.The province information of the airport comes from the publicly released airport introduction,and the codes and administrative regions of Chinese provinces issued by the National Bureau of statistics[13](Table 2).
In this paper,the three-character code is replaced with the digital code,and the administrative region is represented by numbers.All the nodes of the airport network constructed by Beijing-Tianjin-Hebei airport multi-airport system and its navigable airports are replaced by numbers.The seven administrative regions are North China,Northeast China,East China,Central China,South China,Southwest China and Northwest China,coded with numbers 1—7.The experimental data of node attribute information are shown in Table 3.
Based on Tables 1,3,the airport node network structure matrix and the airport node attribute infor-mation matrix are constructed respectively.The algorithm proposed in Section 2 is used to carry out the experiment.The airport node attribute information and network structure association value is set as 0.1,the weight of airport node attribute information is set as 0.4,and the dimension of airport node representation vector is set as 32.

Table 1 Airport network related data example

Table 2 Examples of province and administr ative r egion in China

Table 3 Experimental data of airport attribute information
The algorithm in Ref.[10]is chosen as the benchmark algorithm.It considers the homogeneity of multi-airport system from two aspects:Airport attribute similarity and airport network similarity.In this method,the node vector representation of the airport network is obtained through the network representation learning algorithm node2vec[14-15].The dimension of the airport node vector is set to 32.The homogeneity degree between airports in Beijing-Tianjin-Hebei multi-airport system is calculated by fusing the airport node vectors and the similarity computed by the airport attributes.
The homogeneity degree is expressed by the color depth.In order to more intuitively express the homogeneity degree between airports in Beijing-Tianjin-Hebei multi-airport system,the confusion matrix is used to express the result,as shown in Table 4 and Fig.3.

Table 4 Homogeneity of Beijing-Tianjin-Hebei air port group in 201X by baseline method

Fig.3 Homogeneity degree of Beijing-Tianjin-Hebei multiairport system based on baseline algorithm[10]shown in heat map
The experimental results are shown in Table 5.According to Eq(.14),the average homogeneity degrees of Beijing Capital Airport,Tianjin Airport,Shijiazhuang Airport are 0.623 0,0.617 5,0.629 5,respectively.Similarly,the average homogeneity degree of other five airports in the multi-airport system and each airport can be obtained.On this basis,according to Eq.(15),the average homogeneity degree of each airport in the airport group is calculated again,and the homogeneity degree of the multi-airport system can be obtained.Therefore,the homogeneity degree of the development of Beijing-Tianjin-Hebei multi-airport system in 201X is 0.583 7.All the homogeneity degrees range from 0 to 1,and that of multi-airport system is greater than 0.5.Therefore,the development of Beijing-Tianjin-Hebei airport group in 201X has obvious homogeneity characteristics.

Table 5 Homogeneity of Beijing-Tianjin-Hebei air port group in 201X by the pr oposed method
It can be seen from Table 5 that the three largescale airports in the Beijing-Tianjin-Hebei multi-airport system have a great degree of homogeneity.As a super large airport in the world,Beijing Capital Airport has great development advantages in Beijing-Tianjin-Hebei region.From the information of navigable airport,province,administrative region and transportation scale,Tianjin Airport and Shijiazhuang Airport rank second and third in the multiairport system,showing a high homogeneity degree with the Beijing Capital Airport,with the degree of homogeneity being 0.898 6 and 0.736 3,respective-ly.The homogeneity degree between Tianjin Airport and Shijiazhuang Airport is 0.731 1,both of which are alternate airports of Beijing Capital Airport and regional hub airport,and the homogeneity degree is also obvious.The more intuitive confusion matrice of homogeneity degree among airports of Beijing-Tianjin-Hebei airport group is shown in Fig.4,and the depth of color represents different degrees of homogeneity.

Fig.4 Homogeneity degree of Beijing-Tianjin-Hebei airport group based on learning method of attribute network representation shown in heat map
The homogeneity degree of Beijing-Tianjin-Hebei multi-airport system calculated by the baseline method[10](represented as M 1)is 0.516 6,and that calculated by the attributed network representation learning method(represented as M 2)is 0.583 7.Obviously,the homogeneity calculated by M 1 is less than that calculated by M 2.From the point of the homogeneity among airports,the average homogeneity of a single airport obtained by M 1 is also lower than that obtained by M 2.Moreover,the homogeneity degree between airports obtained by M 1 is relatively concentrated,while the homogeneity degree between airports obtained by M 2 is quite different,ranging from 0.413 4 to 0.898 6.
The results of M 2 are more reasonable:(1)The navigable airport numbers of the eight airports are 150,88,52,54,6,9,12 and 3.Taking the homogeneity with Beijing Capital Airport as an example,the homogeneity degrees of Tianjin Airport,Shijiazhuang Airport,Nanyuan Airport should be decreased but higher than those of the other four airports.However,there is little diffrence between the homogeneity degrees obtained by M 1 algorithm,and the homogenization degrees of Tangshan Airport,Qinhuangdao Airport are higher than that of Nanyuan Airport,which is unreasonable.(2)The homogeneity degree obtained by M 2 between Handan Airport and Nanyuan Airport is relatively large,because these two airports are the only airports that are not accessible to North China.(3)The homogeneity degrees among several smallscale airports calculated by M 1 is higher than those by M 2,because indicators such as infrastructure in M 1 occupy a relatively large proportion.The number of these infrastructures is similar in small airports,but they cannot reflect the airport homogeneity degree.From the perspective of route layout,although the navigation directions have a certain degree of overlap,the proportion is quite different.The only same navigable airport among the four airports is Shanghai Pudong Airport.Both Handan Airport(six navigable airports)and Zhangjiakou Airport(three navigable airports)have no routes in Shandong,Xi’an,Neimeng directions,so the homogeneity degree should not be so high.In summary,the result of M 2 is consistent with the actual situation,but the result of M 1 is quite different.Therefore,adding the attribute information to the airport node has a great positive influence on the homogeneity calculation of the multi-airport system.
The attribute information which affects the route layout is added to each airport node in the airport network,and a multi-airport system homogeneity analysis algorithm based on attributed network representation learning is proposed.In this algorithm,the vector representations of the structural characteristics of the airport node network and the vector representation of the attribute information feature of the airport node are fused into a uniform airport node vector space,and then the homogeneity degree of the airport group is calculated via the similarity of these airport vectors.The experimental results show that the homogeneity degree of multi-airport system calculated by the proposed method with airport node attribute information outperforms the other fusion methods.
AcknowledgementsThis work was supported by the Natural Science Foundation of Tianjin(No.20JCQNJC00720)and the Fundamental Research Fund for the Central Universities(No.3122021052).
AuthorsDr.LIU Caihua received the B.S.degree in software engineering and the Ph.D.degree in computer application technology from Nankai University,China,in 2010 and 2017,respectively.She is a lecturer at the School of Computer Science and Technology,Civil Aviation University of China.Her research interests include computer vision and machine learning.
Prof.FENG Xia is a professor at the School of Computer Science and Technology,Civil Aviation University of China.Her research interests mainly focus on data mining and machine learning.
Author contr ibutionsDr.LIU Caihua designed the study,complied the models and wrote the manuscript.Miss CAI Rui contributed to data analysis,result interpretation and manuscript revision.Prof.FENG Xia contributed to the discussion and revision of the study.Prof.XU Tao contributed to the discussion and background of the study.All authors commented on the manuscript draft and approved the submission
Competing interestsThe authors declare no competing interests.
Transactions of Nanjing University of Aeronautics and Astronautics2021年4期