,,
1.College of Economics and Management,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,P.R.China;2.College of Civil Aviation,Nanjing University of Aeronautics and Astronautics,Nanjing 211106,P.R.China
(Received 15 July 2021;revised 20 October 2021;accepted 16 December 2021)
Abstract:As the main body of air traffic control safety,the air traffic controller is an important part of the whole air traffic control system.According to the relevant data of civil aviation over the years,a mapping model between flight support sorties and air traffic controller demand is constructed by using the prediction algorithm of support vector regression(SVR)based on grid search and cross-validation.Then the model predicts the demand for air traffic controllers in seven regions.Additionally,according to the employment data of civil aviation universities,the future training scale of air traffic controller is predicted.The forecast results show that the average relative error of the number of controllers predicted by the algorithm is 1.73%,and the prediction accuracy is higher than traditional regression algorithms.Under the influence of the epidemic,the demand for air traffic controllers will decrease in the short term,but with the control of the epidemic,the demand of air traffic controllers will return to the pre-epidemic level and gradually increase.It is expected that the controller increment will be about 816 by 2028.The forecast results of the demand for air traffic controllers provide a theoretical basis for the introduction and training of medium and long-term air traffic controllers,and also provide method guidance and decision support for the establishment of professional reserve and dynamic control mechanism in the air traffic control system.
Key words:air traffic controller;demand forecast;support vector regression(SVR);grid search;cross-validation
As the frontline personnel in the air traffic control system,air traffic controllers play a direct role in maintaining traffic order,ensuring flight safety and improving operation efficiency.In recent years,the civil aviation industry has developed rapidly.However,the insufficient and unbalanced development relationship between the increasing demand for flight services and the supporting air traffic controllers leads to low control efficiency,increased flight delays and even serious accidents.Therefore,forecasting the demand of air traffic controllers based on the air traffic and its future development is of great significance for improving the ability of air traffic control management and achieving high-quality development of air traffic control.
Air traffic controller demand refers to the number of controllers required to meet the flight support demand for a flight.Air traffic controller increment refers to the number of new controllers required to meet the increased flight support demand.With regard to prediction algorithms,scholars in domestic and international regions have invented and improved various prediction methods.He et al.[1]used the improved grey prediction based on the Markov chain theory to predict the increment of air traffic controllers.The experimental results show that the improved prediction algorithm has higher prediction accuracy than the traditional grey prediction,whereas the grey prediction is not suitable for long-term prediction.Kong et al.[2]collected the controller load related data through the control simulation experiment,and used the BP neural network algorithm to predict the controller load,while the neural network is prone to fall into the local minimum value.Ashley et al.[3]proposed an autoregressive summation moving average model to predict the traffic flow through the economic evaluation of airlines and airports,and speculated the long-term development plan in the future.Takashi et al.[4]utilized the deep belief network formed by multi-layer limited Boltzmann to capture features of time series,which could be used for short-term prediction.This algorithm has certain advantages in prediction accuracy,but it also increases the complexity of the algorithm.
The commonly used forecasting models include gray forecasting models,time series models,regression analysis models,Markov models,neural network models,etc.In order to obtain better prediction results under the premise of reducing the algorithm parameters and complexity,and avoid falling into the local optimal solution,this paper chooses the support vector regression(SVR)combined with parameter optimization.The main methods of SVR parameter selection mainly include grid search algorithms,genetic algorithms and chaos optimization.Gao et al.[5]adopted the grid search algorithm to optimize parameters in order to improve the accuracy of prediction and reduce computing load.Experimental results show that the method is more effective than the genetic algorithm and the particle swarm optimization algorithm.Gencoglu et al.[6]used the chaos theory to determine the optimal input variables by reconstructing the saturated embedding dimension of phase space,while the time series processed was required to be chaotic.Since the data processed in this paper are small samples,the chaos of time series is not obvious,so grid search is used to optimize parameters.In order to improve the robustness of the algorithm and avoid over-fitting,cross-validation is combined with grid search,which reduces the error caused by the random parameter selection and improves the generalization ability of the model[7].
At present,there are few researches on the forecasting of controller demand.This paper proposes a method for forecasting controller demand based on SVR and parameter optimization.In order to select the parameters of kernel function,according to historical data of flight support sorties and controller demand,an SVR model based on grid search and cross-validation is established to predict the demand for controllers in seven regional air traffic control bureaus.The prediction results show that the prediction algorithm in this paper has higher prediction accuracy than other prediction algorithms.Then,the forecast results are adjusted according to the impact of the outbreak of COVID-19 epidemic on the civil aviation industry.Finally,based on the adjusted prediction results and the training proportion of controllers in the three civil aviation colleges,the training scale of controllers in Nanjing University of Aeronautics and Astronautics(NUAA),Civil Aviation University of China(CAUC)and Civil Aviation Flight University of China(CAFUC)is determined,which provides reference for the training mode and strategy of air traffic controllers in universities in the future.
SVR is a supervised learning algorithm and a regression algorithm derived from the classification algorithm of support vector machine(SVM).The idea is to first map the low-dimensional nonlinear data to the high-dimensional space through the nonlinear mapping function?(x)for a group of data samples(y1,x11,…,x1n),…,(ym,xm1,…,xmn),as shown in Fig.1[8].

Fig.1 SVR diagram
Then the linear regression is carried out in this high-dimensional space to construct an optimal hyperplane to minimize the distance between data samples and the hyperplane,so as to solve the linear indivisibility problem of the original space[9].The objective function and constraint conditions are shown as

wherewis the weight vector,Cthe penalty factor,andεthe upper limit of error.μi,are the relaxation factors.
Lagrange multiplier is introduced to substitute the constraint conditions into the objective function,and the partial derivative ofw,b,μi,is made to be zero by KKT condition,and the regression function is obtained[10].

Kernel function is used to calculate the dot product of nonlinear mapping between two vectors,which can simplify the calculation of inner product in the mapping space.Therefore,the selection of kernel function and its parameters has an important impact on the performance of the prediction model of support vector regression algorithm.The kernel functions commonly used in the process of solving practical problems are shown in Table 1.

Table 1 Common kernel functions
In order to transform the original nonlinear separable feature space into a high-dimensional space and make the problem linearly separable in the highdimensional space,the kernel function is used to map the data into the high-dimensional space,so that it is linearly separable[11-13].The Gaussian kernel function is used in this paper because it rarely has dimensional disaster and the number of hyperplane parameters is less than that of polynomial kernel function.
The penalty parameterCand kernel parameterσin the Gaussian kernel function are very important to the prediction results of the model.In practical application ,is used to find the optimal parameter combination(C,g)of the algorithm by combining grid search and cross-validation.
Grid search is commonly used in dealing with nonlinear optimization algorithms.The basic idea is that based on the set parameter search range,several grids are divided within the parameter variable range to form grid points,and all the crossing points in the grid are all the parameter combinations to be searched.Then,each parameter combination in the grid is calculated to select the optimal parameter combination[14].The density of grid points determines the calculation time and accuracy.Generally,the smaller the mesh size is,the higher the prediction accuracy will be,but the model calculation time will increase.The larger the mesh size is,the shorter the model calculation time will be,but the prediction accuracy may not be high enough.Therefore,when using the grid search,it is necessary to comprehensively consider the calculation accuracy and time,so as to ensure the practical application effect of the prediction model.
Cross-validation is a method used to eliminate model training bias caused by sampling randomness[15].The demand prediction model in this paper adopts theK-fold cross-validation method to find the optimal parameter combination.The original data are evenly divided intoKparts,andK-1 parts of them are selected as training data to build the model,while the remaining part is used as test data to test the model.After trainingKtimes,Kmodels are obtained,and the average prediction accuracy of the test data of the models is used as the performance index of the model.
When evaluating the training data of the model,the parameters of SVR with the highest prediction accuracy are taken as the optimal parameters to improve the effectiveness and generalization ability of the prediction model.In the process of model training,the model accuracy is represented by the minimum mean square error[12].The mean square errorηMSEis expressed as

whereyi,are the true value and the predicted value of the data,respectively.
The grid search method can traverse every possible parameter combination ofCandgin the search range,and combine with cross-validation to find the optimal parameter with the smallest mean square error,thus avoiding the emergence of local optimal solution[16].The basic steps of grid parameter optimization based on theK-fold cross-validation are described as follows.
Step 1Establish parameter search grid.Letx=[-a,a],y=[-b,b],and all step size beλ,then the grid point of the parameter isC=2x,g=2y.
Step 2The prediction error value is obtained based onK-fold.Divide the original data set intoKparts,select one subset as the test set and the remainingK-1 parts as the training set,and train to get the best(C,g)of the model.Based on this model,the one test set is predicted,and the prediction error value is obtained.Kmodels can be obtained by repeating the above steps,so that each piece of data has been tested,and the mean value of prediction error values of all models is taken as the final prediction errorηMSEcorresponding to the set of parameters.
Step 3Traverse all the grids to select the optimal parameter combination(C,g).Select another parameter combination in the search grid,and repeat Step 2.The prediction errorηMSEabout all parameter combinations can be calculated,and sorted in descending order to select the optimal parameter combination(C,g)corresponding to the minimumηMSE.
3.1.1 Dataset and experimental settings
According to the civil aviation air traffic management statistical report which issued by Air Traffic Management Bureau of Civil Aviation Administration of China,the flight support sorties and air traffic controllers’quantity from 2010 to 2019 in north China,northeast China,east China,central south,southwest,northwest and Xinjiang can be obtained,which are collected by year.Due to the large order of magnitude difference between national flight support sorties and air traffic controller demand,direct training the model with raw data will result in large model prediction errors.
Therefore,in order to improve the calculation accuracy of the prediction model,while retaining data features and preventing over-fitting,the data are firstly normalized into[0,1],so that data with different representations are set to the same scale.The normalization calculation formula is

wherexmin,xmaxare the minimum and the maximum values in the same group of data,respectively.
In this paper,search grid ranges ofCandgareC∈[2-10,210],g∈[2-10,210],and parameter search step-size is 0.1.InK-fold cross-validation,setK=5.
3.1.2 Experimental results and analysis
The number of air traffic controllers is predicted based on the SVR model of grid search and 5-fold cross-validation.The experiment is implemented based on Python3.8.The optimal parameter combination and the mean relative error(MRE)are shown in Table 2.

Table 2 Optimal parameter combinations and errors of regional forecast from 2010 to 2019
The prediction results show that SVR has the best effect on the prediction of the number of air traffic controllers in northwest China.Forecasts for the number of air traffic controllers in the southwest are poor.There are two reasons:Firstly,the statistical error of the data of original flight support sorties or air traffic controllers in the region is large;Secondly,the error caused by the limitation of the prediction model itself.The outbreak of COVID-19 in 2020 had an impact on the aviation industry.In order to control the spread of the epidemic,many flights were delayed at the airport,and the demand for air traffic controllers was also affected.Therefore,it is necessary to adjust the air traffic controller demand according to the development trend of civil aviation industry under the influence of the epidemic.According to data released by the National Civil Aviation Work Conference and the National Civil Aviation Safety Work Conference in 2021,affected by the epidemic,civil aviation carried 420 million passengers in 2020,equivalent to 63.3% of that in 2019.In 2021,China’s civil aviation passenger traffic was expected to return to about 90% of that before the epidemic.It was estimated that by the end of 2022,China’s civil aviation passenger traffic would basically recover to the pre-epidemic level.According to the reduction of civil aviation in China,it was estimated that the number of air traffic controllers in 2020 would decrease by about 37%compared with that in 2019,increase by 43% in 2021 and 11% in 2022.So far,the number of air traffic controllers basically restores to the pre-epidemic level.The adjusted air traffic controller number demand predictions in 2021—2028 are shown in Table 3.

Table 3 Forecast of the number of regional air traffic controller demand in 2020—2028
At present,the overall age structure of controllers is stable,and the retirement of controllers has little impact on the turnover of front-line personnel in the next 5—10 years,which can be ignored.Therefore,the controller demand of the current year can be obtained by subtracting controller demand of the previous year,as shown in Table 4.
In order to further analyze the prediction accuracy of the regression model proposed in this paper,a comparison experiment of unary linear regression(ULR),time series forecasting and SVR is carried out.Based on the historical data of flight support sorties in seven regions from 2010 to 2019,different regression models are verified,as shown in Fig.2.

Table 4 Forecast of air traffic controller increment from 2022 to 2028

Fig.2 Comparison of three regression methods
According to Fig.2,the average relative error of ULR is 2.86%,the average relative error of time series prediction is 2.77%,and the average relative error of SVR is 1.73%.During the experiment,the calculation time of the SVR algorithm is shorter than that of the other two algorithms based on the same platform.The comparison results verify the effectiveness of the model in this paper.
Due to the impact of COVID-19,the number of national flight support sorties decreases,and the demand for controllers reaches a trough in 2020.After that,with the control of the epidemic,the civil aviation industry gradually recovers.In 2022,the demand for controllers will increase significantly and basically recover to the pre-epidemic level.In the post-epidemic era,the number of flight support sorties will increase steadily,and the demand for controllers will gradually increase from 2023 to 2028.With the dynamic change of civil aviation industry,the demand for controllers also may have a dynamic change.When the demand for controllers decreases,it is necessary to optimize the structure of the controller team,and reasonably adjust the working time and shift frequency to reduce the unnecessary waste of resources.When the demand for controllers gradually increases,human resources should be guaranteed to ensure that there are enough positions to introduce new controllers.
3.1.3 Ablation study
The above experiments determine the optimal parameter combination of Gaussian kernel function.The flight support sorties in each region in 2020—2028 are predicted by the grey prediction method.Then the predicted number of air traffic controllers in the corresponding year can be obtained by substituting it into the regression prediction model.In order to compare the effects of different kernel functions(Table 1)on the regression model and on the forecast of the number of controllers,based on the data of the flight support sorties and the number of controllers in seven regions from 2010 to 2019,the statistical average relative errors are shown in Table 5.

Table 5 Kernel function comparison test error %
According to the error judgment in the Table 5,the Gaussian kernel function is relatively stable in SVR,and the regression model training effect is good.When predicting the number of controllers,the average relative error is 4.86%,which is smaller than that of the other three kernel functions.In addition,the Gaussian kernel function seldom has the disaster of dimensionality while realizing the nonlinear mapping.Compared with the polynomial kernel function,the number of hyperplane parameters is less and the numerical calculation is reduced.Therefore,this paper finally chooses the Gaussian kernel function.
The main source of new controllers recruited by the Civil Aviation Administration of China(CAAC)is the students who major in control and the in-service personnel of control units who have been both systematically trained in universities with training qualifications.At present,there are three major universities in China with training qualifications and certain recruitment scale:NUAA,CAUC and CAFUC.According to the increase in demand for controllers predicted in this paper and the proportion of students from different universities working for CAAC,the scale of controller training in different universities can be predicted,so as to provide reference for the teaching and enrollment of universities.The training number of college controllers from 2022 to 2028 is shown in Table 6.

Table 6 Training number of air traffic controller from 2022 to 2028
According to the forecast results of the training scale of controllers,the number of controllers needed to be trained by the three universities will reach the peak of recent years in 2022,and the total number is expected to 923.After 2023,it will fall back to the basic level of pre-epidemic,and then increase steadily year by year.With the expansion of the scale of controller training in universities,the improvement of education resources allocation and the construction of high-level teachers also need to be speeded up to meet the requirements of controller training in the new situation.
SVR and parameter optimization algorithm are adopted to predict the future demand and the training scale of air traffic controllers in universities based on the national flight support data and the existing controller data.The main conclusions are as follows:
(1)Using grid search and 5-fold cross-validation can improve the performance of the optimal solution.The accuracy of prediction algorithm can be improved highly by using SVR that also can reduce the difficulty of algorithm design and the complexity of algorithm parameters.In the future,the SVR algorithm can be improved and combined with other prediction algorithms to achieve better prediction performance.
(2)The demand for controllers in the future will change in a U-shape.Affected by the epidemic,the number of national flight support sorties will be reduced.The demand for controllers will reach a trough in 2020 and recover from 2021.It is expected that it will return to the pre-epidemic level in 2022.After that,the demand for controllers will keep rising steadily.
(3)The training scale of controllers in universities will be gradually expanded.With the improvement of the epidemic situation and the recovery of civil aviation,more controllers will be needed.It is expected that the training scale of controllers will reach the peak in 2022 when the civil aviation industry fully recovers.Then it will fall back to the normal level and expand year by year.
Transactions of Nanjing University of Aeronautics and Astronautics2021年6期