
Classification and Categorization of COVID-19 Outbreak in Pakistan

2021-12-10 11:57:34
Computers, Materials & Continua, 2021, Issue 10

Amber Ayoub, Kainaat Mahboob, Abdul Rehman Javed, Muhammad Rizwan, Thippa Reddy Gadekallu, Mustufa Haider Abidi and Mohammed Alkahtani

1 Department of Computer Science, Kinnaird College for Women, Lahore, 54000, Pakistan

2 Department of Cyber Security, Air University, Islamabad, Pakistan

3 School of Information Technology and Engineering, Vellore Institute of Technology, Tamil Nadu, India

4 Raytheon Chair for Systems Engineering, Advanced Manufacturing Institute, King Saud University, Riyadh, 11421, Saudi Arabia

5 Industrial Engineering Department, College of Engineering, King Saud University, Riyadh, 11421, Saudi Arabia

Abstract: Coronavirus causes a potentially fatal disease and normally occurs in mammals and birds. In humans, the virus generally spreads through airborne droplets of fluid secreted from the body of an infected person. Coronaviruses are a family of viruses that can be more lethal than many other viruses. In December 2019, a new variant, i.e., a novel coronavirus (COVID-19), emerged in Wuhan, China. Since January 23, 2020, the number of infected individuals has increased rapidly, affecting the health and economies of many countries, including Pakistan. The objective of this research is to provide a system to classify and categorize the COVID-19 outbreak in Pakistan based on the data collected every day from different regions of Pakistan. This research also compares the performance of machine learning classifiers (i.e., Decision Tree (DT), Naive Bayes (NB), Support Vector Machine, and Logistic Regression) on the COVID-19 dataset collected in Pakistan. According to the experimental results, the DT and NB classifiers outperformed the other classifiers. In addition, the classified data is categorized by implementing a Bayesian Regularization Artificial Neural Network (BRANN) classifier. The results demonstrate that the BRANN classifier outperforms state-of-the-art classifiers.

Keywords: COVID-19; pandemic; neural network; BRANN; machine learning

1 Introduction

The COVID-19 outbreak that appeared in Wuhan, China at the end of December 2019 was initially considered a pneumonia of unknown etiology. The virus soon spread worldwide at a rapid rate [1]. On January 30, 2020, the World Health Organization (WHO) declared the COVID-19 outbreak a Public Health Emergency of International Concern [2,3]. This virus has affected people in more than 209 nations around the world. The costs of the coronavirus outbreak are continually increasing. When this virus first started to spread, there were approximately 600 confirmed cases in China. Globally, the number of people who have died because of this virus has been increasing daily [4]. The WHO determined that the most common symptoms of this virus are tiredness, fever, and dry cough [5]. Most people with these symptoms can recover without extraordinary treatment or prescriptions. However, some patients have additional symptoms, such as a runny nose, sore throat, nasal congestion, and general or severe pain. Typically, 80% of people who become infected have mild symptoms [6]. In the United Kingdom, the National Health Service (NHS) has reported cases with more severe effects, including high fever and persistent cough. The NHS recommends that anybody with these sorts of symptoms should self-quarantine for 7 to 14 days [7]. The infection spreads between individuals in close contact who are exposed to respiratory aerosol droplets that are emitted primarily when an infected person coughs, sneezes, shouts, sings, or talks.

For the most part, the droplets do not travel significant distances. Typically, they fall to the ground or onto nearby surfaces. Transmission may also occur through small droplets that can remain suspended in the air for longer periods of time [8]. People may become infected by touching a contaminated surface and then touching their face [9]. Rapid spread is likely because the virus can be transmitted before symptoms are noticeable and by individuals who carry the virus without showing any symptoms [10]. Fig. 1 represents the worldwide spread of this coronavirus. It is believed that the virus did not spread in Pakistan the way it spread in other countries, like China, the USA, and Italy. Pakistan, with permeable borders, is sandwiched between two focal points of this coronavirus (China and Iran).

Figure 1:Worldwide spread of COVID-19

Recently, Pakistan has reinforced its precautions against COVID-19 through various strategies, such as specifying national emergency preparedness measures, compulsory thermal screening at all entry points, surveillance of regional spread, contact tracing, and information collection from various sources. Testing has been reinforced by bringing in Polymerase Chain Reaction units for SARS-CoV-2 diagnostics [11]. Resources have been deployed to set up quarantine centers in preparation for expected cases. Locations for these centers include a few urban areas and hospitals, and surveillance units have been activated to trace the contacts of confirmed cases, as suggested by the WHO [10]. The COVID-19 infection has spread to more than 213 nations, and as of April 17, 2020, there were 1,995,983 confirmed cases and 131,037 deaths [12].

Pakistan reported its first two positive cases on February 26, 2020. These cases were connected to travel to Iran [13]. The number of positive cases across the nation rose to 7,025 on April 17, 2020: 3,276 positive cases and 135 deaths in Punjab, 2,008 cases in Sindh, 993 cases in Khyber Pakhtunkhwa, 303 cases in Balochistan, 237 cases in Gilgit Baltistan, 154 cases in Islamabad Capital Territory (ICT), and 46 cases in Azad Jammu Kashmir [14].

The number of positive cases is rising rapidly every day. In fact, in most countries, the number of cases is probably much higher than recorded, due to limited testing [14,15]. Fig. 2 shows the number of total coronavirus cases in Pakistan. The exponential increase in cases has driven the Government to impose total and severe lockdowns in numerous urban areas [16].

Figure 2:COVID-19 cases in Pakistan

Fig. 3 shows the total number of COVID-19 cases, the total number of deaths, and the total number of recovered cases in different regions of Pakistan.

Figure 3:Total recovered cases,deaths,and confirmed cases in Pakistan

1.1 Problem Statement

Deaths due to COVID-19 are increasing day by day in Pakistan. The nature of the COVID-19 outbreak differs between countries. For example, in China, Iran, and France, the COVID-19 outbreak is characterized by extremely high case numbers and severe cases. The severity of an outbreak can be detected through an increase in the number of deaths. Thus, in this research, the nature of the outbreak is detected with the help of the COVID-19 dataset collected by the Government over the past few months in Pakistan. If the nature of the COVID-19 outbreak can be detected from the death rate of past months, then with the help of standard operating procedures and precautionary measures, the death rate can be reduced in the coming months in Pakistan. For outbreak detection, the COVID-19 dataset is first classified with machine learning (ML) classifiers. Then the classified dataset is categorized into severe and normal COVID-19 outbreaks using the Bayesian regularized artificial neural network (BRANN) classifier.

1.2 Motivation and Contribution

The COVID-19 death rate is high and is increasing day by day globally [17]. This research is intended to classify and categorize the nature of the outbreak in Pakistan using machine learning classifiers. In this study, a dataset of COVID-19 patients from different regions (primarily populated regions) of Pakistan is preprocessed and then classified to understand the nature of the virus and its outbreak in Pakistan. Four machine learning classifiers, Decision Tree (DT), Naive Bayes (NB), Support Vector Machine (SVM), and Logistic Regression (LR), are implemented, and their results are compared based on performance measures (i.e., accuracy, precision, and recall). The comparison indicates that the DT and NB classifiers return 100% accuracy. The classified data is input to the BRANN to categorize the COVID-19 outbreak in Pakistan and determine whether the nature of the outbreak will be normal or severe.

The remainder of this paper is organized as follows. Section 2 discusses the related work. Section 3 provides the proposed methodology to classify and categorize COVID patients. Section 4 provides the experimental analysis and results. Conclusions and suggestions for future work are presented in Section 5.

2 Literature Review

The COVID-19 virus was initially discovered in December 2019 in the population of Wuhan, China. Later, it spread to other regions of China and other parts of the world [18]. Various papers and studies have applied different techniques to COVID-19 datasets. In this section, several studies that investigate the application of machine learning algorithms to different diseases are discussed.

SVM and Mutual Information techniques have been applied to classify genes [19]. In that study, the authors claimed that the SVM classifier achieved the best mean accuracy rate. In addition, the fuzzy KNN approach has been used on a Parkinson's dataset to help build a diagnostic system that supports better clinical diagnostic decisions [20]. Elsewhere, researchers combined different machine learning techniques to propose a novel method. They computed significant features by implementing machine learning techniques to improve the accuracy of predicting cardiovascular disease. Their prediction model gives 88.7% accuracy [21]. In 2015, a combination of SVM and fuzzy logic was applied for the risk classification of diabetes. Fuzzy reasoning was used to predict the risk factors of (Type-II) diabetes, and an SVM was used to generate fuzzy rules from the Pima diabetes dataset [22].

Other researchers used the NB classifier to improve the accuracy of predicting heart disease [23]. Different machine learning techniques, such as Artificial Neural Network (ANN), random forest (RF), and K-means clustering, were implemented to predict diabetes. The ANN technique provided the best accuracy rate (75.7%) in the prediction of diabetes [24]. Some researchers also implemented machine learning techniques to predict hypertension outcomes based on medical data. In that study, the researchers evaluated four classifiers, i.e., SVM, DT, RF, and XGBoost, to meet the desired accuracy level of the prediction system. XGBoost produced the best results among the four classifiers and provided a system accuracy of 94.36% [25,26].

Other researchers used histopathological data from patients who had a lung lobectomy to treat adenocarcinoma. For both "accidental" models, adjacent to malignancies, the lungs showed edema and proteinaceous exudates in the form of large protein globules [27]. The researchers documented vascular joins with blazing gatherings of fibrinoid content, multinucleated giant cells, and pneumocyte hyperplasia. In addition, some researchers used the ANFIS model to estimate landslide susceptibility and to develop a model to predict landslides. The ANFIS model was used to train and validate the dataset [28]. Different ML classifiers have been used to develop predictive models [29,30]. In 2017, researchers proposed an SVM and fuzzy logic-based system to automatically block pornographic content on the web. SVMs have also been used in statistical learning approaches to classify hypothesis test data and compute the error rate using the Gaussian density function [31,32].

3 Proposed Methodology

Four machine learning classifiers, DT, NB, LR, and SVM, are used to classify and categorize the COVID-19 outbreak in different regions of Pakistan. The proposed system is shown in Fig. 4.

Figure 4:Proposed system for COVID-19 data classification and prediction

3.1 Dataset

The "Corona-Virus Pakistan Dataset 2020" was downloaded from Kaggle [33]. The dataset contains 13 features that represent the lab tests of suspected, confirmed, and fatal COVID-19 cases per day in the most populated regions of Pakistan (Tab. 1). The dataset features are listed in Tab. 2. The dataset has 315 rows and 13 columns, i.e., 11089 data items. The dataset was checked for null and missing values of categorical features; none were found. The data distribution of categorical features, such as Date and Province, is shown in Fig. 5.

Table 1:Selected regions of Pakistan in dataset

Table 2:Features of COVID-19 dataset

Figure 5:Data distribution of categorical features

3.2 Dataset Preprocessing

Preprocessing is necessary to avoid misclassified results and errors [34,35]. Data preprocessing involved data preparation, data exploration, data distribution, and replacing categorical features. Preprocessing resulted in a clean dataset suitable for classification. This preprocessed dataset is fed to the machine learning classifiers to produce classified results [36].
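The "replacing categorical features" step can be illustrated with a minimal sketch. This is not the authors' code: the function names, the "Cases" column, and the sample rows are hypothetical, and only the column names Date and Province come from the dataset description. It assumes that each distinct categorical value is replaced by an integer code and that missing cells appear as None or empty strings.

```python
def encode_categorical(rows, columns):
    """Replace each categorical value with an integer code, per column."""
    mappings = {col: {} for col in columns}
    encoded = []
    for row in rows:
        new_row = dict(row)
        for col in columns:
            codes = mappings[col]
            # Assign the next unused integer the first time a value is seen.
            new_row[col] = codes.setdefault(row[col], len(codes))
        encoded.append(new_row)
    return encoded, mappings


def count_missing(rows):
    """Count cells that are None or an empty string."""
    return sum(1 for row in rows for v in row.values() if v is None or v == "")


# Hypothetical rows; the values are illustrative, not taken from the dataset.
sample = [
    {"Date": "2020-04-16", "Province": "Punjab", "Cases": 10},
    {"Date": "2020-04-16", "Province": "Sindh", "Cases": 7},
    {"Date": "2020-04-17", "Province": "Punjab", "Cases": 12},
]
encoded, mappings = encode_categorical(sample, ["Date", "Province"])
```

Integer codes impose an arbitrary order on the categories, which tree-based classifiers tolerate well; for LR or SVM, one-hot encoding may be a more suitable choice.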

4 Experimental Analysis and Results

For the classification of the dataset, Google Colab was used for Python coding, and dataset categorization was implemented in MATLAB. The dataset was split into training (70%) and testing (30%) sets. The metrics used in this work are as follows.
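The formulas themselves are not reproduced in this text. Assuming the conventional definitions of the three performance measures named in Section 1.2 (accuracy, precision, and recall), and assuming they are numbered in that order, they can be written in terms of the confusion-matrix counts TP (true positives), TN (true negatives), FP (false positives), and FN (false negatives):

```latex
\begin{align}
\text{Accuracy}  &= \frac{TP + TN}{TP + TN + FP + FN} \tag{1}\\
\text{Precision} &= \frac{TP}{TP + FP} \tag{2}\\
\text{Recall}    &= \frac{TP}{TP + FN} \tag{3}
\end{align}
```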

4.1 Decision Tree Classifier

The COVID-19 dataset was classified using the DT ID3 classifier. The results are shown in Tab. 3. As can be seen, this classifier achieved 100% accuracy, precision, and recall. The confusion matrix for the DT classifier is plotted in Fig. 6a.
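As a minimal sketch of how accuracy, precision, and recall follow from a confusion matrix, the function below counts the four cells for a binary problem and derives the three measures. The function name and the example labels are hypothetical and do not reproduce the paper's predictions.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, and recall from true and predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    total = tp + tn + fp + fn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        # Guard against division by zero when a class is never predicted or present.
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical labels for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
perfect = binary_metrics(y_true, y_true)
```

A classifier whose predictions match every label, as reported for DT here, yields 1.0 for all three measures.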

Table 3:Results achieved for decision tree classifier

4.2 Naive Bayes Classifier

The NB classifier is implemented on the COVID-19 dataset because it is a continuous dataset. The NB classifier also achieved 100% accuracy (Tab. 4). The confusion matrix for this classifier is shown in Fig. 6b.

4.3 Logistic Regression Classifier

The LR classifier has been used successfully to predict various diseases [37,38]. The testing data is predicted for the first 25 entries. The histogram of the predictions is shown in Fig. 7. Figs. 8a and 8b depict the confusion matrices for the LR and SVM classifiers, respectively. The Receiver Operating Characteristic (ROC) plot for the COVID-19 dataset, based on the true positive rate and false positive rate, is shown in Fig. 9a. The area under the LR ROC curve is 91%. The results obtained for LR are listed in Tab. 5.
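The area under a ROC curve can be computed without plotting, because it equals the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one. The sketch below uses that pairwise formulation; the function name and the example scores are hypothetical, and this is not necessarily how the reported 91% was obtained.

```python
def roc_auc(y_true, scores):
    """Area under the ROC curve via pairwise comparison of positive and negative scores."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical predicted probabilities for illustration only.
auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

The pairwise loop is quadratic in the number of examples, which is acceptable for a dataset of a few hundred rows.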

Figure 6:Confusion matrices for both classifiers(a)Decision tree classifier(b)Naive Bayesian classifier

Table 4:Results achieved for Naive Bayesian classifier

Figure 7:Histogram of predicted probabilities

4.4 Support Vector Machine Classifier

The linear SVM classifier achieved a precision of 98%. The ROC curve for the multiclass SVM is depicted in Fig. 9b. It shows that the area under the ROC curve is 100% for class 1 and 88% for class 2. Tab. 6 lists the SVM results computed using formulas (1–3).

Figure 8:Confusion matrices for(a)LR and(b)SVM classifiers

Figure 9:ROC Curve for(a)LR and(b)SVM classifiers

Table 5:Results achieved for logistic regression classifier

The DT and NB classifiers yielded 100% accuracy for this dataset. Tab. 7 shows the results for the DT, NB, LR, and SVM classifiers. The classified dataset is input to an ANN (Section 4.5) for data categorization.

Table 6:Results achieved for SVM classifier

Table 7:Comparison of classification results

4.5 Artificial Neural Network

The Artificial Neural Network classifier is trained with Bayesian regularization to categorize the search space into two classes: normal outbreak and severe outbreak. This classifier is used to categorize the nature of the COVID-19 outbreak in Pakistan based on data collected from various regions. Fig. 10 shows the COVID-19 dataset simulation architecture.
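The training objective is not stated in this text. Under the standard formulation of Bayesian regularization, which is assumed here rather than taken from the paper, the network minimizes a weighted combination of the data error and the size of the weights. The symbols below are illustrative: t_i is the target for example i, a_i is the network output, w_j is a network weight, and alpha and beta are regularization parameters that are re-estimated during training.

```latex
\begin{align}
F(\mathbf{w}) &= \beta E_D + \alpha E_W \\
E_D &= \sum_{i=1}^{N} \left(t_i - a_i\right)^2 \\
E_W &= \sum_{j=1}^{M} w_j^2
\end{align}
```

A larger ratio of alpha to beta favors smaller weights and a smoother network response, which is how this scheme limits overfitting on small datasets.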

Figure 10:COVID-19 dataset simulation architecture

Algorithm 1: Algorithm for Classification
1. Provide the Input Parameters
2. Data Preprocessing
3. Checking of Conditional Probability
4. While (error-rate ... threshold-value
7. Back Propagation
8. Weight setting
9. End If
10. End While
11. Neural Network's Bayesian regularization
12. Classification Results
13. Classification of the Outbreak Nature

The output is labeled 0 or 1, where 0 represents a normal outbreak and 1 represents a severe outbreak. The output is labeled based on input parameter values. Tab. 8 shows the classified, highest-ranked features of the dataset that were selected as inputs for the neural network. The Error Histogram and Regression values are given in Tab. 9.

Table 8:Selected inputs of COVID-19 dataset

In Tab. 9, from the 852 dataset entries, 596 instances are selected for training, 128 are selected for validation, and 128 are selected for testing. Furthermore, 50 hidden neurons with one epoch are used for the neural network. The confusion matrix results demonstrate that the predicted class matches the actual class with 99.88% accuracy. This indicates that the BRANN classifier predicts the results accurately for this dataset. Tab. 9 shows that the BRANN classifier correctly categorized 128 data items in each of the validation and testing processes.
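The reported counts (596, 128, and 128 out of 852) are consistent with a 70%/15%/15% partition. The sketch below reproduces those sizes under that assumption; the ratio, the random shuffling, the seed, and the function name are not stated in the paper and are chosen here for illustration.

```python
import random


def split_indices(n, train_frac=0.70, val_frac=0.15, seed=0):
    """Shuffle row indices and partition them into train, validation, and test sets."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)  # seeded for reproducibility
    n_train = round(n * train_frac)
    n_val = round(n * val_frac)
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]      # remainder goes to the test set
    return train, val, test


train, val, test = split_indices(852)
sizes = (len(train), len(val), len(test))
```

Assigning the remainder to the test set guarantees that the three subsets always cover every row exactly once, regardless of rounding.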

Table 9:Bayesian regularization results

Figure 11:Error histogram of Bayesian regularization ANN algorithm

Figure 12:Bayesian regularization regression plot

From Fig. 11, it is evident that the BRANN has zero errors. This indicates that the neural network fits the data perfectly. Fig. 12 shows how accurately the neural network determines the regression function for the dataset. The actual network outputs are shown in comparison with the target outputs. How accurately the model fits the data is represented by the colored line in Fig. 12. This line should closely follow the real output from the left to the right corner of the regression plot. Fig. 12 shows that the COVID-19 dataset is closely fitted by the BRANN model.

Fig. 13 shows the training state of the BRANN (gradient, mu, parameters, the sum of squared parameters, and validation checks). All of them run to 1000 epochs, which indicates good performance on the dataset. Fig. 14 represents the mean square error of the BRANN. The blue and red lines represent the training and testing mean square error, respectively, and the dotted line marks epoch 1000. Fig. 14 shows the best training performance of the BRANN.

Figure 13:Training state of BRANN

Figure 14:Mean square error of neural network BRANN

Fig. 15 is the confusion matrix of the BRANN classifier. The BRANN classifier gives 99.88% accuracy for training, testing, and validation on the COVID-19 dataset for Pakistan. The outcome is divided into two classes, 0 and 1, where 0 denotes that the outbreak is normal and 1 denotes that the outbreak is severe. Five features are selected as inputs according to their importance, as ranked by the ML classifiers.

Figure 15:BRANN confusion matrix

The COVID-19 dataset for Pakistan is classified through machine learning techniques, and their accuracy results are compared. The results show that the DT and NB classifiers give 100% accuracy for this dataset. The BRANN then fits the classified dataset closely and categorizes it into a normal class and a severe class for the COVID-19 outbreak in Pakistan.

5 Conclusion

The proposed system categorizes the COVID-19 outbreak in Pakistan based on a dataset collected in different regions of Pakistan. Machine learning classifiers play a vital role in the classification, categorization, and prediction of dangerous diseases such as COVID-19. With the help of various machine learning techniques, the loss from COVID-19 can be minimized in the upcoming months in Pakistan. First, we classified the COVID-19 dataset using different machine learning classifiers. Then, the BRANN classifier was used to categorize the nature of the outbreak as normal or severe. The experiments show that the BRANN provides a best-fit regression plot with a minimal error rate. In future, the proposed model can be further tested on a larger dataset [39,40] to test its scalability.

Funding Statement: The authors are grateful to the Raytheon Chair for Systems Engineering for funding.

Conflicts of Interest: The authors declare that they have no conflicts of interest to report regarding the present study.
