999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

Contribution of Ambient Air Pollution on Risk Assessment of Type 2 Diabetes Mellitus via Explainable Machine Learning*

2023-07-13 02:11:38DINGZhongAoZHANGLiYingLIRuiYingNIUMiaoMiaoZHAOBoDONGXiaoKangLIUXiaoTianHOUJianMAOZhenXingandWANGChongJian
Biomedical and Environmental Sciences 2023年6期

DING Zhong Ao , ZHANG Li Ying , LI Rui Ying , NIU Miao Miao , ZHAO Bo , DONG Xiao Kang ,LIU Xiao Tian, HOU Jian, MAO Zhen Xing, and WANG Chong Jian,4,#

1.Department of Epidemiology and Biostatistics, College of Public Health, Zhengzhou University, Zhengzhou 450001,Henan, China; 2.Department of Software Engineering, School of Computer and Artificial Intelligence, Zhengzhou University,Zhengzhou 450001, Henan, China; 3.Department of Statistics, University of Illinois at Urbana-Champaign, Champaign,U.S.A; 4.NHC Key Laboratory of Prevention and Treatment of Cerebrovascular Diseases, Zhengzhou 450001, Henan, China

Type 2 diabetes mellitus (T2DM) is recognized as a heterogeneous and complicated disease that is able to influence individuals at various life stages[1].Apart from traditional predictors such as age, family history of diabetes, body mass index, and so on,ambient air pollution is also shown to increase the risk of T2DM in previous studies.However, previous T2DM risk assessment models barely included air pollution features as the predictors.Machine learning algorithms are widely used for disease prediction model construction, and demonstrate superior discrimination abilities and greater effectiveness than statistical methods[2].However,the principle of “black box” in machine learning greatly hindered the interpretability of the model,especially for medical decisions[3].The SHapely additive exPlanations (SHAP) based on the game theory was proposed by Lundberg et.al to develop the explainable machine learning, and the SHAP methods were able to display the feature contributions as well as interaction effects in the model[4,5].This study aims to reveal the contribution of air pollutants exposure in the T2DM risk assessment model as well as air pollutants’ effects on traditional predictorsviaSHAP.

Participants in this study were derived from the Henan Rural Cohort.A detailed description of this cohort study was posted previously[6]and the brief introduction was provided in the supplementary material.A total of 38,258 individuals were finally included in this analysis, and the flow chart of the data processing procedure is shown in Supplementary Figure S1 (available in www.besjournal.com).The air pollutants exposure of an individual was evaluated by a 3-year annual mean concentration of 4 ambient air pollutants, listed as the nitrogen dioxide (NO2) and particulate matter with an aerodynamic diameter ≤ 1.0 μm,≤ 2.5 μm, ≤10.0 μm (PM1, PM2.5, PM10)[7].The definitions of T2DM are listed as follows: (1) FBG ≥ 7.0 mmol/L; (2)T2DM patient diagnosed by doctors previously and used anti-glycemic drugs or insulin in the past two weeks.A detailed description of the exposure,outcome and covariates assessment methods were placed in the supplementary material.

In this study, we determined the 20 traditional variables and the air pollutants exposure-related variable as the candidate variables[2].After variable selection, the Gradient Boosting Machine (GBM) was applied to model construction with selected variables in the analysis.To explain the effect of air pollutants in T2DM risk assessment models, SHAP was employed to show the contribution of predictors as an additive feature attribution method.A detailed description of the model development was provided in the supplementary material.

In order to calculate the mixture of air pollutants exposure, the quantile g-computation was employed in this analysis.The calculating equation of this method is shown below; detailed description of the formulas was placed in the supplemental material.

When describing the characteristics of predictors, numbers (frequencies) were used for categorical variables and mean ± Standard Deviation was used for continuous variables.The chi-square test (or Fisher’s exact test) was used for comparisons between categorical variables, whereas thet-test was used for continuous variables.The area under the curve (AUC) of the receiver operating characteristic curve (ROC) was used to evaluate the discriminative performance and the brier score (BS)was employed for calibration evaluation.For the comparison of AUCs, DeLong test was used.It was considered statistically significant when a doubletailedPvalue was less than 0.05.Statistical tests were performed using R 3.6.2 and SPSS 21.0 (IBM,Chicago, USA).

A total of 38,258 individuals were included in the analysis, and 3,564 T2DM patients were found in the overall study.Compared with the individuals with non-T2DM, those with T2DM tended to be older,fatter, and their heart rate as well as pulse pressure were higher than healthy individuals (P< 0.05).Detailed characteristics are shown in Supplementary Table S1 and Supplementary Table S2 (available in www.besjournal.com).Coefficients of the quantile gcomputation are shown in Supplementary Table S3(available in www.besjournal.com).After adjusting for covariates, there existed an association of air pollutants mixture with T2DM risk (odds ratio,OR1.22, 95%CI1.16–1.27).After stratifying the QGS by the tertiles, the subgroups all indicated this association in this analysis [OR1.30 (1.18, 1.43), 1.44(1.31, 1.59),P< 0.001], suggesting that higher exposure of air pollutants increased the prevalence risk of T2DM.The detailed information is shown in Table 1.The Principal Component Analysis and the air pollution score also indicated the tendency, and detailed information could be found in Supplementary Table S4 (available in www.besjournal.com).Although previous research confirmed the effects of long-term exposure to ambient air pollution on T2DM, the association of a mixture of air pollutants with T2DM prevalence was still unknown.Consistent with the results of previous studies[8], we employed three mixing approaches to validate that higher air pollutants exposure increased the risk of T2DM in this analysis.

Table 1. Associations (ORs and 95% CI) of the mixture of ambient air pollutants with T2DM

After the univariate logistic regression and collinearity diagnosis, nine variables (age, gender,family history of diabetes, more vegetable and fruit intake, physical activity, body mass index, waist-tohip ratio, pulse pressure, and heart rate) were finally chosen as traditional predictors.The GBM model contained air pollutants exposure got good discrimination (AUC 0.787) and acceptable calibration (brier score, BS 0.076), better than the traditional model (AUC 0.764, BS 0.079).The detailed information can be found in Table 2 and Supplementary Table S5 (available in www.besjournal.com).The results showed that air pollution posted as a hazardous factor for T2DM,while ambient air pollution can also improve the prediction performance of traditional models to some contents.

Table 2. Comparison of the performance metrics with and without air pollutants

The output of SHAP supplied an approach to explain the complex relationships in the GBM model.In Supplementary Figure S2 (available in www.besjournal.com), waist-to-hip ratio (WHR)ranked first in the SHAP value ranking (SHAP mean value 0.509).However, when adding air pollutants variable into the model, the air pollutants exposure ranked fifth (SHAP mean value 0.238),simultaneously altering the order of traditional predictors in Supplementary Figure S3, (available in www.besjournal.com).Additionally, the summary plot is chosen to indicate the effect direction between predictors and T2DM (Figure 1).Air pollutants exposure performed well in the plot with a long right tail, which indicated that a high concentration of ambient air pollution led to an increased prevalence risk of T2DM.Additionally, the asymmetric distribution of effect magnitudes that air pollutants exposure had on T2DM predicted cases demonstrated non-linear associations between air pollutants exposure and the risk of T2DM[9].The SHAP summary plot exceedingly provided vital evidence on the hazardous effect of air pollution,which was consistent with previous statistical analysis[8].SHAP proposed a rich visualization of feature contributions based on individuals, which indicated that air pollution elevated the risk of T2DM in an intricate way along with other features.The interaction plot was also employed to present the complex effects in the model.An interesting interaction effect can be found between age and air pollutants.In Supplementary Figure S4 (available in www.besjournal.com), a step-by-step increasing tendency was shown in individuals aging from 40 years to 60 years.However, when considering air pollutants exposure of different ages, elder individuals (age > 60) with higher air pollutants exposure seemed to be more dangerous, while younger individuals (age < 40) with higher air pollutants exposure had lower SHAP values (shown in Supplementary Figure S4).The participants aged 27–30 years drag down the SHAP value for nearly 0.2–0.3 points.Similar interaction effects were also observed in other variables (Supplementary Figure S5 and Supplementary Figure S6, available in www.besjournal.com).Wang et al.also employed the deep learning neural networks with SHAP to explain prediction for mental disorders[10].Consistent with that, the results of SHAP analysis visualized the complex interaction effects.

Figure 1.Feature importance ranking of 9 variables in the model.This summary plot illustrated the entire distribution of impacts each feature has on the model output.WHR,waist-to-hip ratio.

Previous studies have indicated the hazardous effect of air pollutants.However, no research had explored the role of air pollution in T2DM risk assessment to our best knowledge.Moreover,although SHAP with machine learning models was already applied to the air pollution research, the impacts of air pollution on T2DM were still unclear.To our knowledge, this is the first study that focuses on the effects of ambient air pollutants on T2DM resorting to SHAP.The GBM algorithm also accounts for the non-linear interactions which cannot be adequately modeled in statistical models, and the SHAP richly visualizes the interactions and feature contributions.However, limitations also exist in this study.We conducted this analysis in a crosssectional study with no follow-up data.Moreover,the biological mechanism needs to be further investigated.Future studies can focus on the etiology pathway of air pollutants-caused T2DM.

In summary, the consideration of personal air pollution exposure elevated the identification performance of T2DM cases in the T2DM risk assessment model.Additionally, the explainable machine learning method (SHAP) also reveals the contributing effects of mixture of ambient air pollution as well as its interaction effects with tradition predictors such as age.The study demonstrates the significance of considering environmental pollution exposure as the risk factor,which facilitates the prevention and management of T2DM.The human health is influenced by the interaction between the environment and the individual’s condition, and it is therefore significant to further investigate the contribution of incorporating the personal environmental exposures in the risk assessment models which for the primary care physicians' ability to assess the risk of developing chronic diseases.

No potential conflicts of interest were disclosed.

The authors thank all of the participants,coordinators, and administrators for their support and help during the research.

DING Zhong Ao took part in the investigation,methodology and writing of the original draft.ZHANG Li Ying took part in the investigation, data curation,formal analysis and writing of the code.LI Rui Ying, NIU Miao Miao, ZHAO Bo, DONG Xiao Kang, LIU Xiao Tian,HOU Jian and MAO Zhen Xing reviewed the manuscript.WANG Chong Jian took part in the conceptualization, methodology, investigation,validation, supervision, funding acquisition, project administration and review of the manuscript.

&These authors contributed equally to this work.

#Correspondence should be addressed to WANG Chong Jian, E-mail: tjwcj2008@zzu.edu.cn Tel: 86-371-67781452.

Biographical notes of the first authors: DING Zhong Ao, male, born in 1999, Postgraduate, majoring in epidemiology and biostatistics; ZHANG Li Ying, female,born in 1988, PhD, Lecturer, majoring in machine learning and medical data mining.

Received: November 3, 2022;Accepted: April 6, 2023


登錄APP查看全文

主站蜘蛛池模板: 久久精品中文无码资源站| 国产Av无码精品色午夜| 国产精品七七在线播放| 午夜精品福利影院| 99热线精品大全在线观看| 日韩人妻无码制服丝袜视频| 天天躁狠狠躁| 99草精品视频| 欧美成人a∨视频免费观看| 久久五月视频| 热伊人99re久久精品最新地| h网址在线观看| 亚洲精品第五页| 国产日本欧美亚洲精品视| 国产乱肥老妇精品视频| 重口调教一区二区视频| 国产亚洲精品无码专| 亚洲国产天堂久久综合| 欧美日韩第二页| 国产精品成人第一区| 成人日韩精品| 欧洲高清无码在线| 国产午夜精品一区二区三区软件| 国产色偷丝袜婷婷无码麻豆制服| 亚洲天堂在线视频| 亚洲精品成人福利在线电影| 狠狠ⅴ日韩v欧美v天堂| 亚洲嫩模喷白浆| 91香蕉国产亚洲一二三区| 精品一区国产精品| 国产在线专区| 国产黄色免费看| 亚洲bt欧美bt精品| 精品国产网站| 思思99思思久久最新精品| 亚洲三级a| 毛片免费在线| 波多野结衣中文字幕一区二区| 青草视频在线观看国产| 操国产美女| 精品国产91爱| 国产三级视频网站| 亚洲精品日产精品乱码不卡| 亚洲福利视频一区二区| 色婷婷在线播放| 久久黄色一级视频| 2022国产无码在线| 欧美精品成人| 国产高颜值露脸在线观看| 广东一级毛片| 综1合AV在线播放| 国产网友愉拍精品视频| 福利国产在线| 久久中文无码精品| 国产精品网曝门免费视频| a级毛片毛片免费观看久潮| 久久96热在精品国产高清| 国产精品开放后亚洲| 日本精品视频| 五月天丁香婷婷综合久久| 国产成人AV综合久久| 三级毛片在线播放| 亚洲第一区欧美国产综合| 嫩草在线视频| 在线免费无码视频| 制服丝袜国产精品| 一级毛片在线播放| 欧美A级V片在线观看| 少妇精品网站| 欧美专区在线观看| 久久国产精品国产自线拍| 麻豆精品在线| 国产制服丝袜91在线| 狠狠亚洲五月天| 日本尹人综合香蕉在线观看| 青青热久免费精品视频6| 亚洲视频黄| www.youjizz.com久久| 国产在线视频欧美亚综合| 久久免费观看视频| 亚洲国产精品VA在线看黑人| 国产va在线观看免费|