Open Access Open Access  Restricted Access Subscription Access

Hybrid Feature Selection Framework for Identification of Alzheimer’s Biomarkers


Affiliations
1 College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, Al Kharj, Saudi Arabia
2 Department of Computer Sciences and Engineering, Sethu Institute of Technology, Kariapatti, Virudhunagar – 626115, Tamil Nadu, India
 

Objectives: Alzheimer's Disease (AD) is a chronic disease that eventually leads to death. Early diagnosis is expected to improve the patient survival. Evidence from literature indicates that best combination of biomarkers can help in early diagnosis. Feature selection techniques are proven as a workhorse in biomarker selection. This work aims to contribute a new hybrid feature selection framework based on ensemble learning for biomarker identification with an objective to improve the robustness of the final set of selected biomarkers. Methods and Analysis: The proposed framework employs Significance Analysis of Microarray (SAM) filter to initially select most relevant biomarkers. Then, the selected biomarkers are reselected in wrapper phase using heterogeneous voting based ensemble of classifiers to enhance the robustness of the final set of selected biomarkers. An extensive experiment was conducted to demonstrate the effectiveness of the proposed framework against the individual base learner in term of sensitivity and specificity. Finally, the statistical significance of the selected biomarkers was also verified by ANOVA test. Findings: The reported experimental results demonstrated improvement in AD diagnosis accuracy and proved the potential of the proposed framework in selecting the stable set of biomarkers for early AD diagnosis alternative to its base learners. The diagnosis accuracy level obtained by the identified set of biomarkers is over 87%. The higher area under ROC curve revealed the advantage of the identified biomarkers in discriminating AD patients from healthy control diagnosis with sensitivity and specificity of 93%. The p-value of ANOVA test results confirmed the significance of the identified biomarkers in early AD diagnosis. Overall, the identified robust set of biomarkers is expected to open a new pathway for early AD diagnosis. Improvements: Although clinical studies are needed to conform the findings, in the light of the results reported in this paper, it is evident that the proposed framework stands out to find the significant biomarkers for early AD diagnosis.
User

  • Shui B, Tao D, Florea A, Cheng J, Zhao Q, Gu Y, Li W, Jaffrezic-Renault N, Mei Y, Guo Z. Biosensors for Alzheimer's Disease biomarker detection: A review. Biochimie. 2018; 147:13–24. PMid: 29307704. Crossref. Crossref.
  • Long J, Pan G, Ifeachor E, Belshaw R, Li X. Discovery of novel biomarkers for Alzheimer's Disease from blood. Disease Markers. 2016; 2016:1–9.
  • Blennow K, Dubois B, Fagan AM, Lewczuk P, de Leon MJ, Hampel H. Clinical utility of cerebrospinal fluid biomarkers in the diagnosis of early Alzheimer's Disease. Alzheimer's and Dementia: The Journal of the Alzheimer's Association. 2015; 11(1):58–69.
  • Blennow K, Zetterberg H. The application of cerebrospinal fluid biomarkers in early diagnosis of Alzheimer Disease. Medical Clinics. 2013; 97(3):369–76. Crossref.
  • Frisoni GB, Boccardi M, Barkhof F, Blennow K, Cappa S, Chiotis K, Demonet JF, Garibotto V, Giannakopoulos P, Gietl A, Hansson O. Strategic roadmap for an early diagnosis of Alzheimer's Disease based on biomarkers. The Lancet Neurology. 2017; 16(8):661–76. Crossref.
  • He Z, Yu W. Stable feature selection for biomarker discovery. Computational Biology and Chemistry. 2010; 34(4):215– 25. PMid: 20702140. Crossref.
  • Zhou W, Dickerson JA. A novel class dependent feature selection method for cancer biomarker discovery. Computers in Biology and Medicine. 2014; 47:66–75. PMid: 24561345. Crossref.
  • Hsu HH, Hsieh CW, Lu MD. Hybrid feature selection by combining filters and wrappers. Expert Systems with Applications. 2011; 38(7):8144–50. Crossref.
  • Liu HC, Peng PC, Hsieh TC, Yeh TC, Lin CJ, Chen CY, Hou JY, Shih LY, Liang DC. Comparison of feature selection methods for cross-laboratory microarray analysis. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2013; 10(3):593–604. PMid: 24091394. Crossref.
  • Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proceedings of the National Academy of Sciences. 2001; 98(9):5116–21. PMid: 11309499 PMCid: PMC33173. Crossref.
  • Abeel T, Helleputte T, Van de Peer Y, Dupont P, Saeys Y. Robust biomarker identification for cancer diagnosis with ensemble feature selection methods. Bioinformatics. 2009 Nov 25; 26(3):392–8. PMid: 19942583. Crossref.
  • Pes B, Dessì N, Angioni M. Exploiting the ensemble paradigm for stable feature selection: A case study on highdimensional genomic data. Information Fusion. 2017; 35:132–47. Crossref.
  • Pradhan B, Jebur MN. Spatial prediction of landslide-prone areas through K-Nearest Neighbor algorithm and logistic regression model using high resolution airborne laser scanning data. Laser Scanning Applications in Landslide Assessment; 2017. p. 151–65. Crossref.
  • Son H, Kim C, Hwang N, Kim C, Kang Y. Classification of major construction materials in construction environments using ensemble classifiers. Advanced Engineering Informatics. 2014; 28(1):1–10. Crossref.
  • Oreski G, Oreski S. An experimental comparison of classification algorithm performances for highly imbalanced datasets. 25th Central European Conference on Information and Intelligent Systems; Varazdin, Croatia. 2014. p. 1–4.
  • Albukhanajer WA, Jin Y, Briffa JA. Classifier ensembles for image identification using multi-objective Pareto features. Neurocomputing. 2017; 238:316–27. Crossref.
  • Craig-Schapiro R, Kuhn M, Xiong C, Pickering EH, Liu J, Misko TP, Perrin RJ, Bales KR, Soares H, Fagan AM, Holtzman DM. Multiplexed immunoassay panel identifies novel CSF biomarkers for Alzheimer's Disease diagnosis and prognosis. PloS one. 2011; 6(4):18850. PMid: 21526197 PMCid: PMC3079734. Crossref.
  • Fergus P, Selvaraj M, Chalmers C. Machine learning ensemble modeling to classify caesarean section and vaginal delivery types using Cardiotocography traces. Computers in Biology and Medicine. 2018; 93:7–16. PMid: 29248699. Crossref.
  • Singh BK, Verma K, Thoke AS, Suri JS. Risk stratification of 2D ultrasound-based breast lesions using hybrid feature selection in machine learning paradigm. Measurement. 2017; 105:146–57. Crossref.
  • Ko J, Baldassano SN, Loh PL, Kording K, Litt B, Issadore D. Machine learning to detect signatures of disease in liquid biopsies- a user's guide. Lab on a Chip. 2018; 18(3):395– 405. PMid: 29192299. Crossref.
  • Baesens B, Van Gestel T, Viaene S, Stepanova M, Suykens J, Vanthienen J. Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society. 2003; 54(6):627–35. Crossref.

Abstract Views: 224

PDF Views: 0




  • Hybrid Feature Selection Framework for Identification of Alzheimer’s Biomarkers

Abstract Views: 224  |  PDF Views: 0

Authors

V. Thavavel
College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, Al Kharj, Saudi Arabia
M. Karthiyayini
Department of Computer Sciences and Engineering, Sethu Institute of Technology, Kariapatti, Virudhunagar – 626115, Tamil Nadu, India

Abstract


Objectives: Alzheimer's Disease (AD) is a chronic disease that eventually leads to death. Early diagnosis is expected to improve the patient survival. Evidence from literature indicates that best combination of biomarkers can help in early diagnosis. Feature selection techniques are proven as a workhorse in biomarker selection. This work aims to contribute a new hybrid feature selection framework based on ensemble learning for biomarker identification with an objective to improve the robustness of the final set of selected biomarkers. Methods and Analysis: The proposed framework employs Significance Analysis of Microarray (SAM) filter to initially select most relevant biomarkers. Then, the selected biomarkers are reselected in wrapper phase using heterogeneous voting based ensemble of classifiers to enhance the robustness of the final set of selected biomarkers. An extensive experiment was conducted to demonstrate the effectiveness of the proposed framework against the individual base learner in term of sensitivity and specificity. Finally, the statistical significance of the selected biomarkers was also verified by ANOVA test. Findings: The reported experimental results demonstrated improvement in AD diagnosis accuracy and proved the potential of the proposed framework in selecting the stable set of biomarkers for early AD diagnosis alternative to its base learners. The diagnosis accuracy level obtained by the identified set of biomarkers is over 87%. The higher area under ROC curve revealed the advantage of the identified biomarkers in discriminating AD patients from healthy control diagnosis with sensitivity and specificity of 93%. The p-value of ANOVA test results confirmed the significance of the identified biomarkers in early AD diagnosis. Overall, the identified robust set of biomarkers is expected to open a new pathway for early AD diagnosis. Improvements: Although clinical studies are needed to conform the findings, in the light of the results reported in this paper, it is evident that the proposed framework stands out to find the significant biomarkers for early AD diagnosis.

References





DOI: https://doi.org/10.17485/ijst%2F2018%2Fv11i22%2F123310