The Proceedings of the Conference on Information Systems Applied Research 2008: §2325
Fri, Nov 7, 11:30 - 11:55, Pueblo B     Paper (refereed)
Recommended Citation: McClelland, M K.  Knowledge Discovery and Data Mining of Biomedical Data with Decision Trees.  In The Proceedings of the Conference on Information Systems Applied Research 2008, v 1 (Phoenix): §2325. ISSN: 0000-0000. (A later version appears in Journal of Information Systems Applied Research 3(2). ISSN: 1946-1836.)

Knowledge Discovery and Data Mining of Biomedical Data with Decision Trees

Refereed, 19 pages
Marilyn K. McClelland
School of Business
North Carolina Central University
Durham, North Carolina, USA

Traditional knowledge discovery processes are applied in a biomedical population study to develop a data warehouse of diverse clinical, phenotypic, psychosocial, and genetic data associated with hypertension. Experiences of an informationist as a member of a biomedical research team are shared. Issues encountered with missing data and data analysis are discussed. The use of decision trees, which can accommodate missing data, is explored. SAS Enterprise Miner 4.2 is used to develop classification and predictive decision trees for hypertension in African Americans. Preliminary knowledge discovery through the use of decision trees suggests that psychosocial components such as anger, as well as traditional metabolic syndrome components such as waist size, are important factors in hypertension among healthy, community-dwelling African Americans. Advantages and limitations of decision trees are discussed. More broadly, the biomedical research team continues to benefit from the knowledge management infrastructure implemented as part of the knowledge discovery and databases (KDD) process described here.

Keywords: KDD, data mining, biomedical informatics, decision tree, knowledge management

