Table Of ContentPractical Data Analytics for
Innovation in Medicine:
Building Real Predictive and Prescriptive
Models in Personalized Healthcare and
Medical Research Using AI, ML, and Related
Technologies, 2nd Edition
Gary D. Miner, PhD; Linda A. Miner, PhD
Linda A. Miner, Originating Author & Editor
Scott Burk, PhD; Mitchell Goldstein, MD
Robert Nisbet, Integrating Author & Editor
Robert Nisbet, PhD; Nephi Walton, MD
Gary D. Miner, Coordinating Author & Editor
Thomas Hill, PhD
Guest—Authors:
Anna J. C. Russell-Toner, MComm(Statistics), CEO; The Boss Lady, The Data-Shack; Head-Office:
York, UK; also offices in South Africa & Hong Kong; http://www.data-shack.co.uk
Billie Corkerin, CPOT; I-CARE HOME Trainer; OMEG, Jenks, OK, https://www.omeg2020.com/
John B. C. Tan, PhD; Data Scientist, Department of Pediatrics, LOMA LINDA UNIVERSITY | School of
Medicine, Loma Linda, CA, https://medicine.llu.edu/
Fu-Sheng Chou, MD, PhD; Neonatologist and Assistant Professor, Department of Pediatrics
LOMA LINDA UNIVERSITY | School of Medicine, Loma Linda, CA, https://medicine.llu.edu/
Rebekah Leigh; Ms2 Medical Student, MD Candidate, Class of 2024,
LOMA LINDA UNIVERSITY | School of Medicine, Loma Linda, CA, https://medicine.llu.edu/
Harsha K. Chandnani MD, MBA, MPH; Pediatric Intensivist, Department of Pediatrics
LOMA LINDA UNIVERSITY | School of Medicine | Children’s Hospital, Loma Linda, CA,
https://medicine.llu.edu/, https://lluch.org/
Practical Data Analytics for Innovation in
Medicine
Building Real Predictive and Prescriptive Models in Personalized Healthcare and
Medical Research Using AI, ML, and Related Technologies
https://www.elsevier.com/books-and-journals/book-companion/9780323952743
Practical Data Analytics for Innovation in Medicine
Gary D. Miner, Linda A. Miner, Scott Burk, Mitchell Goldstein, Robert Nisbet, Nephi Walton, Thomas Hill
Resources available:
This book’s COMPANION WEB PAGE contains the following items:
1. COMPANION WEB PAGE - Selected Chapters from 1st Edition:
Chapters written for the first edition published in 2014-2015 that are NOT included include in this 2nd Edition, but are still important to
understanding the overall story of the background to use of ‘Predictive Analytics’ and ‘Digital processes’ in medicine, are included for
your reading.
2. COMPANION WEB PAGE - Tutorials from the 1st Edition:
All of the Tutorials and Case Studies that were published on the paper pages of the 1st edition of this book (published in 2014-2015)
are NOT included in the paper pages of this 2nd edition, but are available in their entirety on this book’s COMPANION WEB PAGE,
including data sets, where applicable.
3. COMPANION WEB PAGE - Software use instructions:
Information providing the Internet on-line URL – LINKS for obtaining downloads or evaluation copies (when available) of ‘Predictive
Analytic’ software that is used / illustrated in the TUTORIALS and CASE STUDIES is provided here, to assist the reader in using this
software, where interested.
4. COMPANION WEB PAGE - Miscellaneous resources:
Various background resource material, either in the form of URL LINKS and/or PDF documents may be provide here as deemed
appropriate by the authors of this book; this may include additional items added to this page in future years. Currently APPENDIX A
items are included here.
Practical Data Analytics for
Innovation in Medicine
Building Real Predictive and Prescriptive Models
in Personalized Healthcare and Medical
Research Using AI, ML, and Related
Technologies
Second Edition
Gary D. Miner, PhD; Linda A. Miner, PhD Linda A. Miner, Originating Author & Editor
Scott Burk, PhD; Mitchell Goldstein, MD Robert Nisbet, Integrating Author & Editor
Robert Nisbet, PhD; Nephi Walton, MD Gary D. Miner, Coordinating Author & Editor
Thomas Hill, PhD
Guest—Authors:
AnnaJ.C.Russell-Toner,MComm(Statistics),CEO;TheBossLady,TheData-Shack;Head-Office:York,UK;
alsoofficesinSouthAfrica&HongKong;http://www.data-shack.co.uk
BillieCorkerin,CPOT;I-CAREHOMETrainer;OMEG,Jenks,OK,https://www.omeg2020.com/
JohnB.C.Tan,PhD;DataScientist,DepartmentofPediatrics,LOMALINDAUNIVERSITY|SchoolofMedicine,
LomaLinda,CA,https://medicine.llu.edu/
Fu-ShengChou,MD,PhD;NeonatologistandAssistantProfessor,DepartmentofPediatricsLOMALINDA
UNIVERSITY|SchoolofMedicine,LomaLinda,CA,https://medicine.llu.edu/
RebekahLeigh;Ms2MedicalStudent,MDCandidate,Classof2024,LOMALINDAUNIVERSITY|
SchoolofMedicine,LomaLinda,CA,https://medicine.llu.edu/
HarshaK.ChandnaniMD,MBA,MPH;PediatricIntensivist,DepartmentofPediatricsLOMALINDAUNIVERSITY|
SchoolofMedicine|Children’sHospital,LomaLinda,CA,https://medicine.llu.edu/,https://lluch.org/
AcademicPressisanimprintofElsevier
125LondonWall,LondonEC2Y5AS,UnitedKingdom
525BStreet,Suite1650,SanDiego,CA92101,UnitedStates
50HampshireStreet,5thFloor,Cambridge,MA02139,UnitedStates
TheBoulevard,LangfordLane,Kidlington,OxfordOX51GB,UnitedKingdom
Copyright©2023ElsevierInc.Allrightsreserved.
Nopartofthispublicationmaybereproducedortransmittedinanyformorbyanymeans,electronicormechanical,including
photocopying,recording,oranyinformationstorageandretrievalsystem,withoutpermissioninwritingfromthepublisher.
Detailsonhowtoseekpermission,furtherinformationaboutthePublisher’spermissionspoliciesandourarrangementswith
organizationssuchastheCopyrightClearanceCenterandtheCopyrightLicensingAgency,canbefoundatourwebsite:www.
elsevier.com/permissions.
ThisbookandtheindividualcontributionscontainedinitareprotectedundercopyrightbythePublisher(otherthanasmaybe
notedherein).
Notices
Knowledgeandbestpracticeinthisfieldareconstantlychanging.Asnewresearchandexperiencebroadenourunderstanding,
changesinresearchmethods,professionalpractices,ormedicaltreatmentmaybecomenecessary.
Practitionersandresearchersmustalwaysrelyontheirownexperienceandknowledgeinevaluatingandusinganyinformation,
methods,compounds,orexperimentsdescribedherein.Inusingsuchinformationormethodstheyshouldbemindfuloftheir
ownsafetyandthesafetyofothers,includingpartiesforwhomtheyhaveaprofessionalresponsibility.
Tothefullestextentofthelaw,neitherthePublishernortheauthors,contributors,oreditors,assumeanyliabilityforanyinjury
and/ordamagetopersonsorpropertyasamatterofproductsliability,negligenceorotherwise,orfromanyuseoroperationof
anymethods,products,instructions,orideascontainedinthematerialherein.
ISBN:978-0-323-95274-3
ForInformationonallAcademicPresspublications
visitourwebsiteathttps://www.elsevier.com/books-and-journals
Publisher:StacyMasucci
AcquisitionsEditor:RafaelE.Teixeira
EditorialProjectManager:SamYoung
ProductionProjectManager:SwapnaSrinivasan
CoverDesigner:VickyPearson
TypesetbyMPSLimited,Chennai,India
Dedication
(cid:1)
A DUAL DEDICATION to the PAST and to the FUTURE: This book is
dedicated to two important people: Joseph M. Hilbe, J.D., Ph.D., a co-author of
our 1st Edition that passed away too-soon a few years ago; and Rafael Teixeira,
Senior Acquisitions Editor for Elsevier responsible for Medical Informatics and
Biostatistics among other health areas.
Joe Hilbe became “our friend” and “mentor” for all the co-authors of the first
edition, guiding the tone and structure of the book. Joe would attend the same
“Statistical conferences” as co-authors Gary and Linda Miner, and oftentimes
Thomas Hill, so we had frequent in-person contact with Joe during the writing
of the first edition, in addition to On-Line Meetings and digital
communications. We greatly missed Joe’s guidance during the writing of this
2nd edition.
v
vi Dedication
Rafael Teixeira contacted the authors over two years ago (March of 2020, as I
...
recall) asking if we’d consider writing a 2nd Edition. THIS SURPRISED US as
the 1st edition was NOT selling the thousands of copies as did our 2009 and
2012 books on “Data Mining” and “Text Mining,” perplexing the authors, as
we had thought that the release of the first edition in 2015 was at the “crest of
a wave” of interest in using predictive analytics and other digital means of “re-
engineering health care delivery.” It appeared we were wrong. BUT Rafael had
noticed that chapters and passages of our 1st edition were being accessed
numerous times via SCIENCE DIRECT, and even made an Excel sheet providing
the “number of hits” for each chapter and tutorial. Rafael said it was “time for
...
a 2nd edition”!!! and as we spent the past two years writing this 2nd
edition, we discovered his prediction was “right on,” as now in 2022 and
forthcoming years it appears that medicine is finally ready to put predictive
analytics and all sort of digital means to uses to provide better and more
accurate diagnoses and treatment plans for patients that receive the care from
these medical providers.
So, THANKS to JOE and RAFAEL for their wisdom and abilities to act on their
insights to help the world be a better place.
Contents
Abouttheauthors xix
Forewordforthe2ndedition(cid:1)JohnHalamka xxiii
Forewordforthe1steditionbyThomasH.Davenport xxv 2. History of predictive analytics in
Forewordforthe1steditionbyJamesTaylor xxvii medicine and healthcare 15
Forewordforthe1steditionbyJohnHalamka xxix
RobertNisbet
Prefaceandoverviewforthe2ndedition xxxi
Prefacetothe1stedition xxxiii Prelude 15
Acknowledgment xxxv Outline 15
GuestChapterAuthor’sListing xxxvii Introduction 16
EndorsementsandreviewerBlurbs—from PartI.Developmentofbodiesofmedical
the1stedition xxxix knowledge 16
Instructionsforusingsoftwareforthetutorials—how
Earliestmedicalrecordsinancientcultures 17
todownloadfromwebpages—forthe2ndedition xli Classificationofmedicalpracticeamong
ancientandmoderncultures 17
Prologue to Part I Medicalpracticedocumentsinmajor
worldculturesofEuropeandthe
Part I
MiddleEast 18
Historical perspective and the Egypt 18
issues of concern for health care Mesopotamia 19
Greece 20
delivery in the 21st century
AncientRome 22
Galen 23
1. What we want to accomplish with
Arabia 24
this second edition of our first Summaryofroyalmedicaldocumentationin
“Big Green Book” 5 ancientcultures 25
Effectsofthemiddleagesonmedical
LindaA.Miner
documentation 25
Prelude 5 RebirthofInterestinmedicaldocumentation
Purpose/summary 5 duringtherenaissance 26
Firstreasonsforourwritingthisbook 6 Theprintingpress 26
Highlightednewmaterial 6 TheProtestantReformation 26
Descriptivestatistics,dataorganization,and Erasmus 27
example 7 Humananatomy 27
Randomizedcontrolledtrials 9 AndreasVesalius(1514–1564) 27
Basicpredictiveanalyticsandexample 10 WilliamHarvey(1578–1657) 28
Example 11 Medicaldocumentationafterthe
Researchstandardscommontobothtraditional enlightenment 28
andpredictiveanalytics 11 Medicalcasedocumentation 28
Pandemicasrelatedtoresearchstandardsand ThedevelopmentoftheNationalLibraryof
accuratedata 11 Medicine 28
Especiallyforthesecondedition 13 PartII.Analyticaldecisionsystemsin
Chapterconclusion 13 medicineandhealthcare 29
Postscript 13 Computersandmedicaldatabases 29
References 14 Earlymedicaldatabases 30
vii
viii Contents
NationalLibraryofMedicinelistofonline Hospitaloptimization 50
medicaldatabases 30 Challenges 51
Othermedicalresearchdatabases 30 Datastoragevolumes 51
BillsofMortalityinLondon,UnitedKingdom 31 Dataprivacyandsecurity 51
Bestpracticeguidelines 31 PortabilityofPAmodels 52
GuidelinesoftheAmericanAcademyof RegulationofPAmodels 52
Neurology 31 Summary 53
Medicalrecordsmoveintothedigitalworld 32 Postscript 54
Healthcaredatasystems 32 References 54
Postscript 34 Furtherreading 54
References 34
4. Data and process models in
medical informatics 57
3. Bioinformatics 35
Robert(Bob)Nisbet
NephiWaltonandGaryD.Miner
Prelude 57
Prelude 35
Chapterpurpose 57
Theriseofpredictiveanalyticsinhealthcare 35
Introduction 57
Movingfromreactivetoproactiveresponse
Systemsforclassificationofdiseasesand
inhealthcare 36
mortality 58
Medicineandbigdata 36
Billsofmortality 58
Anapproachtopredictiveanalyticsprojects 37
TheICDsystem 58
Thepredictiveanalyticsprocessinhealthcare 38
TheOMOPcommondatamodel 58
ProcessstepsinFig.3.1 38
ReasonsforOMOP 59
Translationalbioinformatics 42
TheOMOPCDMprovidesacommondata
Clinicaldecisionsupportsystems 42
format 60
Hybridclinicaldecisionsupportsystems 43
OMOPCDMarchitectureispatient-centric 60
Consumerhealthinformatics 44
Additionaldataprocessingoperationsnec
Patient-focusedinformatics 44
essarytoservetheanalysisofOMOPdata 61
Healthliteracy 44
TheCRISP-DMprocessingmodel 62
Consumereducation 45
Howthischapterfacilitatespatient-centric
Direct-to-consumergenetictesting 45
healthcare 63
Useofpredictiveanalyticstoavoidan
Postscript 64
undesirablefuture 45
References 64
Consumerhealthkiosks 45
Furtherreading 64
WhousestheInternet?Nearlyeverybody 46
Patientmonitoringsystems 46
Applicationsforpredictiveanalyticsinintensive
5. Access to data for analytics—the
careunitpatientmonitoringsystems 47
Challengesofmedicaldevicesinthe “Biggest Issue” in medical and
intensivecareunit 47 healthcare predictive analytics 65
Publichealthinformatics 48
GaryD.Miner
Themajorproblem:lackofresources 48
Socialnetworksandthe“Pulse”ofpublic Prelude 65
health 48 Sizeofdatainourworld:estimateddigital
Predictiveanalyticsandpreventionand universenowandinthefuture 65
diseaseandinjury 49 Convergenceofhealthcareandmodern
Biosurveillance 49 technologies 66
Food-borneillness 49 Reasonswhyhealthcaredataisdifficultto
Medicalimaging 49 getanddifficulttomeasure 67
Clinicalresearchinformatics 50 Multipleplaceswheremedicaldataarefound 68
Intelligentsearchengines 50 Manydifferentformatsofmedicaldata:
Personalizedmedicine 50 structuredandunstructured 68