Submission ID 77931

Code OH-5-2
At the end of this workshop, participants will be able to:
Category Medical Education
Type Oral
Will the presenter be a: Other
Presenter Other Research Technician
Title Proof-of-Concept of Using Natural Language Processing to Score Narrative Assessments of Undergraduate Medical Students.
Background/Purpose There are tens of thousands of narrative statements with undifferentiating sentiment provided for the clerkship in-training evaluation reports (ITERs) in each cohort in our program. Here, we describe a proof-of-concept study on the use of natural language processing (NLP) to effectively review and flag student ITER assessments.
Methods Narrative assessments were collected for the top ten and bottom ten students (determined by ITER scores) within the classes of 2018-2021. Student identifiers were anonymized to prevent coding bias. Using a predetermined set of key words that are associated with "favourable cohort comparison" (FCC; n = 23) and "overall unfavourable cohort comparison" (O-UCC; n = 26) sentiment, narrative comments were coded and categorized using NVivo. Statistical analysis was performed using SPSS.
Results A total of 8859 ITER comments were included in the analysis, where 504 (5.7%) were coded as FCC or O-UCC sentiment. More than half (n = 290; 57.5%) of the coded sentiments belonged to the top students, where 53.6% and 4.0% were FCC and O-UCC sentiment, respectively, while the opposite pattern was true for the bottom students (9.3% and 33.1% were coded as FCC, and O-UCC, respectively). The ROC analysis demonstrated that the words categorized as FCC had an AUC of 0.781 (p < 0.0001, S.E. 0.027, 95% C.I. 0.728-0.834), while O-UCC had an AUC of 0.817 (p < 0.0001, S.E. 0.024, 95% C.I. 0.770-0.864).
Discussion This proof-of-concept study demonstrates the potential use of natural language processing to efficiently and effectively analyze the sentiment of tens of thousands of undergraduate medical student statements.
Keyword 1 Clerkship student evaluations
Keyword 2 Narrative assessments
Keyword 3 Natural language processing
Abstract content most relevant to: (check all that apply) Undergraduate Medical Education
Abstract Track - First Choice AI and Data Science
Authors Irene Ma
Irene Ma
Kevin McLaughlin
Mike Paget
Adrian Harvey
Janeve Desy
Christopher Naugler
x

Loading . . .
please wait . . . loading

Working...