International Congress on Academic Medicine

Submission ID 77931

Code	OH-5-2
At the end of this workshop, participants will be able to:
Category	Medical Education
Type	Oral
Will the presenter be a:	Other
Presenter Other	Research Technician
Title	Proof-of-Concept of Using Natural Language Processing to Score Narrative Assessments of Undergraduate Medical Students.
Background/Purpose	There are tens of thousands of narrative statements with undifferentiating sentiment provided for the clerkship in-training evaluation reports (ITERs) in each cohort in our program. Here, we describe a proof-of-concept study on the use of natural language processing (NLP) to effectively review and flag student ITER assessments.
Methods	Narrative assessments were collected for the top ten and bottom ten students (determined by ITER scores) within the classes of 2018-2021. Student identifiers were anonymized to prevent coding bias. Using a predetermined set of key words that are associated with "favourable cohort comparison" (FCC; n = 23) and "overall unfavourable cohort comparison" (O-UCC; n = 26) sentiment, narrative comments were coded and categorized using NVivo. Statistical analysis was performed using SPSS.
Results	A total of 8859 ITER comments were included in the analysis, where 504 (5.7%) were coded as FCC or O-UCC sentiment. More than half (n = 290; 57.5%) of the coded sentiments belonged to the top students, where 53.6% and 4.0% were FCC and O-UCC sentiment, respectively, while the opposite pattern was true for the bottom students (9.3% and 33.1% were coded as FCC, and O-UCC, respectively). The ROC analysis demonstrated that the words categorized as FCC had an AUC of 0.781 (p < 0.0001, S.E. 0.027, 95% C.I. 0.728-0.834), while O-UCC had an AUC of 0.817 (p < 0.0001, S.E. 0.024, 95% C.I. 0.770-0.864).
Discussion	This proof-of-concept study demonstrates the potential use of natural language processing to efficiently and effectively analyze the sentiment of tens of thousands of undergraduate medical student statements.
Keyword 1	Clerkship student evaluations
Keyword 2	Narrative assessments
Keyword 3	Natural language processing
Abstract content most relevant to: (check all that apply)	Undergraduate Medical Education
Abstract Track - First Choice	AI and Data Science
Authors	Irene Ma Irene Ma Kevin McLaughlin Mike Paget Adrian Harvey Janeve Desy Christopher Naugler

x

please wait . . . loading

Working...