E, Among all binary questions, median accuracy scores were 6.0 (IQR, 5.0-6.0) (mean score, 5.3 ) for easy, 5.5 (IQR, 3.4-6.0) (mean score, 4.6 score, 4.8 ) for hard questions, which resulted in a significant difference among groups ( P = .03 determined by the Kruskal-Wallis test). D, Among all descriptive questions, median accuracy scores for easy, medium, and hard questions were 5.3 (IQR, 3.0-6.0) (mean score, 4.8 ) for easy, 5.5 (IQR, 3.3-6.0) (mean score, 4.7 ) for medium, and 5.0 (IQR, 3.6-6.0) (mean score, 4.5 ) for hard questions ( P = .40 determined by the Kruskal-Wallis test). The median accuracy score for original questions was 2.0 (IQR, 1.0-2.0) (mean score, 1.6 ) compared with 4.0 (IQR, 2.0-5.3) (mean score, 3.9 ) for rescored answers ( P < .01 determined by Wilcoxon signed rank test). C, Of 36 questions with accuracy scores of 2 or lower, 34 were requeried or regraded 8 to 17 days later. B, Among all binary questions in the multispecialty analysis, median accuracy scores were 6.0 (IQR, 5.0-6.0) (mean score, 4.9 ) for easy, 4.0 (IQR, 3.0-6.0) (mean score, 4.3 ) for medium, and 5.0 (IQR, 1.0-6.0) (mean score, 4.2 ) for hard answers ( P = .10 determined by the Kruskal-Wallis test). A, Among all descriptive questions in the multispecialty analysis, median accuracy scores were 5.0 (IQR, 3.0-6.0) (mean score, 4.9 ) for easy, 5.0 (IQR, 3.0-6.0) (mean score, 4.4 ) for medium, and 5.0 (IQR, 3.0-6.0) (mean score, 4.1 ) for hard questions ( P = .70 determined by the Kruskal-Wallis test). Accuracy of artificial intelligence answers from multispecialty questions (A-C ) or all questions (multispecialty, melanoma and immunotherapy, and common medical conditions D-F ).
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |