Classification of thyroid nodules by ultrasound in clinical practice: the added value of the judgment of the skilled endocrinologist

Bruno Madeo; Giulia Brigante; Anna Ansaloni; Erica Taliani; Shaniko Kaleci; Maria Laura Monzani; Manuela Simoni; Vincenzo Rochira

doi:10.1530/endoabs.63.GP262

Endocrine Abstracts (2019) 63 GP262 | DOI: 10.1530/endoabs.63.GP262

ECE2019 Guided Posters Thyroid Nodules and Cancer 2 (11 abstracts)

Bruno Madeo ¹,², Giulia Brigante ¹,², Anna Ansaloni ¹,², Erica Taliani ¹,², Shaniko Kaleci ³, Maria Laura Monzani ¹,², Manuela Simoni ¹,² & Vincenzo Rochira ¹,²

Author affiliations

¹Unit of Endocrinology, Dept. Biomedical, Metabolic and Neural Sciences, University of Modena and Reggio Emilia, Modena, Italy; ²Department of Medical Specialties, Azienda Ospedaliero-Universitaria di Modena, Modena, Italy; ³Department of Diagnostic Medicine, Clinics and Public Health, Azienda Ospedaliero-Universitaria di Modena, Modena, Italy.

Background and aim: Several ultrasound (US) classification systems for thyroid nodules have been proposed. Since most of them are difficult to apply in clinical practice, we set up the Modena US Thyroid Classification (MUT), which stratifies the risk of malignancy by considering both knowledge derived from the scientific literature and the clinician's subjective impression. The aim of the present study was to test the diagnostic accuracy of four thyroid US classification systems (AACE/ACE-AME, American Thyroid Association (ATA), British Thyroid Association (BTA), and MUT) and to evaluate inter-classification agreement.

Methods: We prospectively enrolled 111 patients (33 M, 78 F; age 19–75 years) who were candidates for surgery because of indeterminate, suspicious or malignant cytology. All patients underwent neck US before surgery, and a MUT score was assigned: 1, not certainly nodular; 2, not suspect; 3, indeterminate; 4, suspect; 5, very suspect. We then retrospectively classified the nodules according to AACE/ACE-AME, ATA and BTA, using the US features of each nodule recorded in detail in a predefined checklist. The US pattern was compared with histology. Sensitivity, specificity, diagnostic cut-off value and accuracy were calculated for each classification. Overall agreement between classifications was quantified by Bland-Altman analysis. Agreement between classifications at the level of the single nodule was evaluated with weighted Cohen's kappa.
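As an illustration of the single-nodule agreement statistic named above, the following is a minimal sketch of weighted Cohen's kappa in plain Python. It assumes that both classifications have already been mapped onto one common ordinal scale and that disagreement weights are linear (or quadratic) in the distance between categories. The abstract does not state the mapping or the weighting scheme, so the function name `weighted_kappa`, its parameters and the toy scores are hypothetical.

```python
from collections import Counter


def weighted_kappa(ratings_a, ratings_b, categories, weights="linear"):
    """Weighted Cohen's kappa for two sets of ordinal ratings of the same items.

    Hypothetical helper: `categories` lists the shared ordinal scale in order
    (at least two levels); `weights` is "linear" or "quadratic".
    """
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("ratings must be non-empty and of equal length")
    k = len(categories)
    if k < 2:
        raise ValueError("at least two categories are required")
    index = {c: i for i, c in enumerate(categories)}
    n = len(ratings_a)

    # Observed joint counts and the marginal counts of each rating system.
    observed = Counter((index[a], index[b]) for a, b in zip(ratings_a, ratings_b))
    row = Counter(index[a] for a in ratings_a)
    col = Counter(index[b] for b in ratings_b)

    power = 1 if weights == "linear" else 2
    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for i in range(k):
        for j in range(k):
            # Disagreement weight: 0 on the diagonal, 1 for the largest gap.
            w = (abs(i - j) / (k - 1)) ** power
            observed_disagreement += w * observed[(i, j)] / n
            expected_disagreement += w * row[i] * col[j] / (n * n)

    if expected_disagreement == 0:
        raise ValueError("kappa is undefined when all ratings are identical")
    return 1.0 - observed_disagreement / expected_disagreement


# Toy data only (not from the study): two systems scoring six nodules on a 1-5 scale.
system_a = [2, 3, 3, 4, 5, 2]
system_b = [2, 3, 4, 4, 5, 3]
kappa = weighted_kappa(system_a, system_b, categories=[1, 2, 3, 4, 5])
```

Kappa equals 1 for identical ratings and 0 when the observed weighted disagreement matches what the marginal distributions alone would produce.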

Results: Fifteen patients had uninodular goiter and 96 had multinodular goiter, for a total of 457 nodules. MUT had the highest accuracy (AUC 0.808) and specificity (89%), followed by ATA and BTA, and finally by AACE/ACE-AME. ATA and BTA were highly interchangeable, and MUT was comparable to both. AACE/ACE-AME was the least interchangeable with the other classifications. In the single-nodule analysis, ATA and BTA had the best agreement (κ=0.723); AACE/ACE-AME showed slight agreement with BTA (κ=0.177) and MUT (κ=0.183), and fair agreement with ATA (κ=0.282); MUT had fair agreement with both ATA (κ=0.291) and BTA (κ=0.271).
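The labels "slight" and "fair" used above match the conventional Landis and Koch bands for kappa. The abstract does not name the scale it used, so the sketch below is an assumption; it maps the reported κ values to those bands. The function name `agreement_label` is hypothetical.

```python
def agreement_label(kappa):
    """Map a kappa value to the Landis and Koch descriptive band (assumed scale)."""
    if kappa < 0:
        return "poor"
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper, label in bands:
        if kappa <= upper:
            return label
    raise ValueError("kappa cannot exceed 1")


# Kappa values as reported in the Results paragraph.
reported = {
    ("ATA", "BTA"): 0.723,
    ("AACE/ACE-AME", "BTA"): 0.177,
    ("AACE/ACE-AME", "MUT"): 0.183,
    ("AACE/ACE-AME", "ATA"): 0.282,
    ("MUT", "ATA"): 0.291,
    ("MUT", "BTA"): 0.271,
}
labels = {pair: agreement_label(k) for pair, k in reported.items()}
```

Under this assumed scale, the ATA/BTA value of 0.723 falls in the "substantial" band, while every pairing involving MUT or AACE/ACE-AME falls in "slight" or "fair".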

Conclusions: Our analysis of the agreement between different classification systems confirms their reliability and reproducibility in classifying malignancy risk. However, the results highlight the limited specificity of the current reference classifications, which improves when the subjective impression of the clinician is considered.