Uncovering Demographic Bias in AI Models Evaluating Skin Conditions from Clinical Images (IMAGE)
Caption
Scientists from ShanghaiTech University compared the performance of large language models (LLMs), like ChatGPT-4 and LLaVA, in diagnosing skin diseases among male and female patients across different age groups. The findings point to potential biases across age and sex groups, that must be addressed before clinical deployment.
Credit
Zhiyu Wan, Health Information Safety and Intelligence Research Lab, ShanghaiTech University (generated with the help of ChatGPT-4o)
Usage Restrictions
News organizations may use or redistribute this image, with proper attribution, as part of news coverage of this paper only.
License
Original content