Gröger, Fabian, Lionetti, Simone, Gottfrois, Philippe, Gonzalez-Jimenez, Alvaro, Groh, Matthew, Daneshjou, Roxana, Consortium, Labelling, Navarini, Alexander A., & Pouly, Marc. (2023). Towards Reliable Dermatology Evaluation Benchmarks. Machine Learning for Health, 225, 101–128.