Model Cards for Model Reporting

Margaret Mitchell; Simone Wu; Andrew Zaldivar; Parker Barnes; Lucy Vasserman; Ben Hutchinson; Elena Spitzer; Inioluwa Deborah Raji; Timnit Gebru

doi:10.1145/3287560.3287596

Abstract

Trained machine learning models are increasingly used to perform high-impact tasks in areas such as law enforcement, medicine, education, and employment. In order to clarify the intended use cases of machine learning models and minimize their usage in contexts for which they are not well suited, we recommend that released models be accompanied by documentation detailing their performance characteristics. In this paper, we propose a framework that we call model cards, to encourage such transparent model reporting. Model cards are short documents accompanying trained machine learning models that provide benchmarked evaluation in a variety of conditions, such as across different cultural, demographic, or phenotypic groups (e.g., race, geographic location, sex, Fitzpatrick skin type) and intersectional groups (e.g., age and race, or sex and Fitzpatrick skin type) that are relevant to the intended application domains. Model cards also disclose the context in which models are intended to be used, details of the performance evaluation procedures, and other relevant information. While we focus primarily on human-centered machine learning models in the application fields of computer vision and natural language processing, this framework can be used to document any trained machine learning model. To solidify the concept, we provide cards for two supervised models: one trained to detect smiling faces in images, and one trained to detect toxic comments in text. We propose model cards as a step towards the responsible democratization of machine learning and related AI technology, increasing transparency into how well AI technology works. We hope this work encourages those releasing trained machine learning models to accompany model releases with similar detailed evaluation numbers and other relevant documentation.
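The kind of disaggregated reporting the abstract describes can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the paper's method: the `ModelCard` class, its field names, the `accuracy_by_group` helper, and the toy records are all hypothetical, and accuracy stands in for whatever metrics a real card would report.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class ModelCard:
    """Hypothetical container; field names are illustrative, not from the paper."""
    model_name: str
    intended_use: str
    evaluation_procedure: str
    metrics_by_group: dict = field(default_factory=dict)


def accuracy_by_group(records, keys):
    """Accuracy of predictions, disaggregated by the attributes in `keys`.

    One key gives unitary groups; several keys give intersectional
    groups (e.g. age and sex together).
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for r in records:
        group = tuple(r[k] for k in keys)
        total[group] += 1
        correct[group] += int(r["label"] == r["pred"])
    return {g: correct[g] / total[g] for g in total}


# Toy, made-up records purely for illustration.
records = [
    {"age": "young", "sex": "F", "label": 1, "pred": 1},
    {"age": "young", "sex": "M", "label": 0, "pred": 1},
    {"age": "old", "sex": "F", "label": 1, "pred": 0},
    {"age": "old", "sex": "M", "label": 0, "pred": 0},
    {"age": "old", "sex": "M", "label": 1, "pred": 1},
]

card = ModelCard(
    model_name="example-classifier",
    intended_use="Illustration only.",
    evaluation_procedure="Accuracy on a toy held-out set.",
)
card.metrics_by_group["age"] = accuracy_by_group(records, ["age"])
card.metrics_by_group["age,sex"] = accuracy_by_group(records, ["age", "sex"])
```

In this toy data the per-age figures hide the fact that some intersectional groups are classified entirely correctly and others entirely incorrectly, which is the sort of gap that reporting both unitary and intersectional results is meant to surface.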

Publication Info

Year: 2019
Type: article
Pages: 220-229
Citations: 1302
Access: Closed

Cite This

APA Style

Margaret Mitchell, Simone Wu, Andrew Zaldivar, et al. (2019). Model Cards for Model Reporting, 220-229. https://doi.org/10.1145/3287560.3287596
