Data is the lifeblood of generative AI applications and these apps are ultimately only as good as the data they train on. It is obvious, therefore, that having and maintaining policies and procedures that are specifically designed to ensure high quality data is continuously provided is critical. I refer to this overall effort as “data stewardship” and below is a (very) rough draft of what this effort looks like. (Those of you who are familiar with the CIS-20 Cybersecurity Controls will appreciate the structural similarity.) This framework can also be used by data consumers; i.e., companies that build generative AI applications and by AI auditors.

Basic Controls

Foundational Controls

Organizational Controls

Leave a Reply

Your email address will not be published. Required fields are marked *