Work for a Member company and need a Member Portal account? Register here with your company email address.
Copyright
Apache 2.0
Courtesy of the researchers
In the struggle over who can train AI models and how, there’s a casualty many people don’t realize: The open web.
New research from the Data Provenance Initiative has found a dramatic drop in content made available to the collections used to build AI.
The project purpose is to improve transparency, documentation, and informed use of datasets in AI.
Researchers created a tool that enables an AI practitioner to find data that suits their model, which could improve accuracy and reduce bias