Login
Register

Work for a Member organization and need a Member Portal account? Register here with your official email address.

Project

Data Provenance for AI

Courtesy of the researchers

Project Contact:

Groups

Article Research

AI Training Can Undermine the Open Web. This Team Is Thinking Through Solutions

In the struggle over who can train AI models and how, there’s a casualty many people don’t realize: The open web.

Article Research

The Data That Powers AI Is Disappearing Fast

New research from the Data Provenance Initiative has found a dramatic drop in content made available to the collections used to build AI.

Data Provenance for AI receives 2024 Infrastructure Fund award

The project purpose is to improve transparency, documentation, and informed use of datasets in AI.

Article Research

Study: Transparency is often lacking in datasets used to train large language models

Researchers created a tool that enables an AI practitioner to find data that suits their model, which could improve accuracy and reduce bias