Inria invites you to their event

GPAI IP Workshop n°2: Understanding and Addressing the AI Data Scraping Challenge

About this event

GPAI IP Project Advisory Committee's online workshop: Understanding and Addressing the AI Data Scraping Challenge

As generative AI becomes more prevalent, demand for data for AI training, testing, and validation is increasing. Stakeholders are aggregating AI data through a variety of means, including scraping or ingesting data from third-party websites and social media platforms (“data scraping”). Additionally, organizations are ingesting open source and other publicly available computer code in connection with the development of AI models.

Data scraping has increasingly become subject to litigation, enforcement actions, and policy proposals. Nonetheless, the fast pace of AI innovation and the lack of harmonization across jurisdictions have created legal uncertainty.

To address this uncertainty, the Committee proposes standardized contract terms as a potential solution to the challenges of data and code scraping. To advance that proposal, this workshop examines the current legal challenges arising from data scraping practices and evaluates emerging contractual approaches.

Agenda

(to be confirmed)

16:00 Presentation of Inria, GPAI, and IP Project Advisory Committee Co-Leads

  • Kaitlyn Bove, GPAI Project Manager, Inria

16:00 – 16:05 Welcome Remarks: Framing of the importance of data scraping to AI innovation and competition and the need to protect rights

  • Lee Tiedrich, Distinguished Faculty Fellow in Law & Responsible Technology at Duke University

16:05 – 16:15 Keynote

  • Ulrike Till, Director IP and Frontier Technologies Division at the World Intellectual Property Organization (WIPO)

16:15 – 16:55 Explanation of Data Scraping: What Is It and How Is It Being Used and Cross-Cutting Legal Considerations?

  • Moderator: Yann Dietrich, Group Head of Intellectual Property at Atos (including Eviden)
  • Gaurav Godhwani, Executive Director & Co-founder at CivicDataLab
  • Emma Frejinger, Scientific Advisor at IVADO Lab and Professor, Department of Computer Science and Operations Research, Université de Montréal

16:55 – 17:40 What Are the Concerns of Rights Holders?

  • Moderator: Carolyn Blankenship, General Counsel and Secretary at BigBear.ai
  • Tim Friedlander, Artist Advocate, Voice Actor (Atlas Talent), Musician, Educator, CEO/founder of soundBOX:Studio Group, Co-Founder and President of NAVA
  • Marc Rotenberg, Executive Director and Founder at the Center for AI and Digital Policy

17:40 – 17:50 Keynote

  • Celine Caira, Economist/Artificial Intelligence Policy Analyst at the OECD

17:50 – 18:30 Benefits of Data Scraping (and Different Use Cases) and the Use of Technical Tools to Prevent or Mitigate Data Scraping Harms

  • Moderator: Josef Drexl, Managing Director at the Max Planck Institute for Innovation and Competition
  • Damien Sileo, Researcher in natural language processing and general intelligence at Inria
  • Ana Da Motta, Senior Manager Digital Policy & AI at Amazon Web Services
  • Philipp Hacker, Chair for Law and Ethics of the Digital Society at European University Viadrina

18:30 – 18:40 Keynote

  • [Video] Alessandra Sala, Senior Director of Artificial Intelligence and Data Science at Shutterstock

18:40 – 19:25 Paths Forward: What Contractual Approaches Are Emerging, and Are These Suitable in Light of the Previous Discussions? What Other Tools Merit Consideration, e.g. Business Codes of Conduct, Technical Tools, Education, and Laws?

  • Moderator: Lee Tiedrich, Distinguished Faculty Fellow in Law & Responsible Technology at Duke University
  • Anita Huss-Ekerhult, Secretary General and CEO of IFRRO (International Federation of Reproduction Rights Organisations)
  • Roberto Di Cosmo, Director at Software Heritage
  • Alek Tarkowski, Director of Strategy at Open Future

19:25 – 19:30 Closing remarks

  • Lee Tiedrich, Distinguished Faculty Fellow in Law & Responsible Technology at Duke University


This workshop is coordinated by the GPAI Paris Expert Support Center at Inria.

Hosted by

  • Guest speaker: Philipp Hacker
  • Guest speaker: Ulrike Till
  • Guest speaker: Ana Da Motta
  • Guest speaker: Lee Tiedrich
  • Guest speaker: Alek Tarkowski
  • Team member: Julia Savalli, Communications Manager for the AI Program at Inria
  • Guest speaker: Damien Sileo
  • Guest speaker: Emma Frejinger
  • Guest speaker: Carolyn Blankenship
  • Guest speaker: Gaurav Godhwani
  • Guest speaker: Kaitlyn Bove
  • Guest speaker: Tim Friedlander
  • Guest speaker: Roberto Di Cosmo
  • Guest speaker: Celine Caira
  • Guest speaker: Marc Rotenberg
  • Guest speaker: Josef Drexl
  • Guest speaker: Anita Huss-Ekerhult
  • Guest speaker: Yann Dietrich

Inria

Institut national de recherche en sciences et technologies du numérique