Lisa N. Cao - Catalogues as Context: Using metadata to power and govern the next wave of AI development
- Alexy Khrabrov
- Mar 25
- 1 min read

Lisa is a data engineer, product manager, and speaker in open source data infrastructure and DataOps fields. Through her work at Datastrato, creators of Apache Gravitino, she is redefining the data cataloging space for generative AI use cases and end-to-end data integrations.
Catalogues as Context: Using metadata to power and govern the next wave of AI development
Developing powerful AI tooling has been our theme of the year, with agents and foundational models picking up steam across the board. Therein still lies the question though: how do we serve data for these applications to work effectively? What about at enterprise scale? What even is context? In this talk we discuss the current big data landscape, challenges to data platforming for AI, and why data catalogues and metadata are the only viable path forward to effective, governed AI-development. In this talk we use the open source framework, Apache Gravitino as a key example for why such a solution needs vendor neutrality.
Comments