Enterprise-ready document processing for AI
AI Infrastructure, RAG, AI Security, LLMOps
In this talk, Tu and Kerim look at what they learned about building an enterprise-ready document processing pipeline to make an internal chatbot smarter and capable of providing actually relevant answers, all the while safely respecting privacy, by sanitizing information that shouldn't be immediately visible.
The speakers look at how documents should be prepared, how they can be ingested at rapid scale, and where the PII sanitization needs to be applied.
Attendees can expect to learn actionable strategies and will get access to a repository they can clone and get started with, irrespective of the underlying model they prefer.

Kerim Satirli is a senior developer advocate at HashiCorp and AWS Community Builder for Security & Identity. Before he joined HashiCorp, Kerim worked on IIoT for the Amsterdam airport and helped museums bring their collections online. When Kerim isn't working, he's either spending time with his daughter, enjoying aerial photography, or baking a cake.
Tu Nguyen is a technical leader with a passion for technology and education. He currently helps people learn HashiCorp products. Previously, he built engaging, interactive tutorials for Terraform and Packer, and managed the Consul Education team.
He also advises DreamsForSchools in designing computer science curriculums for elementary and middle school students across Southern California. Previously, he’s led development teams in startups, and built internal DevOps tools at Optum.