Accelerated Innovation

Agents Foundations

Curating Your Agent Data

Workshop
Do your agents consistently get the right data, in the right shape, at the right time?
Agent data curation is foundational for reliable GenAI behavior, but scattered systems, noisy inputs, and ad hoc context stuffing quickly make agents brittle and hard to scale.
 
To win, your GenAI solutions need to deliver clean, well-scoped context to every agent call, every time.
The Challenge
Without a strong approach to agent data curation, teams struggle to:
  • Discover and unify relevant data across fragmented systems and formats.
  • Balance real-time and historical context while staying within latency and cost budgets.
  • Enforce quality, tagging, and governance so agents remain reliable as use cases grow.
 
Gaps in agent data lead to hallucinations, quiet model drift, and unreliable experiences your users cannot trust.
Our Solution
In this hands-on workshop, your team designs and validates practical ingestion, indexing, and context-delivery patterns using curated notebooks and realistic corpora. Areas of focus include:
  • Document Ingestion Pipelines — Pull, normalize, and enrich content from key transactional and knowledge sources.
  • Hybrid Search & Indexing — Combine vector and keyword indexes to deliver precise, efficient context windows.
  • Context Strategy & Scoping — Balance real-time signals and history to ground agents without overloading prompts.
  • Quality Controls & Governance — Apply filters, checks, and tagging so curated data remains clean, auditable, and compliant.
  • Capstone & Live Coaching — Assemble an end-to-end curation and indexing path that feeds a working agent, with expert feedback on tradeoffs.
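
As an illustration of the Hybrid Search & Indexing focus area above, here is a minimal sketch of one common way to combine keyword and vector results: reciprocal rank fusion. The function name, the document IDs, and the constant k=60 are assumptions made for illustration, not the workshop's actual notebook code, and the two input rankings stand in for whatever your keyword and vector indexes return.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document earns 1 / (k + rank) from every list it appears in,
    so items ranked well by both retrievers rise to the top.
    k=60 is a commonly used default, assumed here for illustration.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword index and a vector index.
keyword_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
print(fused)
```

Rank-based fusion avoids having to normalize keyword scores and vector similarities onto a common scale, which is one reason it is often a reasonable first pattern to try.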
Skills You'll Gain
  • Data Curation Strategy — Define which data your agents should see, when, and under what constraints.
  • Production-Ready Pipelines — Design ingestion and indexing flows that are scalable, observable, and easy to maintain.
  • Answer Quality & Hallucination Control — Improve grounding so agents respond with higher-quality, evidence-backed answers.
  • Cost & Latency Management — Tune data volume, retrieval patterns, and context size to hit performance and budget targets.
  • Reusable Curation Patterns — Establish shared patterns you can reuse across agents, domains, and platforms.
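
To make the Cost & Latency Management skill above concrete, here is a minimal sketch of packing ranked chunks into a fixed context budget. The function name, the sample chunks, and the whitespace-based token estimate are all assumptions for illustration; in practice you would swap in the tokenizer that matches your model.

```python
def pack_context(chunks, max_tokens, count_tokens=None):
    """Greedily select ranked chunks that fit within a token budget.

    `chunks` is assumed to be ordered from most to least relevant.
    `count_tokens` defaults to a rough whitespace word count, which is
    only a stand-in for a real model tokenizer.
    """
    if count_tokens is None:
        count_tokens = lambda text: len(text.split())

    selected, used = [], 0
    for chunk in chunks:
        cost = count_tokens(chunk)
        if used + cost > max_tokens:
            # Skip chunks that do not fit; a smaller one later still might.
            continue
        selected.append(chunk)
        used += cost
    return selected, used


# Hypothetical retrieved chunks, already ranked by relevance.
ranked_chunks = [
    "alpha beta gamma",
    "delta epsilon zeta eta theta",
    "iota kappa",
]

selected, used = pack_context(ranked_chunks, max_tokens=5)
print(selected, used)
```

Lowering `max_tokens` trades grounding for cost and latency, so the budget is worth tuning against measured answer quality rather than fixing up front.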

Who Should Attend

  • Data Engineers
  • Technical Product Managers
  • Solution Architects
  • ML Engineers
  • Enterprise Architects
  • Platform Engineers
  • GenAI Engineers

Solution Essentials

Format

Virtual or in-person

Duration

4 Hours

Skill Level

Intermediate; familiarity with Python, APIs, or data pipelines recommended

Tools

Jupyter-style notebooks plus your preferred GenAI and indexing stack

Explore the Remaining Agents Foundations Certification Workshops

Help your teams responsibly adopt and scale agentic AI with the remaining workshops in the Agents Foundations certification series:

Core Concepts & Capabilities of AI Agents
Advanced Concepts of AI Agents
Selecting Your Agent Architecture

Ready to strengthen how your agents use data?