Staging environment

Agent Engineering Bootcamp: Developers Edition

4.8 (107)

Featured in

Lenny’s List

Hamza Farooq

Founder | Ex-Google | Prof UCLA & UMN

Zain Hasan

Staff AI/ML Engineer (DX) @ Together AI

This course is popular

3 people enrolled last week.

Build Production-Ready Agentic RAG, Voice & Multi-Agent Systems

Master the full agent stack from first principles to production with RAG, voice, multi-agent systems, & inference

6 live sessions · 6 office hours · 1 demo day · 20+ hours

What You’ll Build

Agent Harness Design: Build a observable perceive–think–act loop in Python with structured tracing, evolving from simple LLM calls to full agent systems with tools, reflection, and planning

Agentic RAG Systems: Create advanced RAG pipelines with routing, multi-hop reasoning, Knowledge Graph, and semantic caching to improve accuracy and efficiency

Real-Time Voice Agents: Engineer a streaming STT → LLM → TTS pipeline with turn-taking, interruptions, and sub-second latency across modern voice platforms

Multi-Agent Orchestration: Design coordinated agent systems using MCP and A2A with patterns like hierarchical, debate, and orchestrator-worker workflows

Guardrails & Evaluation: Build reliable systems with safety guardrails and trajectory-based evaluation using LLM-as-judge and validated benchmark tasks

Production Deployment: Ship multi-agent systems with Google ADK, MCP, A2A, Llama Guard, and GCP monitoring

💻 Prerequisites: Python

👉 This bootcamp is for engineers who ship real agent systems

What you’ll learn

Master Advanced Techniques for Building and Optimizing Agentic RAG Systems and Multi-Agent Workflows — Designed for Builders

  • Outcome: Engineer ReAct agents with structured tracing, decision auditing, memory, and termination control.

  • Framework: Implement perceive-think-act loops, planning agents, reflection, and tool-use architectures.

  • Hands-on: Building a production-grade agent harness and unpacking what makes Claude Code powerful beyond the model.

  • Outcome: Serve production LLMs with predictable latency, lower cost, and benchmarks you can defend.

  • Deep dive: Master GPTQ, GGUF, QLoRA, AWQ, prefill, decode, KV caching, and speculative decoding.

  • Infrastructure: Measure TTFT, inter-token latency, throughput, and the economics of each deployment choice.

  • Outcome: Build retrieval systems that route, reason, adapt, and move beyond naive top-k search.

  • Framework: Implement retrieval-as-a-tool, query planning, multi-hop reasoning, and semantic chunking.

  • Advanced: Add Graph RAG and semantic caching to improve reasoning quality, cost, and latency.

  • Outcome: Engineer real-time voice agents with sub-second responses and natural conversational flow.

  • Architecture: Build streaming STT, LLM, and TTS loops with turn-taking, end detection, and barge-in.

  • Tools: Compare Deepgram, ElevenLabs, OpenAI Realtime, Vapi, and Retell across latency, quality, and cost.

  • Outcome: Design multi-agent systems with clear topology, communication protocols, and operational judgment.

  • Patterns: Implement orchestrator-worker, hierarchical, debate, and handoff workflows for real use cases.

  • Protocols: Connect agents with MCP and A2A, and evaluate when single-agent systems are the better choice.

  • Outcome: Ship agent systems with measurable quality, layered safety, and production-ready eval loops.

  • Guardrails: Implement Llama Guard, NeMo Guardrails, validation, and prompt-injection defenses.

  • Evals: Build trajectory-based evals with golden tasks, partial-credit grading, and LLM-as-judge scoring.

Learn directly from Hamza & Zain

Hamza Farooq

Hamza Farooq

Founder | Ex-Google | Adjunct UCLA & UMN, SCU | Venture Partner

Worked at:
Google
Walmart
University of Minnesota
Stanford Continuing Studies
UCLA
Zain Hasan

Zain Hasan

Staff AI/ML Engineer (DX) @ Together AI | AI Educator & Researcher | Founder

Worked at:
Coursera
Together AI
University of Toronto
Weaviate AI Database
DeepLearning.AI
See all products from Hamza

Who this course is for

  • Machine Learning Engineer exploring different techniques to scale LLM solutions

  • Researcher, who would like to delve in to various aspects of open-source LLMs

  • Software Engineer, looking to learn how to integrate AI into their products

What's included

Live sessions

Learn directly from Hamza Farooq & Zain Hasan in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Office Hours

Get your questions answered during live office hours.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Maven Guarantee

Your purchase is backed by the Maven Guarantee.

Course syllabus

Week 1

May 28—May 31

    May

    30

    Session 1: The Agent Loop - ReAct, Architectures, and the Claude Code Harness

    Sat 5/304:00 PM—6:00 PM (UTC)

Week 2

Jun 1—Jun 7

    Jun

    4

    Optional: Office Hours

    Thu 6/44:00 PM—4:45 PM (UTC)
    Optional

    Jun

    6

    Session 2: LLM Quantization and KV Caching

    Sat 6/64:00 PM—6:00 PM (UTC)

Free resources

Schedule

Live sessions

2-3 hrs / week

    • Sat, May 30

      4:00 PM—6:00 PM (UTC)

    • Thu, Jun 4

      4:00 PM—4:45 PM (UTC)

    • Sat, Jun 6

      4:00 PM—6:00 PM (UTC)

Projects

1-3 hrs / week

Async content

1-3 hrs / week

Frequently asked questions

Maven for Teams

Reimbursement

Get your company to pay

Everything L&D needs: email template, receipts, and certificate of completion.

Get reimbursed

Team discount

Learn with your teammates

Save 20%+ when 2 or more teammates enroll in the same cohort.

Save 20%+ with a team

Private cohort

Run a cohort for your org

A dedicated cohort with a custom schedule and curriculum, tailored to your team.

Book a private cohort

$2,500

USD

·
May 28Jul 5
Enroll