List of topics
Agentic AI Foundations
Context Harness
Tool Harness
Orchestration Harness
Evaluation Harness
Security Harness
Governance Harness
AgentOps Harness
Final Project - Enterprise AI Agent Platform

Evaluation Harness

  • Golden Datasets

  • LLM-as-a-Judge

  • Human Feedback

  • A/B Testing

  • Regression Testing

  • RAG Evaluation

  • Hallucination Detection

Lab

  • Build Evaluation Pipeline
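
As a starting point for the lab, here is a minimal sketch of an evaluation pipeline that combines two of the topics above: a golden dataset and a regression gate. Every name in it (`GoldenExample`, `evaluate`, `passes_regression`, `toy_agent`) is a hypothetical placeholder, not part of the course materials or any library, and exact-match scoring is only the simplest possible scorer; an LLM-as-a-Judge or a RAG metric could be passed in as `scorer` instead.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class GoldenExample:
    """One entry of a golden dataset: an input and its reference answer."""
    prompt: str
    expected: str


def exact_match(output: str, expected: str) -> float:
    """Score 1.0 when the normalized output equals the expected answer."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def evaluate(agent: Callable[[str], str],
             dataset: List[GoldenExample],
             scorer: Callable[[str, str], float] = exact_match) -> float:
    """Run the agent over every golden example and return the mean score."""
    if not dataset:
        raise ValueError("golden dataset is empty")
    scores = [scorer(agent(ex.prompt), ex.expected) for ex in dataset]
    return sum(scores) / len(scores)


def passes_regression(score: float, baseline: float,
                      tolerance: float = 0.0) -> bool:
    """Regression gate: the score may not fall below baseline - tolerance."""
    return score >= baseline - tolerance


def toy_agent(prompt: str) -> str:
    """Hypothetical stand-in for a real agent call (assumption for the demo)."""
    answers = {"2 + 2": "4", "capital of France": "Paris"}
    return answers.get(prompt, "unknown")


# Tiny illustrative golden dataset; a real one would be curated and versioned.
GOLDEN = [
    GoldenExample("2 + 2", "4"),
    GoldenExample("capital of France", "paris"),
    GoldenExample("capital of Peru", "Lima"),
]

score = evaluate(toy_agent, GOLDEN)
print(f"mean score: {score:.2f}")
print("regression gate passed:", passes_regression(score, baseline=0.6))
```

Keeping the scorer as a plain function argument means the same loop can serve the other topics in this module: swap in a judge-model call, a human-rating lookup, or a groundedness check without touching the pipeline itself.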