<img alt="Shows the logo of agenta" src="https://github.com/Agenta-AI/agenta/assets/4510758/68e055d4-d7b8-4943-992f-761558c64253" >
<a href="https://cloud.agenta.ai?utm_source=github&utm_medium=referral&utm_campaign=readme">
<img alt="Shows the logo of agenta" src="https://imagedelivery.net/UNvjPBCIZFONpkVPQTxVuA/98140352-14c0-4db1-bafb-a1e8d271d500/large" >
</a>
<a href="https://join.slack.com/t/agenta-hq/shared_invite/zt-37pnbp5s6-mbBrPL863d_oLB61GSNFjw">
<img src="https://img.shields.io/badge/JOIN US ON SLACK-4A154B?style=for-the-badge&logo=slack&logoColor=white" />
</a>
<a href="https://www.linkedin.com/company/agenta-ai/">
<img src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white" />
</a>
<a href="https://twitter.com/agenta_ai">
<img src="https://img.shields.io/twitter/follow/agenta_ai?style=social" height="28" />
</a>
<img alt="Try Agenta Live Demo" src="https://github.com/user-attachments/assets/a2069e7b-c3e0-4a5e-9e41-8ddc4660d1f2" >
Agenta is a platform for building production-grade LLM applications. It helps engineering and product teams create reliable LLM apps faster through integrated prompt management, evaluation, and observability.
Collaborate with Subject Matter Experts (SMEs) on prompt engineering and make sure nothing breaks in production.
Evaluate your LLM applications systematically with both human and automated feedback.

- Flexible Test Sets: Create test cases from production data, playground experiments, or uploaded CSVs
- Pre-built and Custom Evaluators: Use LLM-as-judge, one of our 20+ pre-built evaluators, or your own custom evaluators
- UI and API Access: Run evaluations via the UI (for SMEs) or programmatically (for engineers)
- Human Feedback Integration: Collect and incorporate expert annotations
Explore evaluation frameworks →
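
To make the idea of a test set plus a custom evaluator concrete, here is a minimal, standard-library-only sketch. It does not use the Agenta SDK or reflect its API. The CSV column names (`input`, `expected`), the `fake_app` stand-in, and the `exact_match` and `run_evaluation` helpers are all hypothetical and exist only for illustration.

```python
import csv
import io

# Hypothetical test set; the column names "input" and "expected" are assumptions.
TEST_SET_CSV = """input,expected
What is 2 + 2?,4
Capital of France?,Paris
"""


def exact_match(output: str, expected: str) -> float:
    """Custom evaluator: 1.0 if the strings match, ignoring case and outer whitespace."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def fake_app(prompt: str) -> str:
    """Stand-in for an LLM application; swap in a real model call here."""
    canned = {"What is 2 + 2?": "4", "Capital of France?": "paris "}
    return canned.get(prompt, "")


def run_evaluation(csv_text, app, evaluator) -> float:
    """Run the app on every test case and return the mean evaluator score."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    scores = [evaluator(app(row["input"]), row["expected"]) for row in rows]
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    print(run_evaluation(TEST_SET_CSV, fake_app, exact_match))
```

Because the evaluator is passed in as a plain function, the same loop can be reused with other scoring functions without changing the test set.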
Get visibility into your LLM applications in production.

- Cost & Performance Tracking: Monitor spending, latency, and usage patterns
- Tracing: Debug complex workflows with detailed traces
- Open Standards: OpenTelemetry-native tracing, compatible with OpenLLMetry and OpenInference
- Integrations: Comes with pre-built integrations for most models and frameworks
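
As a rough illustration of what cost and latency tracking records per call, the sketch below times a block of code and derives a cost from a token count. It uses only the standard library. It is not Agenta's tracing API and not OpenTelemetry, and the `Tracer` and `Span` classes and the `PRICE_PER_1K_TOKENS` value are hypothetical.

```python
import time
from contextlib import contextmanager
from dataclasses import dataclass, field

# Hypothetical price in USD per 1,000 tokens; real prices depend on the model.
PRICE_PER_1K_TOKENS = 0.002


@dataclass
class Span:
    """One traced operation with its latency and token usage."""
    name: str
    latency_s: float = 0.0
    tokens: int = 0

    @property
    def cost_usd(self) -> float:
        return self.tokens / 1000 * PRICE_PER_1K_TOKENS


@dataclass
class Tracer:
    """Collects finished spans so cost and latency can be aggregated later."""
    spans: list = field(default_factory=list)

    @contextmanager
    def span(self, name: str):
        record = Span(name)
        start = time.perf_counter()
        try:
            yield record
        finally:
            # Record the latency even if the traced block raises.
            record.latency_s = time.perf_counter() - start
            self.spans.append(record)

    def total_cost(self) -> float:
        return sum(s.cost_usd for s in self.spans)


if __name__ == "__main__":
    tracer = Tracer()
    with tracer.span("generate") as s:
        s.tokens = 500  # in a real app this would come from the model response
    print(tracer.spans[0].name, tracer.total_cost())
```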
The easiest way to get started is through Agenta Cloud. Free tier available with no credit card required.
<img alt="Try Agenta Live Demo" src="https://github.com/user-attachments/assets/3aa96780-b7e5-4b6f-bfee-8feaa36ff3b2" >
git clone https://github.com/Agenta-AI/agenta && cd agenta
docker compose -f hosting/docker-compose/oss/docker-compose.gh.yml --env-file hosting/docker-compose/oss/.env.oss.gh --profile with-web up -d
Once the containers are running, Agenta is available at http://localhost. To deploy on a remote host or use different ports, refer to our self-hosting and remote deployment documentation.
Find help, explore resources, or get involved:
We welcome contributions of all kinds — from filing issues and sharing ideas to improving the codebase.
Consider giving us a star! It helps us grow our community and gets Agenta in front of more developers.
<a href="https://github.com/agenta-ai/agenta">
Thanks goes to these wonderful people (emoji key):
$ claude mcp add agenta \
-- python -m otcore.mcp_server <graph>