hub / github.com/Agenta-AI/agenta

github.com/Agenta-AI/agenta @48.5 sqlite

repository ↗ · DeepWiki ↗ · release 48.5 ↗

5,299 symbols 22,101 edges 1,485 files 777 documented · 15%

README

    <img alt="Shows the logo of agenta" src="https://github.com/Agenta-AI/agenta/assets/4510758/68e055d4-d7b8-4943-992f-761558c64253" >

The Open-source LLMOps Platform

Build reliable LLM applications faster with integrated prompt management, evaluation, and observability.

    <a href="https://cloud.agenta.ai?utm_source=github&utm_medium=referral&utm_campaign=readme">









    <img alt="Shows the logo of agenta" src="https://imagedelivery.net/UNvjPBCIZFONpkVPQTxVuA/98140352-14c0-4db1-bafb-a1e8d271d500/large" >



    </a>

Documentation • Website • Agenta Cloud

Contributors Last Commit

<a href="https://join.slack.com/t/agenta-hq/shared_invite/zt-37pnbp5s6-mbBrPL863d_oLB61GSNFjw">
    <img src="https://img.shields.io/badge/JOIN US ON SLACK-4A154B?style=for-the-badge&logo=slack&logoColor=white" />
</a>
<a href="https://www.linkedin.com/company/agenta-ai/">
    <img src="https://img.shields.io/badge/LinkedIn-0077B5?style=for-the-badge&logo=linkedin&logoColor=white" />
</a>
<a  href="https://twitter.com/agenta_ai">
    <img src="https://img.shields.io/twitter/follow/agenta_ai?style=social" height="28" />
</a>

    <img alt="Try Agenta Live Demo" src="https://github.com/user-attachments/assets/a2069e7b-c3e0-4a5e-9e41-8ddc4660d1f2" >

What is Agenta?

Agenta is a platform for building production-grade LLM applications. It helps engineering and product teams create reliable LLM apps faster through integrated prompt management, evaluation, and observability.

Core Features

🧪 Prompt Engineering & Management

Collaborate with Subject Matter Experts (SMEs) on prompt engineering and make sure nothing breaks in production.

Interactive Playground: Compare prompts side by side against your test cases
Multi-Model Support: Experiment with 50+ LLM models or bring-your-own models
Version Control: Version prompts and configurations with branching and environments
Complex Configurations: Enable SMEs to collaborate on complex configuration schemas beyond simple prompts

Explore prompt management →

📊 Evaluation & Testing

Evaluate your LLM applications systematically with both human and automated feedback. - Flexible Test Sets: Create test cases from production data, playground experiments, or upload CSVs - Pre-built and Custom Evaluators: Use LLM-as-judge, one of our 20+ pre-built evaluators, or you custom evaluators - UI and API Access: Run evaluations via UI (for SMEs) or programmatically (for engineers) - Human Feedback Integration: Collect and incorporate expert annotations

Explore evaluation frameworks →

📡 Observability & Monitoring

Get visibility into your LLM applications in production. - Cost & Performance Tracking: Monitor spending, latency, and usage patterns - Tracing: Debug complex workflows with detailed traces - Open Standards: OpenTelemetry native tracing compatible with OpenLLMetry, and OpenInference - Integrations: Comes with pre-built integrations for most models and frameworks

Learn about observability →

📸 Screenshots

Playground Prompt Management Evaluation Observability

🚀 Getting Started

Agenta Cloud (Recommended):

The easiest way to get started is through Agenta Cloud. Free tier available with no credit card required.

    <img alt="Try Agenta Live Demo" src="https://github.com/user-attachments/assets/3aa96780-b7e5-4b6f-bfee-8feaa36ff3b2" >

Self-hosting Agenta

Clone Agenta:

git clone https://github.com/Agenta-AI/agenta && cd agenta

Start Agenta services:

docker compose -f hosting/docker-compose/oss/docker-compose.gh.yml --env-file hosting/docker-compose/oss/.env.oss.gh --profile with-web up -d

Access Agenta at http://localhost.

For deploying on a remote host, or using different ports refers to our self-hosting and remote deployment documentation.

💬 Community

Find help, explore resources, or get involved:

🧰 Support

📚 Documentation – Full guides and API reference
📋 Changelog – Track recent updates
💬 Slack Community – Ask questions and get support

🤝 Contribute

We welcome contributions of all kinds — from filing issues and sharing ideas to improving the codebase.

🐛 Report bugs – Help us by reporting problems you encounter
💡 Share ideas and feedback – Suggest features or vote on ideas
🔧 Contribute to the codebase – Read the guide and open a pull request

⭐ Star Agenta

Consider giving us a star! It helps us grow our community and gets Agenta in front of more developers.

<a href="https://github.com/agenta-ai/agenta">

Star us

Contributors ✨

Thanks goes to these wonderful people (emoji key):

_{Sameh Methnani} 💻 📖	_{Suad Suljovic} 💻 🎨 🧑‍🏫 👀	_burtenshaw 💻	_Abram 💻 📖	_{Israel Abebe} 🐛 🎨 💻	_{Master X} 💻	_corinthian 💻 🎨
_{Pavle Janjusevic} 🚇	_{Kaosi Ezealigo} 🐛 💻

Extension points exported contracts — how you extend this code

BaseHookConfig (Interface)

Base hook configuration types

web/oss/src/components/Playground/hooks/usePlayground/types.d.ts

Config (Interface)

(no doc)

web/tests/utils/testmail/types.d.ts

BaseHookResponse (Interface)

Base hook response extending SWR

web/oss/src/components/Playground/hooks/usePlayground/types.d.ts

Message (Interface)

(no doc)

web/tests/utils/testmail/types.d.ts

SelectorConfig (Interface)

Generic selector types

web/oss/src/components/Playground/hooks/usePlayground/types.d.ts

WaitOptions (Interface)

(no doc)

web/tests/utils/testmail/types.d.ts

TreeProps (Interface)

* CustomTree is a recursive tree view component for rendering a hierarchy of nodes. * * This component is highly custo

web/oss/src/components/ui/CustomTreeComponent/index.tsx

ApiResponse (Interface)

(no doc)

web/tests/utils/testmail/types.d.ts

Core symbols most depended-on inside this repo

json

called by 981

sdk/agenta/client/backend/core/pydantic_utilities.py

get

called by 721

sdk/agenta/sdk/utils/cache.py

get

called by 662

api/oss/src/dbs/postgres/secrets/dao.py

parse_obj_as

called by 442

sdk/agenta/client/backend/core/pydantic_utilities.py

error

called by 294

api/oss/src/utils/logging.py

request

called by 255

sdk/agenta/client/backend/core/http_client.py

core_session

called by 159

api/oss/src/dbs/postgres/shared/engine.py

jsonable_encoder

called by 151

sdk/agenta/client/backend/core/jsonable_encoder.py

Shape

Function 2,454

Method 1,303

Class 894

Interface 487

Route 143

Enum 18

Languages

Python65%

TypeScript35%

Modules by API surface

api/oss/src/services/db_manager.py109 symbols

web/oss/src/lib/Types.ts80 symbols

sdk/agenta/sdk/types.py58 symbols

sdk/agenta/sdk/tracing/inline.py49 symbols

api/oss/src/services/evaluators_service.py48 symbols

sdk/agenta/client/client.py46 symbols

api/oss/src/core/observability/dtos.py44 symbols

sdk/agenta/client/backend/variants/client.py40 symbols

api/oss/src/models/api/evaluation_model.py40 symbols

web/oss/src/lib/helpers/utils.ts39 symbols

sdk/tests/legacy/new_tests/conftest.py38 symbols

api/oss/tests/legacy/conftest.py37 symbols

Dependencies from manifests, versioned

@ag-grid-community/client-side-row-model32.3.4 · 1×

@ag-grid-community/core32.3.4 · 1×

@ag-grid-community/csv-export32.3.4 · 1×

@ag-grid-community/react32.3.4 · 1×

@ag-grid-community/styles32.3.4 · 1×

@agenta/eeworkspace:./ee · 1×

@agenta/ossworkspace:./oss · 1×

@agenta/web-testsworkspace:../tests · 1×

@ant-design/colors7.2.0 · 1×

@ant-design/cssinjs1.22.1 · 1×

@ant-design/icons5.5.2 · 1×

@ant-design/v5-patch-for-react-191.0.3 · 1×

Datastores touched

agenta_ossDatabase · 1 repos

For agents

$ claude mcp add agenta \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact