> For the complete documentation index, see [llms.txt](https://docs.duplocloud.com/docs/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.duplocloud.com/docs/faqs.md).

# FAQs

## Getting Started

<details>

<summary>What does DuploCloud's infrastructure look like at a high level?</summary>

DuploCloud runs within your own cloud account. The platform itself is a small footprint — a few Docker containers, a MongoDB instance, and a few S3 buckets.

The product has three main layers:

1. **Workspace** — the organizational unit where all DevOps work is orchestrated. Each Workspace bundles together Scopes (access to your infrastructure) and Personas (sets of Skills that define how the agent behaves). You can create multiple Workspaces and invite users to specific ones, giving different teams precisely the access and capabilities they need.
2. **AI DevOps Platform** — the work surface for task-level and project-level objectives. Accessible via web browser, Slack, Teams, or directly in an IDE, this is where tickets are created and assigned to the agent. For simple one-off tasks, create a Ticket directly. For large, complex work, use Projects — which break the work into a Spec, Plan, and Tasks that the agent executes with your oversight.
3. **Integrations** — the connectivity layer that links the agent to your actual infrastructure through cloud providers (AWS, GCP, Azure), Kubernetes clusters (EKS, AKS, GKE), Git repositories (GitHub, GitLab, Bitbucket), observability tools, and MCP servers. Access is granted through Providers and Scopes, with temporary just-in-time credentials passed to the agent at execution time.

Your existing infrastructure — Terraform state, Kubernetes clusters, CI/CD pipelines — is not migrated or replaced. The agent connects to it through the integrations layer using the permissions you define.

</details>

<details>

<summary>What makes DuploCloud different from using Claude Code or similar AI coding tools?</summary>

DuploCloud is an AI DevOps platform, not a single-user coding assistant. The key differences:

**1. Shared system of intelligence — context doesn't live on individual laptops** With Claude Code or Cursor, every session starts from scratch. Work done by one engineer isn't visible to teammates, investigations can't be handed off mid-task, and the same context has to be re-established every time. DuploCloud centralises this: every ticket, investigation, and outcome is stored in a shared knowledge base. The second time a similar task runs — a migration, a compliance check, a cluster upgrade — the agent already knows your environment and asks significantly fewer questions.

**2. Team collaboration and handoff** A task started by one engineer can be continued by another without losing context. Projects, tickets, and plans are visible across the team in a shared workspace, and the agent can work on team members' tasks simultaneously.

**3. Scale and security — credentials managed centrally, not tied to individual laptops** DuploCloud manages credentials at the platform level, generating scoped, temporary access per ticket. Because the platform runs in your cloud rather than on a developer's laptop, it isn't constrained by local machine availability, access, or session limits. The agent is also restricted to the specific repositories and cloud accounts defined in the Scope for that task — it cannot reach outside its boundaries to gather context, which is a common failure mode with local AI tools.

**4. Always-on, not session-bound** A long-running task continues after hours, results are surfaced when complete, and the system keeps working while the team is offline.

If your team already uses Claude Code or Cursor for local development, DuploCloud acts as the coordination and record-keeping layer on top — making individual AI work visible, auditable, and collaborative at the team level.

</details>

<details>

<summary>We already have DevOps engineers. Why would we use DuploCloud?</summary>

The same reason companies with thousands of software engineers still give every developer Claude Code or Cursor: AI tools act as a force multiplier, not a replacement. Your engineers direct the work; DuploCloud's AI DevOps Agent handles the repetitive and time-consuming parts — routine infrastructure tickets, compliance evidence collection, PR reviews, EKS optimisation passes, cost analysis — so your team can focus on higher-leverage work.

For teams with lean or overloaded DevOps functions, this means eliminating the intake ticket backlog and reducing context-switching. For larger teams, it means running complex projects in parallel rather than sequentially, and removing single points of failure when engineers are unavailable.

DuploCloud also handles work that falls between the cracks of typical DevOps tooling: tribal knowledge documentation, cross-environment compliance scanning, and ongoing infrastructure hygiene — work that's important but rarely urgent enough to get prioritised.

</details>

<details>

<summary>We don't have a dedicated DevOps engineer. Is DuploCloud a viable alternative to hiring one?</summary>

For many teams, yes. A dedicated DevOps engineer typically costs $150–200K or more per year in salary; DuploCloud starts at a fraction of that. Beyond cost, the platform works continuously — tickets can run after hours, the agent can monitor your environment around the clock, and there's no on-call rotation.

The clearest fit is teams where routine deployments, compliance evidence collection, cost optimisation, and incident triage consume significant engineering time that your existing team has to absorb.

DuploCloud is not a substitute for a senior engineer making complex architectural decisions. But for the recurring, execution-heavy work that typically occupies most of a DevOps engineer's time, the platform handles it faster and at lower cost.

If you're at the stage of considering your first DevOps hire, a 30-day PoC scoped to tasks from your actual backlog is a practical way to evaluate whether the platform covers what you'd be hiring for.

</details>

<details>

<summary>Is DuploCloud similar to Heroku in terms of simplicity?</summary>

Yes — the ease of use is comparable. Teams often start on Heroku for its simplicity and move to AWS for production scale; DuploCloud is designed to give you Heroku-like simplicity on top of AWS (and GCP and Azure), without the cost and limitations of Heroku at scale.

The main differences: DuploCloud gives you access to the full breadth of cloud-native services that Heroku abstracts away, and DuploCloud plus your cloud provider is generally significantly more cost-effective at higher traffic volumes. The added flexibility does introduce some complexity, but DuploCloud's support and AI agent are there to absorb that complexity rather than pass it to your team.

</details>

<details>

<summary>How long does it take to get started?</summary>

The platform is designed to be operational quickly. Setup involves deploying a few Docker containers, connecting your cloud and Git providers, and configuring the AI DevOps Platform with the appropriate Skills and Scopes. All of which can be done in a few minutes, not days.

The 30-day PoC is structured to deliver measurable results against real infrastructure within the first sprint. Please contact the team to start scoping your onboarding.

</details>

<details>

<summary>What does DuploCloud set up for us during onboarding?</summary>

DuploCloud's team handles the initial platform setup as part of onboarding. This includes deploying the platform to your cloud environment, connecting your cloud and Git providers, and configuring your Workspaces, Skills, Scopes, and MCP server integrations for your ticketing and observability tools.

The overall onboarding flow follows the same structure as before — dev deployment, evaluation, QA, production cutover — but the project plan is now managed inside the product rather than a spreadsheet, so your team can track and collaborate on it in real time.

The team will also configure Skills to reflect your code conventions and operational standards before handoff, so the agent is working to your patterns from day one.

</details>

<details>

<summary>What responsibilities remain with our team after onboarding?</summary>

Your team retains ownership of the application layer — delivering containers, managing dependencies and patches within those containers, and making business decisions about what to build and prioritise. DuploCloud owns running the application and all associated cloud and infrastructure concerns: provisioning, scaling, compliance, cost management, and day-to-day operational work.

In practice:

* **Application ownership** — your team owns Docker containers, application code, and anything inside them.
* **Infrastructure ownership** — DuploCloud manages cloud infrastructure, Kubernetes configuration, CI/CD pipeline setup, compliance controls, and operational runbooks.
* **Decision authority** — your team retains final approval on every infrastructure change before it is executed. Engineers direct the work; the agent and DuploCloud's operations team execute it.

The platform is self-service once onboarded — your engineers can initiate and approve tickets directly — but the standard expectation during onboarding is that DuploCloud's team drives the work while your team reviews and approves.

</details>

<details>

<summary>Who performs the day-to-day DevOps work — the AI agent, DuploCloud engineers , or our team?</summary>

All three, depending on the work and service model:

**The AI agent** handles execution — running commands, generating infrastructure plans, analysing logs, producing diffs, and surfacing results for approval. Every action requires human sign-off before it is applied.

**DuploCloud's engineers** configure and maintain the agent, and handle work that requires human judgment — complex incidents, sensitive infrastructure changes, and anything outside the agent's defined Scope. Under the fully managed model, DuploCloud's team is accountable for outcomes, not just tooling.

**Your engineers** direct priorities, review and approve proposed changes, and own the application layer. Once onboarded, the platform is designed to be self-service — your team can run tickets directly without going through DuploCloud.

The balance shifts based on which service tier you choose:

* **Fully managed** — DuploCloud's team owns day-to-day execution; your team sets direction and approves changes.
* **Hybrid** — your team handles routine work; DuploCloud handles complex or sensitive tasks.
* **Self-serve** — your team runs everything; DuploCloud provides the platform and support.

</details>

<details>

<summary>Does deploying DuploCloud change or disrupt our existing infrastructure?</summary>

No. DuploCloud deploys as a small set of containers inside your existing cloud account — it connects to your infrastructure rather than replacing it. Your Terraform state, Kubernetes manifests, CI/CD pipelines, and running workloads are not touched during onboarding.

If you choose to stop using DuploCloud, your infrastructure continues running exactly as it was; nothing is locked in.

Onboarding typically requires about one meeting per week from your side — DuploCloud's team handles the setup, configuration, and integration work.

</details>

<details>

<summary>Why is a new namespace created during deployment? Will it interfere with our existing workloads?</summary>

The new namespace is specifically designed to enforce isolation from your existing workloads. Your current services and infrastructure are unaffected.

</details>

<details>

<summary>Why is HDv2 deployed in prod by default, even if I have multiple portals?</summary>

A single HDv2 instance in your prod account can manage all of your environments — prod, nonprod, staging, and more — through its multi-provider model. This reduces operational overhead and aligns with HDv2's design as a unified control plane. Multiple deployments are available if explicitly required, but a single prod deployment is the recommended default.

</details>

<details>

<summary>Can DuploCloud scan our existing infrastructure and identify what still needs to be done?</summary>

Yes — this is the standard starting point for any project. When you create a Project Plan, you provide the platform with access to your Git repositories and cloud accounts (via Scopes). The planning phase scans what already exists and generates tasks only for what's missing or non-compliant with the target spec.

If you're partway through a migration, the agent picks up from where your team left off — assessing the current state, identifying the remaining gaps, and producing a prioritized task list with code reviews for each delta. You don't start from scratch.

</details>

<details>

<summary>What cloud providers and platforms are supported?</summary>

| Category            | Supported                                                                 |
| ------------------- | ------------------------------------------------------------------------- |
| Cloud               | AWS, GCP, Azure                                                           |
| Kubernetes          | EKS, AKS, GKE, RHOS                                                       |
| Git                 | GitHub, GitLab, Bitbucket                                                 |
| Observability       | OpenTelemetry, Datadog, New Relic, Sentry                                 |
| Incident Management | Grafana Alert Manager, Datadog, New Relic, Sentry, PagerDuty, Incident.io |
| Extended access     | MCP Servers (any system with an MCP endpoint)                             |

See [Providers](/docs/armor/providers.md) for the full list and setup instructions.

</details>

<details>

<summary>Do you support self-hosted or on-premise deployments?</summary>

DuploCloud runs within your own cloud environment — your infrastructure, your accounts, your data. The platform ensures sensitive data never leaves your cloud environment.

For customers with strict data residency or on-premise requirements, contact <support@duplocloud.net> to discuss deployment options.

</details>

<details>

<summary>How are updates and security patches delivered for self-hosted deployments?</summary>

Updates are delivered as new Helm chart versions and Docker image tags. The platform consists of 3–4 containers — updating means pulling the new images and running a Helm upgrade, with no data migration required in typical releases.

DuploCloud maintains a regular release cadence and notifies customers when updates are available. Security patches are released on an accelerated schedule as needed. Because the platform is self-hosted, you control the timing of all updates — nothing is applied to your environment automatically.

For teams who want automated update management, the platform can be configured to watch for new image tags and apply updates through your existing CD pipeline or GitOps workflow.

</details>

## Agents & Customisation

<details>

<summary>Do you use MCP servers or APIs to access AWS, Kubernetes, etc.?</summary>

It depends on the agent. For AWS and Kubernetes, the platform primarily uses the CLI — LLMs have strong CLI comprehension and it provides precise, auditable execution. For third-party systems that publish MCP servers (observability tools, ticketing systems, etc.), DuploCloud uses those MCP endpoints directly.

The agent is flexible. DuploCloud's core value is in the overall orchestration layer — the agent can be modified or replaced for your specific environment. See [MCP Servers](/docs/armor/mcp-servers.md) for configuration details.

</details>

<details>

<summary>How do you handle long-running jobs?</summary>

The platform supports two communication modes:

* **Synchronous** — for short, fast-turnaround tasks where the result is returned inline.
* **Pub-sub (asynchronous)** — for long-running tasks such as code reviews that require a code checkout, analysis, and structured output. The agent publishes results when complete; no session needs to remain open.

Long-running tasks like generating code reviews or large deployments use the pub-sub model automatically.

</details>

<details>

<summary>Is agent memory persistent?</summary>

The agent is stateless — each execution starts fresh with the context provided in the ticket. Persistence lives at the help desk layer: every ticket maintains a full history of the investigation, actions taken, and outcomes. This history is stored in the platform's Knowledge Base and is accessible to the agent when working on related tickets.

The result is shared, searchable memory at the system level without the agent needing to carry state between runs. The agent working on a follow-up ticket can query prior work, and human team members can review or build on the full investigation history.

</details>

<details>

<summary>How do you give the AI agent context?</summary>

Context is assembled from four layers and delivered to the agent as part of each ticket:

1. **Graph database** — DuploCloud maintains a graph of your infrastructure that captures relationships between hosts, services, pods, dependencies, and cloud resources — giving the agent a structured, queryable map of your environment rather than just flat text.
2. **Knowledge Base retrieval** — the platform uses vector search over the platform's Knowledge Base (previous tickets, runbooks, architecture notes) to pull relevant prior work into the prompt.
3. **Skills** — best practices, guardrails, and operational patterns are encoded as Skills and included in the agent's system prompt. This is how domain expertise is consistently applied without relying on the model to infer it.
4. **Scope credentials** — the agent receives temporary, just-in-time credentials scoped to the exact resources it's permitted to access, so it has the access it needs without ever needing to ask for it.

The result is a multi-layer context strategy: graph relationships for infrastructure awareness, vector retrieval for institutional knowledge, Skills for operational expertise, and Scopes for safe execution.

</details>

<details>

<summary>Does the agent ask clarifying questions when a task is underspecified?</summary>

Yes — this is built into the Project flow. When you create a Project, the first step is a Spec: the agent interviews you, surfaces the configuration decisions that matter, and drafts a structured description of what it understands you want before doing anything.

The spec phase exists precisely for the "I don't know what I don't know" problem — it prompts for choices you might not have thought to specify (retention policies, failover behaviour, access patterns, environment boundaries) rather than silently assuming defaults. Once you've reviewed and confirmed the spec, the agent generates a detailed plan and task list for your approval before any execution begins.

On the first run of a new task type, the agent asks more questions to understand your environment and preferences. On subsequent similar tasks, it asks significantly fewer — the knowledge base retains what it learned about your setup, so you're not re-explaining your conventions from scratch each time.

For one-off Help Desk tickets, the agent works with the context it has and follows up inline if clarification is needed before taking an action.

</details>

<details>

<summary>Does DuploCloud's AI actually execute tasks, or does it just give recommendations?</summary>

It executes. When assigned a ticket, DuploCloud's agent runs real commands against your infrastructure — `kubectl` operations, AWS CLI calls, Terraform plans and applies — and surfaces the results for your review before any changes are committed.

The workflow is: the agent takes action, produces a diff or output, and presents it with an explanation. You approve or reject before anything is applied. For example, an EKS cost optimisation ticket might result in the agent analysing 12 nodes, identifying memory and CPU inefficiencies across workloads, and proposing specific resource adjustments — all executable in one click after your review.

This is different from advisory tools that generate recommendations you implement manually. The work happens inside DuploCloud, with a human in the approval loop.

</details>

<details>

<summary>Why use specialized personas rather than one agent with all skills?</summary>

A single agent with all skills would have a very broad system prompt — which degrades LLM performance. Smaller, focused context windows produce more accurate and reliable outputs than large, all-encompassing ones.

Specialized personas also make it easier to enforce security boundaries. An agent scoped to Kubernetes operations has no access to your Git credentials or AWS account — it can only do what its Scope allows. Mixing all capabilities into one agent would require broader permissions and increase the blast radius of any mistake.

</details>

<details>

<summary>Can the AI agent assist with compliance evidence collection and audit preparation?</summary>

Yes — this is one of the platform's core use cases. The agent can run scheduled compliance checks across your cloud environments and produce structured evidence artifacts automatically.

Specific capabilities include:

* Cross-account scans against compliance controls (SOC 2, HITRUST, ISO 27001)
* Continuous drift detection: the agent flags resources that fall out of compliance between audit cycles, rather than discovering gaps at audit time
* Evidence packaging for auditor review, drawn from ticket history, logs, and live environment state
* GRC platform integration: the agent keeps controls green in platforms like Drata and Vanta by flagging issues as they arise
* Persistent runbooks and control attestation in the knowledge base, making each subsequent audit cycle faster

The agent operates with read-only scope by default. Remediation actions require explicit human approval before execution — the platform notifies; it does not auto-remediate.

For teams who want DuploCloud to take accountability for compliance outcomes — evidence collection, ongoing control monitoring, and auditor liaison — this is available as a managed service.

</details>

<details>

<summary>Are the prebuilt agent and skills built only by DuploCloud, or can others contribute?</summary>

The prebuilt agent and skills are developed and maintained by DuploCloud, in close collaboration with customers and partners. They are not community-sourced in the open-source sense — this means every skill ships with a quality bar and has been validated against real workloads.

That said, the platform is fully extensible. You can build your own skills, modify existing ones, or use skills published by third-party vendors. For example, Hashicorp publishes a Terraform skill that you can plug directly into the agent.

</details>

<details>

<summary>How much work is it to set up and maintain the agent and skills?</summary>

Initial setup is handled by DuploCloud's team as part of onboarding — the DevOps agent, personas, Skills, and Scopes are configured against your environment before you run your first task.

Ongoing maintenance depends on which service model you choose:

* **Fully managed** — DuploCloud's operations team owns the agent, keeps Skills updated, and is accountable for task outcomes. You direct the work (priorities, what to tackle next); the team handles the rest.
* **Hybrid** — your team runs day-to-day tasks, with DuploCloud available for complex or sensitive work.
* **Self-serve** — your team owns configuration and upkeep. The prebuilt agent and Skills are maintained by DuploCloud and require no ongoing effort from you, but custom agent Skills you configure yourself are your responsibility.

Skills encode their logic as explicit, versioned instructions — not trained weights. Updating a Skill means editing text, not retraining a model. Prebuilt Skills don't accumulate drift over time.

</details>

<details>

<summary>Can we automate our existing runbooks and release processes?</summary>

Yes — this is a direct use case. Documented processes (release checklists, hotfix procedures, incident runbooks) can be converted into Skills, which the agent executes step-by-step with the same guardrails applied to any other task: scoped credentials, human approval before execution, and a full audit trail.

The conversion is straightforward: your runbook becomes a structured Skill that the agent follows. On a release trigger, the agent works through the steps, surfaces any exceptions or decisions that require human input, and completes the process. Engineers stay in the loop without needing to run every command themselves.

For teams running hotfixes and deployments every few days with a manual process, this is typically one of the first workflows automated after onboarding.

</details>

<details>

<summary>What LLMs are supported</summary>

The DuploCloud DevOps agent uses LLMs from Anthropic. Haiku, Sonnet, or Opus can be consumed from LLM Providers AWS Bedrock, GCP Vertex, or Azure Foundry.

</details>

<details>

<summary>What AI back-ends does DuploCloud use, and why?</summary>

DuploCloud works with managed LLM services from major cloud providers — AWS Bedrock, GCP Vertex AI, and Azure AI Foundry, for example — depending on your cloud environment. Using managed services means your data stays within your own cloud account and is not used to train third-party models. This is important for enterprise security and compliance requirements.

The platform is model-agnostic at the agent level. DuploCloud's team continuously evaluates new models as they are released and updates default model assignments based on what performs best for each task type — reasoning-heavy tasks like Terraform plan analysis may use a different model than higher-volume tasks like log summarisation. Customers can always override the default and choose specific models for the agent.

</details>

<details>

<summary>Does my choice of container platform affect the quality of AI output?</summary>

In practice, yes. LLMs perform better against widely adopted, open-standard platforms — such as Kubernetes (EKS, GKE, AKS) — than against proprietary or less common orchestration systems. This is because the volume of public documentation, community discussion, and training data is significantly higher for Kubernetes than for alternatives like ECS.

This doesn't mean proprietary platforms aren't supported — they are. But for complex tasks like troubleshooting, cost optimisation, and infrastructure generation, you'll typically get more accurate and detailed output on Kubernetes-based environments.

If you're choosing between platforms and AI-assisted operations is a priority, DuploCloud will factor this into its recommendation during the scoping phase.

</details>

## Integration & Tooling

<details>

<summary>Can you show us some Jenkins agents?</summary>

Yes — DuploCloud has deployed Jenkins agents for multiple customers. The DuploCloud DevOps agent supports Jenkins and GitHub Actions pipeline troubleshooting. Please contact the team to arrange a targeted demonstration.

</details>

<details>

<summary>Can we use our existing Terraform, Helm, or other IaC?</summary>

Yes. The platform includes a Terraform Skill out of the box, covering plan, apply, state management, and error handling. Helm and Kubernetes deployments are handled by the DevOps agent and Skills. External Skill packages from HashiCorp and Pulumi can also be made available to the agent. Your existing IaC files, modules, and conventions are used as-is — the agent works with your code, not a replacement for it.

</details>

<details>

<summary>Will AI-generated code follow our internal conventions and patterns?</summary>

Yes. Two mechanisms ensure this:

1. **Repository access** — the agent with access to your infrastructure repository reads your existing code before generating anything new. It infers your naming conventions, module structure, and organisational patterns and applies them to new output rather than falling back on generic defaults.
2. **Skills** — you can explicitly encode your standards as a Skill (module naming conventions, required tags, resource organisation patterns). Skills are included in the agent's system prompt and applied consistently to every task regardless of who initiates it.

For Terraform specifically, the agent generates code within your existing directory structure and variable conventions. If the agent is uncertain about a convention, it surfaces the decision in the spec or plan phase for you to confirm before generating any code.

Code review is part of the standard workflow. The platform presents the diff in the ticket interface alongside the agent's reasoning before any changes are applied — you're never committed to output you haven't reviewed.

</details>

<details>

<summary>Can proposed infrastructure changes be delivered as a pull request before being applied?</summary>

Yes. For infrastructure-as-code changes, the platform supports two execution modes:

* **Direct apply** — the agent generates the plan, presents the diff for your approval in the ticket interface, and applies it after sign-off.
* **Pull request mode** — the agent opens a PR with the proposed changes against your repository. The change is applied only after your team merges it through your standard review process.

PR mode is the recommended approach for teams that want changes to flow through existing Git review workflows — code owners, required reviewers, branch protection rules, and CI checks all apply as normal.

For teams using GitOps (Flux, ArgoCD), PR mode integrates naturally — the agent opens the PR, your pipeline detects the merge, and the change is applied by the reconciler in the usual way.

</details>

<details>

<summary>Does DuploCloud support GitOps workflows (Flux, ArgoCD)?</summary>

Yes. The agent can be configured with custom Skills for GitOps tools like Flux and ArgoCD. The platform's core model — the agent operating on your Git repositories with scoped access and a full audit trail of proposed changes — maps naturally to GitOps pull-based delivery.

The agent, configured for GitOps, can manage Flux Kustomizations, HelmReleases, and GitRepository resources alongside your existing reconciliation workflow. Contact the team to scope a GitOps configuration for your environment.

</details>

<details>

<summary>How does DuploCloud handle Terraform variable management across environments?</summary>

Terraform variable management is addressed at three levels:

1. **System of record** — variables and configuration are backed by your Git repositories. The platform treats your existing repo structure as the source of truth and works within it rather than replacing it.
2. **Scope-based access control** — each environment (dev, staging, production) is modeled as a separate Scope with its own credentials and boundaries. The DevOps agent only accesses the variables relevant to the Scope it's operating in, preventing cross-environment leakage.
3. **Skills** — Terraform Skills encode best practices for module structure, variable organization, and environment promotion patterns, ensuring consistency across environments regardless of which team member or agent is making changes.

If you're partway through a migration, the platform can scan your existing repositories and cloud accounts to identify gaps and generate a remediation plan.

</details>

<details>

<summary>How does DuploCloud integrate with our existing CI/CD pipeline?</summary>

Git repositories (GitHub, GitLab, Bitbucket) are modeled as [Providers](/docs/armor/providers.md) with scoped access. The DuploCloud DevOps agent integrates with Jenkins and GitHub Actions for pipeline troubleshooting and automation. For deeper pipeline integration, custom Skills can be configured to fit your specific workflow.

</details>

<details>

<summary>Does DuploCloud integrate with project management tools like Jira, Notion, and GitHub?</summary>

Yes. The platform connects to project management and documentation tools through the Provider and MCP server model.

Jira and GitHub are supported as out-of-the-box integrations — the agent can create, update, and link Jira tickets to DuploCloud tasks, and open pull requests on your repositories directly from a ticket. Documentation tools like Notion can be connected via MCP servers to provide the agent with access to your specs and design documents during the planning phase.

A common workflow: a spec lives in Notion, the agent reads it during the project planning phase, generates a detailed task breakdown, and creates corresponding Jira tickets. As tasks are completed, the Jira tickets update to reflect progress. The full project history is retained in DuploCloud, while your existing project tracking system remains the system of record.

</details>

<details>

<summary>We already use Drata, Vanta, or Thoropass. Does DuploCloud replace them?</summary>

No — they're complementary. GRC platforms like Drata, Vanta, and Thoropass identify compliance gaps and manage the audit workflow. DuploCloud does the technical work to close those gaps: building controls into your infrastructure, remediating findings, and keeping them green as your environment evolves.

A common pattern: your GRC platform flags a control as failing; DuploCloud's agent identifies the root cause, proposes a remediation, and executes it after your approval. DuploCloud integrates directly with GRC platforms to keep control statuses current.

</details>

## Security & Access

<details>

<summary>Do we need full admin access?</summary>

No. You don't need to grant any access to get started. DuploCloud's stack runs as a few Docker containers alongside a MongoDB instance and two S3 buckets — no privileged access to your environment is required upfront.

Access is granted on your terms through [Providers](/docs/armor/providers.md) and Scopes. The platform uses IAM permissions defined in each Scope to generate temporary, just-in-time credentials that are passed to the agent as part of the ticket. You control exactly what the agent can and cannot touch.

</details>

<details>

<summary>The AI HelpDesk runs in our prod environment — does that mean it has full access to everything?</summary>

No. Access is entirely determined by the permissions you choose to grant. For example, you can give the AI Suite Kubernetes-only access to a staging portal while withholding AWS access entirely. The platform operates as a single control plane that connects to multiple providers, each scoped to exactly what you allow.

</details>

<details>

<summary>If HDv2 only runs in prod, how do nonprod users access it — and does adding them to prod give them prod access?</summary>

Users are added to the prod portal but their access is strictly scoped within DuploCloud to allow only what is provided to it. Access is determined by assigned scopes, not by where the portal is deployed. A user can also be assigned to multiple Workspaces with different scopes — for example, read-only prod and full-access nonprod.

</details>

<details>

<summary>How do you give access to Git?</summary>

Git is modeled as a Provider — the same way AWS, Kubernetes, and observability tools are. To give the DevOps agent Git access:

1. Navigate to **Providers** and add your Git provider (GitHub, GitLab, or Bitbucket).
2. Add your repository credentials under the **Credentials** tab.
3. Create a **Scope** — a named token with defined boundaries over specific repositories.
4. When creating a ticket, select the appropriate Scope.

See [Providers](/docs/armor/providers.md) for step-by-step instructions.

</details>

<details>

<summary>How are credentials stored and secured?</summary>

Credentials are stored in DuploCloud or referenced from your own secrets manager. The platform uses them to generate scoped, temporary access at execution time — credentials are never passed to the agent directly or stored in session context.

Each Scope defines the exact resources the DevOps agent can access. Guardrails can further restrict specific resources, operations, or environments within that Scope. See [AI DevOps Policy Model](/docs/armor/ai-devops-policy-model.md) — Provider and Scope.

</details>

<details>

<summary>What is the audit trail for AI actions?</summary>

Every ticket maintains a full context and audit trail throughout its lifecycle — what the agent was asked to do, what it proposed, what was approved, and what was executed. Completed task history is stored in the platform's Knowledge Base, queryable for future reference.

</details>

<details>

<summary>What visibility do we have into every action taken in our cloud environment?</summary>

There are two complementary layers:

**DuploCloud ticket history** — every ticket maintains a complete record of what the agent was asked to do, what commands it proposed, what was approved, and what was executed, including the full diff for any changes. This history is searchable from the Knowledge Base.

**Cloud-native audit trails** — all agent actions are executed through standard cloud and infrastructure interfaces, and appear in your existing audit infrastructure. Your cloud provider's audit logging records every API call the agent makes, including the identity it used and the timestamp. These records live in your account, independent of DuploCloud.

If an incident needs investigation, you can trace through both layers: the DuploCloud ticket shows the intent, the approval, and the exact commands proposed; your cloud provider's logs show the corresponding API calls with credentials and timestamps.

</details>

<details>

<summary>How do you protect our environment from the AI agent?</summary>

Protection is enforced at the infrastructure level, not just the policy level — meaning the constraints are structural and cannot be bypassed by the agent regardless of what it's asked to do.

Three layers apply:

1. **Scoped credentials** — the agent receives temporary, just-in-time credentials generated from the Scope you assign to the ticket. Those credentials carry only the permissions you defined. If the Scope doesn't include permission to modify a particular resource or environment, the agent cannot do it — not because it's been instructed not to, but because the credentials don't allow it.
2. **Skills as guardrails** — operational guardrails (safety checks, approval steps, resource boundaries) are encoded as Skills and applied to every task. Skills are explicit, versioned instructions evaluated before execution — not model-dependent inference that could vary between runs.
3. **Human approval before every execution** — the agent proposes changes and waits for explicit human approval before applying anything. No action is taken autonomously on production infrastructure.

The posture is: make bad outcomes structurally impossible first, then add Skills and approval flows as additional layers on top.

</details>

<details>

<summary>What if a user accidentally pastes credentials or secrets into a prompt?</summary>

DuploCloud applies a security validation Skill to the agent. When the agent detects that a prompt contains patterns consistent with secrets — API keys, access tokens, passwords, or cloud provider credentials — it refuses to process the request and returns a warning explaining why.

More broadly, the platform is designed so that users never need to supply credentials in prompts at all. Credentials are managed at the Scope level and injected as temporary, just-in-time credentials at execution time. If a user tries to bypass this by pasting credentials directly, the security Skill acts as a catch.

For stronger guarantees, Scopes can be configured to remove sensitive credential access from the user-facing layer entirely — the agent simply doesn't have the access to misuse.

</details>

<details>

<summary>What compliance certifications does DuploCloud have?</summary>

DuploCloud is SOC 2 certified. Full security documentation is available for procurement review. The platform is used by customers in regulated industries including fintech and healthcare. Contact <support@duplocloud.net> for compliance documentation.

</details>

<details>

<summary>Does DuploCloud provide SOC 2 certification or conduct security audits?</summary>

No — DuploCloud is not an auditor and does not issue certifications. What DuploCloud does is ensure your infrastructure meets SOC 2, HIPAA, HITRUST, PCI, and other framework requirements on an ongoing basis: compliance controls are built directly into every deployment, the agent continuously scans for drift, and evidence is collected and packaged automatically for auditor review.

When you're ready for formal attestation, you engage a qualified auditor directly (or through your GRC platform). DuploCloud can connect you with auditors and provides the infrastructure evidence and control documentation they need.

Penetration testing is available as a DuploCloud service offering, which is typically the final step before audit submission.

</details>

<details>

<summary>Does the AI agent collect data in any capacity?</summary>

No. The agent only accesses metrics and logs from within your own cloud account, and only with your approval before any action is taken. No data leaves your account, and no data collection occurs beyond what is needed to respond to your request.

</details>

## Operations & Reliability

<details>

<summary>Who is responsible for AI's mistakes and how do I protect against them?</summary>

There are two layers of protection:

1. **Deterministic, permission-based controls** — the Scope you assign to the DevOps agent defines exactly what IAM permissions it gets. The platform uses those permissions to generate temporary credentials passed to the agent as part of the ticket. The agent cannot act outside those boundaries regardless of what it's asked to do.
2. **Skills** — best practices and operational guardrails are encoded directly into the agent's Skills. Skills define not just what the agent can do, but how it should do it, including safety checks and approval steps.

DuploCloud's human operations team also acts as a reliability layer — reviewing complex work and stepping in when something requires human judgment.

</details>

<details>

<summary>What happens when the AI agent encounters an error during execution?</summary>

The default behaviour when the agent hits an error is to stop, surface the error with an explanation, and wait for human review — not to attempt self-remediation or find an alternative path to completion.

The agent presents:

* What it was attempting to do
* The specific error it encountered
* What it believes the options are, and whether it can proceed safely

For multi-step tasks (a Kubernetes upgrade, an infrastructure apply across environments), you can configure the agent to execute one step at a time, requiring explicit approval before each subsequent action — so you're never committed to a full sequence before reviewing each step individually.

If the agent identifies that it lacks the prerequisites to complete a task — a missing dependency, insufficient permissions, a resource not in the expected state — it surfaces this during the plan phase before any execution begins. The intent is to fail fast and verbosely, not to proceed into a half-complete state.

The agent doesn't take autonomous "best-effort" alternative paths. If you approved a specific plan and an error occurs, the agent stops at the point of failure and reports back. Any retry or alternative approach requires your explicit instruction.

</details>

<details>

<summary>Will our team lose visibility or skills if AI handles the day-to-day DevOps work?</summary>

No — the platform is designed to make AI work transparent and inspectable, not to replace your team's understanding.

Every ticket maintains a complete record of what the agent was asked to do, every command it proposed, the approval that preceded each step, and the outcome — including diffs for any infrastructure changes. Your engineers can follow every action in detail.

Beyond passive visibility:

* The agent explains its reasoning, or can be prompted to provide reasoning, in plain language before each approval step, so your team learns from the work as it happens rather than receiving a black-box result.
* The Ticket History stores every investigation as a record — your team can review how a problem was previously solved or why a particular approach was taken.
* Skills encode operational standards as readable text instructions, not trained model weights — your team can inspect exactly what guardrails govern the agent's behaviour.

Your engineers remain in the approval loop for every proposed change. The platform doesn't execute autonomously unless explicitly asked to; it proposes and waits. Teams that want to validate everything the AI does can do so at the ticket level before any change is applied.

</details>

<details>

<summary>Can the AI agent be trusted to make compliance decisions, or does human judgment still have a role?</summary>

For technically deterministic compliance work — scanning environments for misconfigurations, collecting evidence against a control, verifying that a resource meets a specific policy — the agent performs reliably and can run autonomously within its defined Scope.

For judgment-heavy decisions — interpreting ambiguous regulatory requirements, determining whether a specific implementation satisfies a regulator's intent, or navigating evolving frameworks like the EU AI Act or global data privacy laws — human expertise remains essential. The platform is built around this distinction: the agent proposes findings and actions, humans review and approve before anything is executed. DuploCloud's human operations team is available throughout for guidance on implementation decisions, not just execution.

Three layers apply to every agent action:

1. **Scoped access** — the agent receives only the IAM permissions defined in the ticket's Scope and cannot act outside those boundaries.
2. **Skills** — compliance patterns and guardrails are encoded as Skills and applied consistently, independent of model inference.
3. **Human approvals** — changes to infrastructure or configuration require explicit sign-off before execution.

The practical model: the agent automates evidence collection, continuous monitoring, and remediation recommendations; humans review what those findings mean and decide what to do about them.

</details>

<details>

<summary>How do LLM model updates work, and will they affect the agent?</summary>

DuploCloud is model-agnostic. Each agent is configured to use a specific model through your cloud provider's managed LLM service (e.g., AWS Bedrock, Azure OpenAI), which you control. Model updates are not applied automatically — you decide when to change the model the agent uses.

DuploCloud monitors model performance across its customer base and makes recommendations when a newer model produces meaningfully better results for a specific task type (e.g., Kubernetes operations, Terraform plan analysis). These are recommendations, not forced updates.

Because Skills encode best practices as explicit, versioned instructions, agent behavior remains consistent even as underlying models evolve — the guardrails don't change with the model.

</details>

<details>

<summary>What happens to our data if we stop using DuploCloud?</summary>

Your infrastructure stays in your accounts — Terraform state, Kubernetes manifests, and all provisioned cloud resources remain fully under your control and continue operating. The Knowledge Base and audit trail are your data, stored in your own repositories (generally, as markdown files) and in DuploCloud's vector database, and can be exported at any time. DuploCloud does not own or lock in any of the artifacts produced.

</details>

<details>

<summary>How quickly can we revoke DuploCloud's access if we stop using the platform?</summary>

Immediately. Access is managed through standard cloud credentials and the Provider/Scope model — removing the access credentials your cloud and Git providers granted to DuploCloud is sufficient. There are no proprietary access mechanisms that require a support ticket or offboarding process from DuploCloud to remove.

Your infrastructure continues operating normally after revocation. The platform doesn't hold runtime state that needs to be migrated — it operates as a coordination layer on top of your existing cloud, Kubernetes, and Git setup. Removing access doesn't disrupt deployed workloads, running pipelines, or any provisioned cloud resources.

The Knowledge Base and ticket history are stored in your own repositories and can be exported at any time. Nothing is locked behind DuploCloud's access credentials.

</details>

<details>

<summary>How does DuploCloud help with cloud cost visibility and unexpected cost increases?</summary>

The DuploCloud agent can audit your cloud environment for common cost drivers — unused or oversized resources, missing VPC endpoints generating data egress charges, untagged infrastructure with no cost attribution, and instances left running after workloads moved to managed services.

Cost savings are attributed to specific tickets in the audit trail, giving you a clear record of what was changed and why.

</details>

## Pricing & Billing

<details>

<summary>What is the limit on the number of tokens?</summary>

There is no token-based billing. DuploCloud charges based on **tickets** (tasks completed) and **nodes under management** (infrastructure resources managed by the platform) — not on LLM token consumption. Think of it as the cost of a DevOps engineer for a fraction of the price. Contact the team for a business proposal with specific pricing assurances.

</details>

<details>

<summary>Does using DuploCloud's AI result in additional LLM costs on top of the platform fee?</summary>

Yes, but they are typically small and predictable. When the agent runs, it invokes LLMs through managed cloud services — AWS Bedrock, GCP Vertex AI, or Azure AI Foundry — that run within your own cloud account. Those token costs appear as standard cloud charges in your bill at the provider's listed rates. DuploCloud does not add a markup.

DuploCloud's own billing model does not charge per token — you pay per ticket and per node under management, not per API call. This means your platform costs don't scale unpredictably with usage volume.

For most workloads, the LLM charges per ticket are low relative to the engineering time saved. For teams with cost sensitivity, the agent can be configured to start with minimal Skills and add them incrementally — a lower-token baseline for initial evaluation that can be extended as the platform proves value.

</details>

<details>

<summary>What exactly counts as a "ticket"?</summary>

A ticket is a unit of work assigned to the AI agent. In the workflow, a human approves a Task generated from a Project Plan — at that point, the Task becomes a Ticket and is dispatched to the agent for execution. Each ticket corresponds to one discrete, agent-executed action or investigation. See [AI Helpdesk - Tickets](https://github.com/duplocloud/docs/blob/main/ai-suite/ai-helpdesk/tickets.md) for details.

</details>

<details>

<summary>What does "nodes under management" mean?</summary>

Nodes under management refers to the infrastructure resources — servers, Kubernetes nodes, cloud instances — that DuploCloud actively monitors and operates on. This forms the second dimension of pricing alongside tickets, reflecting the scope of infrastructure the platform is responsible for.

</details>

<details>

<summary>What's included in the 30-day PoC?</summary>

The PoC gives you a working DevOps agent running against your real infrastructure. DuploCloud's human operations team — infrastructure engineers, Kubernetes specialists, and security practitioners — is included to support setup, review complex work, and ensure the PoC runs against tasks from your actual backlog. Contact the team to scope a PoC around your specific environment.

</details>

<details>

<summary>Does DuploCloud track ROI metrics — tickets resolved, PRs generated, time saved?</summary>

Activity is tracked at the ticket level — every completed task records what was done, what commands were executed, what changes were proposed and approved, and how long it took. This data is queryable via the agent: you ask in natural language and it queries project history directly, answering questions like "how many tickets were completed this sprint," "what was the infrastructure cost delta over this project," or "how does AI-assisted throughput compare to estimated manual effort."

An analytics dashboard is on the roadmap. Agent-level observability is already built in: the agent emits OpenTelemetry traces and metrics (latency, failure rates, success rates) to your cloud's monitoring stack (e.g., AWS X-Ray), enabling benchmarking of agent performance over time.

For PoC engagements, the team can structure the evaluation around specific KPIs — establishing a baseline before the PoC starts and measuring against it at the end. A common methodology is comparing the token cost and human oversight time for a given project done with distributed local AI tools against the same project run centrally through DuploCloud.

</details>

<details>

<summary>What does it cost to run DuploCloud in my cloud account?</summary>

DuploCloud AI HelpDesk runs inside a Kubernetes cluster in your own cloud account. Infrastructure costs depend on whether you are deploying into a **dedicated new cluster** or an **existing cluster** with available capacity. Estimates below cover the DuploCloud platform components only — not your existing workloads or LLM usage.

**AWS (EKS)**

*Scenario 1 — Dedicated installation (new EKS cluster)*

| Resource                  | Details                                      | Estimated Monthly Cost |
| ------------------------- | -------------------------------------------- | ---------------------- |
| EKS control plane         | Managed Kubernetes cluster                   | \~$72                  |
| Worker nodes              | 2× `t3a.large` (minimum recommended)         | \~$110                 |
| EFS filesystem            | Shared storage for agent working directories | \~$10–15               |
| Application Load Balancer | HTTPS ingress                                | \~$10–18               |
| ACM certificate           | TLS for your portal domain                   | Free                   |
| **Total**                 |                                              | **\~$202–215/month**   |

*Scenario 2 — Existing EKS cluster*

If you already have an EKS cluster with available node capacity, only the incremental resources are added:

| Resource                  | Details                                            | Estimated Monthly Cost                       |
| ------------------------- | -------------------------------------------------- | -------------------------------------------- |
| EFS filesystem            | New filesystem provisioned for HelpDesk storage    | \~$10–15                                     |
| Application Load Balancer | New ALB for HelpDesk ingress                       | \~$10–18                                     |
| Additional node capacity  | Only if existing nodes cannot absorb HelpDesk pods | $0–$55/node                                  |
| **Total**                 |                                                    | **\~$20–90/month** (excluding any new nodes) |

Costs vary by region. `us-east-1` is typically the least expensive AWS region.

***

**GCP (GKE)**

*Scenario 1 — Dedicated installation (new GKE cluster)*

| Resource                   | Details                                         | Estimated Monthly Cost |
| -------------------------- | ----------------------------------------------- | ---------------------- |
| GKE cluster management fee | Standard zonal cluster                          | \~$74                  |
| Worker nodes               | 2× `e2-standard-2`                              | \~$98                  |
| Cloud Filestore            | Shared NFS volume (1 TB minimum for basic tier) | \~$204                 |
| Cloud Load Balancing       | HTTPS forwarding rule                           | \~$18                  |
| **Total**                  |                                                 | **\~$394/month**       |

Cloud Filestore's 1 TB minimum provisioned size is the largest single cost driver on GCP.

*Scenario 2 — Existing GKE cluster*

| Resource                 | Details                                            | Estimated Monthly Cost                     |
| ------------------------ | -------------------------------------------------- | ------------------------------------------ |
| Cloud Filestore          | New instance for HelpDesk storage (1 TB minimum)   | \~$204                                     |
| Cloud Load Balancing     | New forwarding rule for HelpDesk ingress           | \~$18                                      |
| Additional node capacity | Only if existing nodes cannot absorb HelpDesk pods | $0–$49/node                                |
| **Total**                |                                                    | **\~$222/month** (excluding any new nodes) |

If your cluster already has an existing Filestore instance with available capacity, it can be shared — removing that $204 cost.

***

**Azure (AKS)**

*Scenario 1 — Dedicated installation (new AKS node pool)*

| Resource                     | Details                                      | Estimated Monthly Cost |
| ---------------------------- | -------------------------------------------- | ---------------------- |
| AKS cluster management       | Managed Kubernetes control plane             | Free                   |
| Worker nodes                 | 2× `Standard_D2s_v3` (minimum recommended)   | \~$140                 |
| Azure Application Gateway v2 | HTTPS ingress                                | \~$180                 |
| Azure Blob NFS Premium       | Shared storage for agent working directories | \~$20                  |
| **Total**                    |                                              | **\~$340/month**       |

Azure Application Gateway has a significant fixed hourly cost (\~$0.246/hr) that dominates the estimate.

*Scenario 2 — Existing AKS cluster*

| Resource                     | Details                                            | Estimated Monthly Cost                                 |
| ---------------------------- | -------------------------------------------------- | ------------------------------------------------------ |
| Azure Blob NFS Premium       | New storage account for HelpDesk                   | \~$20                                                  |
| Azure Application Gateway v2 | New gateway, or shared with existing if available  | \~$0–$180                                              |
| Additional node capacity     | Only if existing nodes cannot absorb HelpDesk pods | $0–$70/node                                            |
| **Total**                    |                                                    | **\~$20–200/month** (depending on App Gateway sharing) |

If your organization already runs an Azure Application Gateway, the HelpDesk ingress can be configured to share it — removing the largest cost item from the incremental estimate.

***

**What is not included in these estimates:**

* **LLM usage** — when the agent runs, it invokes models through AWS Bedrock, GCP Vertex AI, or Azure AI Foundry running inside your account. Those charges appear separately in your cloud bill at standard provider rates. See [Does using DuploCloud's AI result in additional LLM costs?](#does-using-duploclouds-ai-result-in-additional-llm-costs-on-top-of-the-platform-fee) for details.
* **Your existing workloads** — DuploCloud deploys into a dedicated namespace and does not affect the cost of services already running in your account.
* **DuploCloud platform fee** — the figures above are cloud infrastructure costs only; the DuploCloud subscription fee is separate.

All figures are approximate and based on standard on-demand pricing in major regions. Actual costs vary by region, reserved instance discounts, and existing infrastructure that can be shared.

</details>


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.duplocloud.com/docs/faqs.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.