v2.0 · Now Live · Google Workspace SSO · 40+ Integrations

Infravox AI Documentation

Your autonomous SRE engineer — a unified platform that monitors, diagnoses, and auto-fixes infrastructure 24/7 using five specialist AI agents.

30-MINUTE ONBOARDING PATH

Follow this guided path to go from zero to fully autonomous. Each step links to a detailed how-to guide.

Quick Start

Connect your infrastructure and have Infravox AI monitoring in under 5 minutes.

1. Sign up & log in
Visit infravox.ai/login and create your account. Sign in with Google Workspace SSO (@infravox.ai) or use email/password. New users can start on the free Starter tier, with no credit card required.
2. Connect your first integration
Go to Settings → Integrations and connect your cloud provider (AWS, Azure, GCP) or Kubernetes cluster. This takes under 2 minutes.
3. Add a Git repository
Under Code Review → Connect Repo, link GitHub, GitLab, Bitbucket, or Azure Repos to enable AI PR reviews and code scanning.
4. Watch AI agents activate
Within minutes, the SRE, DevOps, Security, and FinOps agents begin scanning your stack and surface the first insights on your dashboard.
5. Enable Auto-Remediation
Go to Settings → Auto-Fix and toggle on auto-remediation for safe actions like pod restarts and rollbacks. All actions are logged with full audit trails.
Tip
You can connect as many integrations as your plan allows. The Unlimited plan supports unlimited integrations and repos. Start with the free Starter tier — no credit card needed.

How It Works

Every alert goes through a 6-step autonomous pipeline before you're notified — or before it auto-resolves.

01. Signal Detection
Metrics, logs, and traces are ingested continuously. Anomalies trigger within 60 seconds.
02. AI Triage
The AI Orchestrator dispatches the right specialist agent based on signal type.
03. Root Cause
The agent correlates logs, deployments, and topology to determine the root cause with 94% confidence.
04. Fix Proposal
A ranked list of safe, reversible fixes is generated: restart, rollback, scale, or patch.
05. Auto-Execute
With auto-fix enabled, the safest action runs immediately and is verified. Every action is recorded in a full audit trail.
06. Resolution
The incident closes, a postmortem is generated, and learnings update the AI model.
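The triage step (02) can be pictured as a lookup from signal type to specialist agent. The sketch below is only an illustration: the signal-type names, the `ROUTES` table, the `dispatch` function, and the fallback to the SRE Agent are all hypothetical and are not taken from the Infravox product.

```python
# Hypothetical sketch of step 02 (AI Triage): route a signal to a specialist agent.
# The signal-type names and the routing table are assumptions for illustration.
ROUTES = {
    "metric_anomaly": "SRE Agent",
    "deploy_failure": "DevOps Agent",
    "vulnerability": "Security Agent",
    "cost_spike": "FinOps Agent",
    "pull_request": "Git Agent",
}


def dispatch(signal_type: str) -> str:
    """Return the agent for a signal type; unknown types fall back to the SRE Agent (assumed)."""
    return ROUTES.get(signal_type, "SRE Agent")


print(dispatch("deploy_failure"))  # DevOps Agent
```

A real orchestrator would presumably weigh more than one field of the signal; a static table is just the simplest form of the idea.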

AI Agents

Five specialist agents collaborate through an AI Orchestrator to handle every operational task autonomously.

SRE Agent (Core)
Detects anomalies, runs root cause analysis, manages incident lifecycle end-to-end.
DevOps Agent (Core)
Monitors CI/CD pipelines, detects deploy failures, triggers rollbacks automatically.
Security Agent (Core)
Scans configs, IaC, code, and runtime for vulnerabilities. Blocks risky PRs.
FinOps Agent (Core)
Identifies idle resources, over-provisioned nodes, and unused reservations.
Git Agent (New)
Reviews PRs, runs SAST scanning, monitors pipelines across GitHub, GitLab, Bitbucket, Azure Repos.
AI Orchestrator (Core)
Routes signals to the right agent, combines outputs, resolves conflicts, produces final answers.

SRE Agent

Your 24/7 site reliability engineer. Monitors all services and resolves incidents in seconds.

Avg detection time: < 60s
RCA confidence: 94%
Auto-resolve rate: 78%
Avg MTTR: 94s

The SRE Agent continuously ingests metrics, logs, and traces from all connected integrations. When an anomaly is detected it immediately:

Correlates the signal with recent deployments, config changes, and topology
Queries the log intelligence engine for error patterns in the last 5 minutes
Scores potential root causes using ML-trained models from historical incidents
Generates a blast-radius analysis to understand downstream impact
Proposes a ranked list of safe, reversible fix actions
Executes the fix (if auto-remediation is enabled) and verifies resolution
Note
The SRE Agent integrates with PagerDuty, Opsgenie, Slack, and Microsoft Teams. When a fix is auto-applied, the on-call alert is automatically resolved and a postmortem is generated.
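The blast-radius step in the list above can be sketched as a breadth-first walk over a service dependency graph. This is a minimal illustration under stated assumptions: the `DEPENDENTS` map, the service names, and the `blast_radius` function are hypothetical, and the source does not say which traversal the SRE Agent uses.

```python
from collections import deque

# Hypothetical dependency map: service -> services that call it (its dependents).
DEPENDENTS = {
    "postgres": ["orders", "users"],
    "orders": ["api-gateway"],
    "users": ["api-gateway"],
    "api-gateway": [],
}


def blast_radius(failing: str, dependents: dict[str, list[str]]) -> list[str]:
    """Breadth-first walk collecting every service downstream of `failing`."""
    seen = {failing}
    queue = deque([failing])
    impacted = []
    while queue:
        current = queue.popleft()
        for service in dependents.get(current, []):
            if service not in seen:
                seen.add(service)
                impacted.append(service)
                queue.append(service)
    return impacted


print(blast_radius("postgres", DEPENDENTS))  # ['orders', 'users', 'api-gateway']
```

Services are returned in order of distance from the failure, which is one plausible way to prioritise who gets notified first.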

DevOps Agent

Monitors every deployment, detects pipeline failures, and triggers automatic rollbacks.

Connected to GitHub Actions, GitLab CI, Jenkins, ArgoCD, CircleCI, and Azure Pipelines, the DevOps Agent watches every pipeline run in real time. It correlates new deployments with system health metrics — and rolls back automatically if a regression is detected within the first 5 minutes of a release.

# Example: automatic rollback trigger event
event: deployment.regression_detected
service: api-gateway
deploy: v2.4.1 → rollback to v2.4.0
reason: error_rate +340% within 90s of deploy
action: kubectl rollout undo deployment/api-gateway
status: RESOLVED in 47s
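The rollback decision described above can be sketched as a simple threshold check inside the post-release window. The 300-second window comes from the "first 5 minutes" in the text; the function name `should_rollback`, its parameters, and the 100% increase threshold are assumptions made for illustration, not documented Infravox settings.

```python
def should_rollback(
    baseline_error_rate: float,
    current_error_rate: float,
    seconds_since_deploy: float,
    window_s: float = 300.0,          # "first 5 minutes of a release"
    max_increase_pct: float = 100.0,  # assumed threshold, not from the docs
) -> bool:
    """True when the error rate rose by more than max_increase_pct within the window."""
    # A zero baseline would need separate handling; this sketch simply skips it.
    if seconds_since_deploy > window_s or baseline_error_rate <= 0:
        return False
    increase_pct = (current_error_rate - baseline_error_rate) / baseline_error_rate * 100
    return increase_pct > max_increase_pct


# Mirrors the example event: error rate up 340% (0.5% to 2.2%) at 90s after deploy.
print(should_rollback(0.5, 2.2, 90))  # True
```

The same regression observed ten minutes after the release would return False here, since it falls outside the rollback window.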

Incident Management

From detection to resolution — fully automated. Average time to resolve: 94 seconds.

Proactive Detection
Anomaly detection across metrics, logs, and traces. Alerts before users notice.
AI Root Cause Analysis
94% confidence diagnosis using ML models trained on millions of incidents.
AI War Room (Popular)
Real-time collaborative incident bridge with AI providing live guidance.
Auto-Remediation (Core)
Safe, verified auto-fix actions: restart pods, rollback deploys, scale services.
Postmortems
AI-generated postmortems with timeline, root cause, and preventative actions.

Root Cause Analysis

ML-powered diagnosis with 94% confidence across infrastructure, code, and dependencies.

Every alert triggers a full RCA pipeline. The AI correlates signals across your entire topology — services, databases, cloud resources, and recent code changes — to identify root cause with high confidence.

Tip
Infravox AI uses a combination of log embedding similarity, deployment diff analysis, and topology graph traversal to attribute incidents. You can see the full reasoning chain in each incident's detail view.
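To make "log embedding similarity" concrete, the sketch below matches a new incident's log embedding against past incidents. It is a hedged illustration: the source does not name a similarity metric, so cosine similarity is an assumption, and the three-dimensional vectors, the `PAST_INCIDENTS` table, and both function names are hypothetical.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Hypothetical embeddings of past incidents, keyed by their known root cause.
PAST_INCIDENTS = {
    "db-connection-pool exhaustion": [0.9, 0.1, 0.0],
    "bad deploy": [0.1, 0.9, 0.2],
}


def closest_root_cause(embedding: list[float]) -> str:
    """Return the past root cause whose embedding is most similar to the new one."""
    return max(
        PAST_INCIDENTS,
        key=lambda cause: cosine_similarity(embedding, PAST_INCIDENTS[cause]),
    )


print(closest_root_cause([0.8, 0.2, 0.1]))  # db-connection-pool exhaustion
```

In practice this signal would be only one input, combined with the deployment diff and topology evidence the tip mentions.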

Auto-Remediation

The killer feature. Alert → diagnose → fix → verify, in 94 seconds on average.

1. Alert fires
A metric threshold, log pattern, or anomaly detection triggers the pipeline.
2. AI analyzes
The Orchestrator dispatches the relevant agent. Logs, metrics, and deployment history are correlated.
3. Fix proposed
A ranked list of safe, reversible actions is generated: restart, rollback, scale, or patch.
4. Auto-execute
With auto-fix enabled, the top-ranked safe action executes via MCP tools (kubectl, AWS SDK, etc.).
5. Verify & close
The agent waits for health checks to pass, then closes the incident and notifies the team.
Warning
Auto-remediation only executes pre-approved action types. Destructive actions (delete, scale-to-zero, production config changes) always require human confirmation regardless of settings.
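The rule in the warning above can be expressed as a small policy gate. This is a sketch under assumptions: the action identifiers, the two sets, the return labels, and the `decide` function are hypothetical names chosen for illustration; only the behaviour (destructive actions always need a human, pre-approved actions run only when auto-fix is on) comes from the text.

```python
# Hypothetical action identifiers; the categories follow the warning above.
PRE_APPROVED = {"restart_pod", "rollback_deploy", "scale_up"}
DESTRUCTIVE = {"delete", "scale_to_zero", "production_config_change"}


def decide(action: str, auto_fix_enabled: bool) -> str:
    """Classify how a proposed fix may proceed."""
    if action in DESTRUCTIVE:
        # Destructive actions require confirmation regardless of settings.
        return "needs_human_confirmation"
    if auto_fix_enabled and action in PRE_APPROVED:
        return "auto_execute"
    # Anything else is only proposed to the on-call engineer.
    return "propose_only"


print(decide("restart_pod", auto_fix_enabled=True))     # auto_execute
print(decide("scale_to_zero", auto_fix_enabled=True))   # needs_human_confirmation
print(decide("restart_pod", auto_fix_enabled=False))    # propose_only
```

Checking the destructive set first means a misconfigured allow-list cannot accidentally approve a delete.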

Observability

Full-stack visibility across metrics, logs, traces, and Kubernetes — all in one place.

Live Dashboards
Real-time health score, KPI cards, cost savings, and AI insights panel.
Log Intelligence
AI-powered log search, clustering, anomaly detection, and noise reduction.
Metrics & Traces
Prometheus-compatible metrics, distributed traces (OpenTelemetry), Tempo.
Kubernetes Monitoring
Pod health, deployments, namespaces, RBAC, events, for any K8s cluster.
Alerting & On-Call
Smart alert routing to Slack, PagerDuty, Opsgenie, Teams with auto-escalation.

Log Intelligence

AI-powered log analysis that turns noise into insight.

Infravox ingests logs from Loki, ELK Stack, ClickHouse, CloudWatch, and GCP Logging. The Log Intelligence engine clusters similar log lines, detects anomalies in frequency or content, and surfaces the relevant context for each incident automatically.

// Example: AI surfaces the relevant log cluster automatically
ERROR api-gateway db-connection-pool timeout (47 occurrences)
WARN api-gateway upstream latency p99 = 3.4s
INFO ai-sre-agent root cause identified — auto-fix queued
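One simple way to cluster similar log lines, as described above, is to mask the variable parts of each line and count the resulting templates. The sketch below uses that technique as a stand-in; the source does not say how the Log Intelligence engine clusters lines, and the sample log lines, `template`, and `cluster` are made up for illustration.

```python
import re
from collections import Counter


def template(line: str) -> str:
    """Mask numbers so lines that differ only in values share one template."""
    return re.sub(r"\d+(\.\d+)?", "<n>", line)


def cluster(lines: list[str]) -> Counter:
    """Count how many raw lines fall into each template."""
    return Counter(template(line) for line in lines)


# Made-up sample lines in the style of the example above.
logs = [
    "ERROR api-gateway db-connection-pool timeout after 30s",
    "ERROR api-gateway db-connection-pool timeout after 31s",
    "WARN api-gateway upstream latency p99 = 3.4s",
]

for tmpl, count in cluster(logs).most_common():
    print(count, tmpl)
```

The two timeout lines collapse into a single template with a count of 2, which is the kind of "(N occurrences)" summary shown in the example.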

Git & Code Intelligence

AI-powered PR reviews, SAST scanning, and pipeline monitoring across all major Git platforms.

AI PR Reviews (Popular)
Automated code review with security, performance, and quality feedback on every PR.
SAST Code Scanning
Static analysis for secrets, CVEs, injection flaws, and insecure patterns.
Pipeline Monitoring
CI/CD health, failure analysis, flaky test detection, build time trends.
GitHub (Native)
Actions, Advanced Security, Dependabot: full GitHub integration.
GitLab (Native)
CI pipelines, MRs, GitLab SAST, runner monitoring.
Bitbucket (Native)
Pipelines, pull requests, Bitbucket Cloud & Server.
Azure Repos (Native)
Azure DevOps PRs, pipelines, Azure Boards work-item linking.
Note
The Git Agent uses a combination of LLM-powered code understanding and traditional SAST pattern matching. Critical security findings automatically block merges when merge protection is enabled.

SAST Code Scanning

Detect security issues before they hit production.

Every PR triggers a full static analysis scan. Results are posted as inline code comments with severity ratings, remediation guidance, and links to relevant CVE records.

Secrets & API keys
SQL / Command injection
CVE dependency scan
IaC misconfigurations
Insecure crypto usage
OWASP Top 10 patterns
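The "Secrets & API keys" check in the list above is commonly built on pattern matching. The sketch below shows the idea with two illustrative rules; the `RULES` table and `scan` function are hypothetical and do not represent Infravox's rule set. The sample key is AWS's published documentation example, not a real credential.

```python
import re

# Illustrative rules only; real scanners ship far larger rule sets.
RULES = {
    "aws_access_key_id": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "private_key_header": re.compile(r"-----BEGIN (?:RSA |EC )?PRIVATE KEY-----"),
}


def scan(source: str) -> list[tuple[int, str]]:
    """Return (line_number, rule_name) for every rule that matches a line."""
    findings = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        for name, pattern in RULES.items():
            if pattern.search(line):
                findings.append((lineno, name))
    return findings


sample = 'region = "us-east-1"\naws_key = "AKIAIOSFODNN7EXAMPLE"\n'
print(scan(sample))  # [(2, 'aws_access_key_id')]
```

Reporting the line number is what lets a scanner post its result as an inline comment on the pull request.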

Security

Continuous security posture management across your entire infrastructure and code.

Security Overview
Risk score dashboard, open findings, compliance status, trend charts.
RBAC & Policies (Auto)
Audit Kubernetes RBAC, IAM roles, and access policies for over-permissions.
Compliance
Automated checks for SOC 2, HIPAA, GDPR, ISO 27001, PCI-DSS, CIS Benchmarks.
Secret Scanning
Detect secrets committed to Git repos or present in running container envs.
IaC Security
Terraform, CloudFormation, Helm chart scanning for misconfigurations.
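As an illustration of what an RBAC over-permission audit can look for, the sketch below flags Kubernetes role rules that use the `*` wildcard for verbs or resources. The `rules`, `verbs`, and `resources` fields follow the Kubernetes Role and ClusterRole schema; the `over_permissive_rules` function and the sample role are hypothetical, and wildcard detection is only one of many possible checks.

```python
def over_permissive_rules(role: dict) -> list[dict]:
    """Return the rules of a Role/ClusterRole that grant wildcard verbs or resources."""
    return [
        rule
        for rule in role.get("rules", [])
        if "*" in rule.get("verbs", []) or "*" in rule.get("resources", [])
    ]


# Hypothetical ClusterRole, as it would look after parsing the manifest into a dict.
role = {
    "kind": "ClusterRole",
    "metadata": {"name": "ci-deployer"},
    "rules": [
        {"apiGroups": ["apps"], "resources": ["deployments"], "verbs": ["get", "patch"]},
        {"apiGroups": ["*"], "resources": ["*"], "verbs": ["*"]},
    ],
}

print(len(over_permissive_rules(role)))  # 1
```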

Compliance

Automated compliance checks across six major frameworks.

SOC 2 Type II: 87 checks
HIPAA: 64 checks
GDPR: 42 checks
ISO 27001: 114 checks
PCI-DSS: 78 checks
CIS Benchmarks: 200+ checks
Tip
Compliance reports can be exported as PDF or JSON and are updated continuously as your infrastructure changes. Available on the Unlimited and Enterprise plans.

Cost Optimization

The FinOps Agent identifies waste and typically saves teams $2K–$15K per month.

Cost Dashboard
Real-time cloud spend across AWS, Azure, and GCP with trend and forecast.
Rightsizing (Saves $)
CPU & memory rightsizing recommendations for EC2, AKS, GKE, and EKS nodes.
Idle Resource Cleanup
Detect and remove unattached EBS volumes, idle load balancers, unused IPs.
Budget Alerts
Set monthly budgets per service or team. Alert when projected spend exceeds the budget.
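A budget alert on projected spend needs some forecast of the month-end total. The sketch below uses a plain run-rate extrapolation, which is an assumption: the source does not describe the forecasting method, and `projected_month_spend` and `over_budget` are hypothetical names.

```python
import calendar
from datetime import date


def projected_month_spend(spend_to_date: float, today: date) -> float:
    """Extrapolate month-end spend from the average daily spend so far."""
    days_in_month = calendar.monthrange(today.year, today.month)[1]
    return spend_to_date / today.day * days_in_month


def over_budget(spend_to_date: float, today: date, budget: float) -> bool:
    """True when the projected month-end spend exceeds the monthly budget."""
    return projected_month_spend(spend_to_date, today) > budget


# $4,000 spent by 10 April (a 30-day month) projects to $12,000.
print(projected_month_spend(4000.0, date(2026, 4, 10)))  # 12000.0
print(over_budget(4000.0, date(2026, 4, 10), budget=10000.0))  # True
```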
Average customer savings
Saved per month: $8,340
ROI on Unlimited plan: 21×
Eng. hours reclaimed: 40h
MTTR improvement: 60%
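The ROI figure appears to follow from the average monthly savings divided by the Unlimited plan price of $399 per month listed under "Choose your plan". That reading is an inference rather than something the page states, and the `roi_multiple` helper is hypothetical.

```python
def roi_multiple(monthly_savings: float, monthly_price: float) -> float:
    """Savings expressed as a multiple of the plan price."""
    return monthly_savings / monthly_price


roi = roi_multiple(8340, 399)
print(f"{roi:.1f}x")  # 20.9x, which rounds to the quoted 21x
```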

Integrations

40+ native integrations. Connect your full stack in minutes.

Cloud Providers
Amazon Web Services (EC2, EKS, RDS, Lambda, CloudWatch, Cost Explorer)
Microsoft Azure (AKS, Azure Monitor, Azure Repos, Cost Management, Defender)
Google Cloud Platform (GKE, Cloud Logging, Cloud Monitoring, BigQuery)
Kubernetes (any cluster — EKS, AKS, GKE, on-prem)
CI/CD & Deploy
GitHub Actions
GitLab CI
Jenkins
ArgoCD
CircleCI
Azure Pipelines
Terraform
Helm
Messaging & Alerts
Slack
Microsoft Teams
PagerDuty
Opsgenie
Email webhooks
Data & Storage
PostgreSQL
Redis
Apache Kafka
HashiCorp Vault
ClickHouse
Loki
Prometheus

How-To Guides

Step-by-step guides for setting up Infravox, connecting integrations, and getting autonomous in 30 minutes.

Getting Started: Install Infravox in 5 Minutes (3 min read) [START HERE]
Getting Started: Connect Your First Cloud (5 min read)
Kubernetes: Connect Kubernetes Cluster (8 min read) [POPULAR]
AI Agents: Configure AI Agent Permissions (5 min read)
Getting Started: Run Your First AI Scan (4 min read)
CLI: Install & Authenticate the CLI (3 min read) [NEW]
Incident Response: Manage Active Incidents (7 min read)
Incident Response: Create AI Runbooks (5 min read)
Prefer the terminal?
Full platform access from the CLI. Same auth, same data.
npm install -g @infravox/cli

Get Started with Infravox AI

Set up your autonomous AI SRE platform in under 30 minutes. No agents to install. No rip-and-replace.

01. Create your account
Sign up free with your Google Workspace or work email. No credit card required.
02. Connect your stack
Link your cloud provider, Kubernetes cluster, CI/CD pipelines, and alerting tools in minutes.
03. AI starts monitoring
Infravox AI agents begin watching your infrastructure 24/7: logs, metrics, deployments, and security.
04. Get value on day one
Receive your first AI insight, cost-saving opportunity, or incident resolution within hours of connecting.

Choose your plan

Starter (Free): 1 cloud account, basic AI monitoring, community support, core dashboards
Professional ($149 / mo, most popular): full AI monitoring, auto-remediation, Slack + PagerDuty, Git intelligence
Unlimited ($399 / mo): unlimited accounts, Security & FinOps AI, priority support, custom AI rules
Enterprise (Custom): SSO + RBAC + SAML, custom AI agents, dedicated SLA, on-prem option
Tip
Need a custom plan or enterprise pricing? Reach out directly at shivamsingh@infravox.ai for a tailored demo and commercial discussion.

Start today

Free tier. No credit card. Live in 30 minutes.

Start Free
Sign up with your Google Workspace or work email. Starter tier is free — no credit card required.
Book a Demo
See Infravox AI in action with a live walkthrough tailored to your infrastructure stack.
Tip
Already have an account? Sign in at infravox.ai/login. For enterprise access, SSO, or custom pricing, contact shivamsingh@infravox.ai.
Join the Infravox Community on Slack
Ask questions, share feedback, get help from the team and other users. We're active daily.
Channels: #general, #help, #feature-requests, #announcements
© 2026 Infravox AI