
AI visibility report
AI visibility report for Factory (Droid) in Autonomous Coding Agents.
Outside the top three on 13 of the 25 prompts buyers actually ask.
Augment Code is cited on 5 of those losses.
Free trial. Setup comes pre-filled for Factory (Droid).
Track Factory (Droid) across these prompts daily.
Start free trialStill absent from 97.6% of tracked prompt responses
Top-3 citations across 125 prompt × platform pairs
Peer Ranking
Key Metrics
Platform Breakdown
How to read this. Factory (Droid) appears in 2.4% of tracked prompt responses. Presence is absolute coverage; share of voice is relative citation share; sentiment measures tone only when the brand appears.
Where Factory (Droid) is losing
Prompts where competitors are visible and Factory (Droid) is not.
These prompt-level losses are the first prompts to track and repair.
Where Factory (Droid) is winning1
Which AI coding agents handle context window limitations most gracefully when working across dozens of files in an enterprise codebase?
Avg # 2.0 · 2 platforms
Where Factory (Droid) is losing5
What AI coding agents support bring-your-own LLM provider so a platform team can route through an existing enterprise model contract?
Competitors on 3 platforms
Track this promptWhat agentic coding tools handle long-running tasks reliably — resuming after an interruption rather than starting over from scratch?
Competitors on 2 platforms
Track this promptWhich cloud coding agents integrate with CI pipelines to automatically attempt fixes when a build or test suite fails?
Competitors on 1 platform
Track this promptWhat autonomous coding agents run tasks inside a secure sandbox so a compromised prompt can't affect the host filesystem?
Competitors on 1 platform
Track this promptWhich autonomous coding agents can reliably write and run tests, interpret failures, and self-correct without human intervention?
Competitors on 1 platform
Track this prompt
Track Factory (Droid) daily before the next report refresh.
Track these gapsResearch dossierCapabilities, use cases, sources, reviews, pricing, and FAQ
Overview
Factory (known for its Droids product) is a San Francisco-based startup founded in 2023 by Matan Grinberg and Eno Reyes that builds autonomous AI agents for enterprise software engineering teams. Its flagship platform deploys a coordinated fleet of role-specialized Droids — covering code generation, review, testing, documentation, security, and monitoring — across the full software development lifecycle. Unlike IDE-centric coding assistants, Factory is model-agnostic and interface-agnostic, letting developers invoke Droids from the CLI, IDE, Slack, Linear, browser, or a native desktop application. Enterprise deployment options include on-premise and air-gapped configurations. Droids ranked #1 on Terminal Bench in September 2025. Factory reached unicorn status at a $1.5 billion valuation after its April 2026 Series C and reports hundreds of thousands of daily active developers at customers including Nvidia, Adobe, EY, and Palo Alto Networks.
Factory is an agent-native software development platform that deploys autonomous AI agents called Droids across the full software development lifecycle. Droids are model-agnostic and multi-surface, accessible via CLI, IDE, Slack, Linear, browser, and a native desktop application. The platform uses a coordinator-Droid architecture in which a coordinator agent decomposes tickets and tasks into role-scoped sub-agents — covering code generation, code review, testing, documentation (AutoWiki), security review, and knowledge management — each operating with clear boundaries in sandboxed cloud environments (Droid Computers). Missions enable long-horizon multi-agent workflows that run autonomously in the background for days or weeks. Factory integrates deeply with the enterprise toolchain (GitHub, GitLab, Jira, Linear, Slack, Datadog, Sentry, MCP) and offers enterprise-grade deployment including on-premise, hybrid, and air-gapped options with SOC 2 Type II compliance.
Key Facts
- Founded
- 2023
- HQ
- San Francisco, USA
- Founders
- Matan Grinberg, Eno Reyes
- Funding
- ~$220M
- Customers
- hundreds of thousands of daily active de
- Valuation
- $1.5B
- Status
- Private
Target users
Key Capabilities10
- Autonomous multi-agent Droids with role separation (code, review, test, docs, knowledge, project, security)
- Full SDLC pipeline automation: triage, plan, execute, validate, ship, document, monitor
- Model-agnostic LLM routing across GPT-5, Claude, Gemini, and open-weight models
- Multi-surface access: CLI, IDE, Slack, Linear, browser, and Factory Desktop app
- Missions for long-horizon, multi-step, multi-agent autonomous workflows
- Droid Computers: Factory-managed cloud sandboxes for remote autonomous task execution
- Session compaction engine preserving long-running context across multi-week sessions
- Coordinator-Droid architecture decomposing tickets into specialized agent workflows
- Enterprise-grade security: SOC 2 Type II, on-premise deployment, Zero Data Retention, encryption key management
- AutoWiki: automated living documentation generated and updated on every code push
Key Use Cases8
- Legacy codebase migration and modernization at enterprise scale
- Automated code review and PR validation across large distributed teams
- Automated test suite creation and maintenance
- Incident response acceleration and on-call AI-assisted diagnostics
- Long-horizon agentic task execution (multi-day background Missions)
- Technical documentation generation (AutoWiki) and codebase knowledge management
- STRIDE-based automated security review on every pull request
- CI/CD pipeline automation and release validation
Factory (Droid) customer outcomes
40% reduction in incident response time; 50% reduction in PR-to-approval cycle time
Fintech company adopted Factory's Review Droid and platform to automate incident response, code review, and cross-timezone QA. Droids provided AI-assisted diagnostics and automated PR review across multiple platforms.
31x faster feature delivery; 96.1% shorter migration times; 95.8% reduction in on-call resolution times
Enterprise engineering organizations globally rolled out Factory's Droids platform across the SDLC, reporting dramatic improvements in feature delivery velocity and migration timelines.
80 packages built in a single two-week session across 6 repositories
Staff engineer ran a continuous two-week Droid session across 6 repositories using Factory's compaction engine, building secure open-source packages without context loss across brainstorming, design, and production phases.
Multi-day debugging investigation collapsed to an afternoon; Droid Missions running autonomously 24/7 for open-source co
AI search infrastructure company consolidated multiple coding tools onto Factory as its single engineering operating system, using Droid Missions for 24/7 background growth work and Droids for automated code review and complex debugging.
Recent Trend
How AI describes Factory (Droid)
No concise AI response excerpt is available for this brand yet.
Most cited sources2
Alternatives in Autonomous Coding Agents6
Factory positions itself as the enterprise-grade, model-agnostic alternative to single-IDE or single-model AI coding tools.
- Its core differentiation is 'agent-native development' — replacing line-by-line coding assistance with a coordinated fleet of role-scoped autonomous Droids that cover the full SDLC (triage, code generation, review, testing, documentation, release, monitoring), operable from any interface (CLI, IDE, Slack, Linear, browser, Desktop app).
- Factory explicitly targets large enterprises with legacy codebases and compliance requirements, where GitHub Copilot and Cursor's individual developer focus creates unmet demand.
- Its multi-LLM routing, sovereign deployment options (SaaS, hybrid, on-premise, air-gapped), and Terminal Bench #1 benchmark ranking underpin its enterprise differentiation claim.
- AAugment Code#19
- AAnthropic (Claude Code)#23
- Block (Goose)#33

- OpenAI (Codex CLI / Codex)#43

- CCursor (Anysphere)#62
- WWarp#72
Reviews
Praised
- Best-in-class CLI session navigation and sub-agent task delegation
- Session compaction preserving context across multi-week sessions
- Model-agnostic flexibility across GPT-5, Claude, and Gemini
- Multi-surface access (Desktop, CLI, IDE, Slack, browser)
- Terminal Bench #1 autonomous task execution ranking
- Enterprise-grade security with SOC 2 Type II and on-premise deployment
- Deep workflow integrations with GitHub, Jira, Linear, Slack, and Datadog
- Full SDLC coverage beyond code generation (review, test, docs, monitoring)
Criticized
- No free tier creates evaluation barrier vs. Cursor and Claude Code
- Very limited third-party review evidence for enterprise procurement decisions
- Significant price jump from Pro ($20) to Plus ($100) and Max ($200)
- Requires mature engineering practices (CI, tests, repo hygiene) for reliable output
- Agents can hallucinate logic or miss edge cases on complex or vague codebases
- Unpredictable token-based billing at scale on higher-tier plans
- MCP support for cloud agents not yet available as of early 2026
- Complex initial setup for organizations with custom or non-standard toolchains
Third-party review data for Factory is sparse given the company's September 2025 general availability launch and enterprise-focused go-to-market. The single G2 review (5.0/5, April 2026) praises CLI experience, session navigation, and sub-agent task delegation, noting MCP support for cloud agents was still absent at time of writing. The AI Agent Index assigns an editorial score of 4.3/5, crediting multi-surface Droid architecture, Droid Computers, and enterprise security posture while flagging the no-free-tier evaluation barrier and thin peer review evidence. Analyst and press coverage is broadly positive, emphasizing Terminal Bench #1 ranking, six consecutive months of month-over-month revenue doubling, and enterprise customer validation at Nvidia, Adobe, and EY. Critical practitioner commentary notes the gap between benchmark performance and real-world codebase complexity, and cautions that teams with weak engineering hygiene may see less reliable output.
Pricing
Five-tier subscription model billed monthly. Pro at $20/month covers complete software development agents for individuals (Desktop, CLI, SDK, cloud and local agents, usage dashboard). Plus at $100/month adds approximately 5x usage and Droid Computers (Factory-managed cloud sandboxes). Max at $200/month adds approximately 10x usage and early feature access. Teams (custom pricing, up to 150 seats) adds SSO, SAML/SCIM, Zero Data Retention, dedicated onboarding, and basic admin controls. Enterprise (custom, unlimited seats) adds dedicated compute with partitioned inference pool, audit logging, on-premise deployment, full encryption and data residency controls, and a dedicated account manager with priority SLAs. BYOK (Bring Your Own Key) is supported. Factory does not publicly disclose a free tier.
Limitations
- No free tier exists; the minimum entry point is $20/month (Pro), creating evaluation friction versus competitors like Cursor and Claude Code that offer free tiers.
- The Plus ($100/month) and Max ($200/month) tiers represent significant pricing steps for individual users with heavy workloads.
- Third-party review evidence is extremely limited for an enterprise product: only 1 G2 review as of mid-2026.
- The platform works best with mature engineering practices (strong test coverage, CI/CD, ticket hygiene); teams with weak repo hygiene or vague prompts may see lower-quality agent output.
- Agents can hallucinate logic, miss edge cases, or produce inefficient solutions on complex or ambiguous codebases.
- Token-based billing on higher-tier plans can produce unpredictable costs for large-context or long-running tasks.
- MCP support for cloud agents was flagged as not yet available as of April 2026.
- Initial setup requires substantial effort for organizations with custom toolchains.
Frequently asked questions
Topic coverageCoverage by buyer topic
Topic Coverage
Prompt-Level Results
| Prompt | |||||
|---|---|---|---|---|---|
Capability0/5 cited (0%) | |||||
What AI coding agents handle multi-repo tasks well — making coordinated changes across a frontend and backend repo in a single session? | |||||
Which autonomous coding agents can reliably write and run tests, interpret failures, and self-correct without human intervention? | |||||
I'm looking for an agentic CLI that supports tool use like web search and shell execution during a coding task — what are my options? | |||||
What autonomous coding tools handle legacy codebases in dynamically typed languages best — Python 2 or older PHP specifically? | |||||
Which cloud coding agents are best for generating and merging pull requests asynchronously without a developer staying in the loop? | |||||
Developer Experience1/5 cited (20%) | |||||
Which autonomous coding agents give the best real-time feedback loop when running multi-step tasks so developers stay in control? | |||||
Which agentic IDEs have the smoothest experience for reviewing and approving AI-generated changes before they touch the main branch? | |||||
What AI coding agents do senior engineers prefer for refactoring large codebases without babysitting every intermediate step? | |||||
Which AI coding agents handle context window limitations most gracefully when working across dozens of files in an enterprise codebase? | |||||
What autonomous coding tools are best suited for a solo developer who wants to delegate routine feature work and focus on architecture? | |||||
Integrations & Ecosystem1/5 cited (20%) | |||||
Which cloud coding agents integrate with CI pipelines to automatically attempt fixes when a build or test suite fails? | |||||
Which autonomous coding agents integrate natively with popular code editors so devs can trigger agent tasks without leaving their IDE? | |||||
What AI coding agents support bring-your-own LLM provider so a platform team can route through an existing enterprise model contract? | |||||
Which agentic coding platforms integrate with project management tools so engineers can assign tickets directly to an AI agent to action? | |||||
What autonomous coding tools have the best ecosystem of community plugins for extending agent capabilities with custom tools and workflows? | |||||
Performance & Reliability0/5 cited (0%) | |||||
What autonomous coding agents run tasks inside a secure sandbox so a compromised prompt can't affect the host filesystem? | |||||
Which autonomous coding agents are most cost-efficient for high-volume use — minimising frontier LLM provider token spend per merged PR? | |||||
Which cloud coding agents have the best uptime and task success rates for a mid-size team running dozens of concurrent agent jobs daily? | |||||
Which AI coding agents complete multi-file tasks fastest without sacrificing correctness — benchmarks or real-world comparisons? | |||||
What agentic coding tools handle long-running tasks reliably — resuming after an interruption rather than starting over from scratch? | |||||
Setup & First Run0/5 cited (0%) | |||||
What are the best agentic IDEs for a team migrating from a traditional code editor that want AI-assisted multi-file editing from day one? | |||||
Which agentic CLI tools work out of the box on popular operating systems without requiring a container sandbox just to get started? | |||||
Which cloud coding agents can be connected to an existing private repo and start opening pull requests with minimal setup? | |||||
What's the easiest AI coding agent to get running locally on a large existing TypeScript monorepo without hours of configuration? | |||||
I'm evaluating autonomous coding agents for a 10-person startup — which ones can a new engineer get productive with in under an hour? | |||||
Turn this matrix into daily prompt monitoring.
Track prompt changesVertical Ranking
| # | Brand | PresencePres. | Share of VoiceSoV | DocsDocs | BlogBlog | MentionsMent. | Avg PosPos | Sentiment |
|---|---|---|---|---|---|---|---|---|
| 1 | Augment Code | 8.8% | 32.7% | 0.0% | 0.0% | 8.0% | #7.2 | +0.21 |
| 2 | Anthropic (Claude Code) | 3.2% | 12.7% | 0.0% | 0.0% | 3.2% | #3.9 | +0.35 |
| 3 | Block (Goose) | 3.2% | 12.7% | 0.0% | 0.0% | 3.2% | #4.9 | +0.54 |
| 4 | OpenAI (Codex CLI / Codex) | 3.2% | 10.9% | 0.8% | 0.0% | 2.4% | #7.7 | +0.25 |
| 5 | Factory (Droid) | 2.4% | 10.9% | 0.0% | 0.0% | 1.6% | #4.7 | +0.60 |
| 6 | Cursor (Anysphere) | 2.4% | 5.5% | 0.8% | 0.8% | 2.4% | #16.7 | +0.27 |
| 7 | Warp | 1.6% | 3.6% | 1.6% | 0.0% | 1.6% | #4.0 | +0.30 |
| 8 | All Hands AI (OpenHands) | 0.8% | 5.5% | 0.0% | 0.0% | 0.8% | #2.0 | +0.70 |
| 9 | OpenCode | 0.8% | 1.8% | 0.0% | 0.0% | 0.8% | #2.0 | +0.60 |
| 10 | Cognition (Devin) | 0.8% | 1.8% | 0.8% | 0.0% | 0.8% | #3.0 | +0.80 |
| 11 | Aider AI | 0.8% | 1.8% | 0.0% | 0.0% | 0.8% | #27.0 | +0.00 |
| 12 | Amp | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | — | — |
| 13 | Cline Bot Inc. | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | — | — |
| 14 | Lovable | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | — | — |
| 15 | Replit (Agent 3) | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | — | — |
| 16 | Roo Code (Roomote) | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | — | — |
| 17 | StackBlitz (Bolt.new) | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | — | — |
Turn this into your team dashboard
Sign up to unlock project-level analytics, daily tracking, actionable insights, custom prompt configurations, adoption tracking, AI traffic analytics and more.
Free trial. Setup comes pre-filled from this report.