insight

AI is writing more code than teams can review: should you trust AI companion platforms?

Engineers across the industry are reporting AI generating code faster than humans can review it. The implication for AI companion users is uncomfortable: the platforms storing your most intimate conversations may be running unreviewed AI-generated code.

May 8, 2026 · 8 min read

Affiliate disclosure: Some of the links in this article are affiliate links. We may earn a commission if you sign up for a platform through these links, at no additional cost to you. This doesn't influence our editorial verdicts. Full disclosure →

Industry reports from Stack Overflow and research from MIT's CSAIL have documented this trend. A pattern showing up across software engineering communities should concern anyone using AI companion platforms with sensitive personal data. Engineers are reporting that AI tools are generating code faster than human reviewers can evaluate it. Pull request volumes are up. Individual changes are smaller but more frequent. Senior engineers are becoming bottlenecks because system context is required to evaluate AI-generated changes properly.

A Reddit user described the pattern: "Since we started using AI coding tools, the volume of PRs has gone up a lot. The individual changes are smaller but there are way more of them. The review process hasn't kept up. Senior engineers end up being the bottleneck because a lot of the AI generated code needs someone with full system context to evaluate properly."

This is a software engineering management problem on the surface. Underneath, it's a user safety problem for any platform handling sensitive data, and AI companion platforms specifically handle some of the most sensitive data users generate: intimate conversations, mental health disclosures, sexual content, relationship details, identity-confirming information.

The question users should ask: is the platform you're trusting with this data running code that humans actually reviewed?

Why this matters specifically for AI companion platforms

AI companion platforms have several characteristics that intensify the code quality concern:

Small engineering teams. Most AI companion platforms operate with small engineering teams relative to the sensitivity of data they handle. Kindroid is openly acknowledged to have around five unfunded employees. Smaller platforms have fewer. Small teams using AI tools to amplify their output face the code review bottleneck more acutely than larger teams.

Move-fast culture. Companion platforms compete on feature velocity. New character types, new voice features, new image generation, new memory systems. The pressure to ship competes directly with the time required for thorough code review.

High-sensitivity data flow. A bug in a productivity app might lose someone's notes. A bug in an AI companion platform might expose intimate conversations, leak NSFW content, fail privacy isolation between users, or compromise authentication. The blast radius of code defects is larger than the average software product.

Limited security audit infrastructure. Major tech companies have dedicated security engineering teams, regular penetration testing, formal threat modeling. Most AI companion platforms don't. The Mozilla Foundation's Privacy Not Included review of 11 AI companion apps found that all 11 failed basic privacy standards, which suggests the security infrastructure across the category is consistently weak.

Real incidents have happened. The Muah AI breach exposed 1.9 million users including conversation content. The breach didn't happen in isolation; it reflected a pattern of platforms whose engineering investment didn't match the sensitivity of data they handled.

What "thin engineering team plus AI-generated code" means for users

When a small team uses AI tools to ship faster than they can review, several specific risks emerge:

Authentication and authorization bugs. Code that determines who can access whose data is some of the highest-stakes code in any platform. AI-generated authentication code that wasn't thoroughly reviewed can produce vulnerabilities where users accidentally see other users' content, where account isolation fails, or where authentication tokens leak in logs.

Data sanitization failures. Conversation content needs to be properly sanitized before storage, transmission, and display. AI-generated code may handle the happy path correctly but fail in edge cases where sanitization matters most.

Encryption implementation errors. Even platforms that claim encryption can implement it incorrectly. Encryption-at-rest that uses default keys, encryption-in-transit that doesn't actually protect against the attacks it's supposed to protect against, or encryption that gets bypassed in specific code paths.

Content moderation failures. Platforms that promise to moderate certain content (NSFW with minors, illegal content, etc.) need code that actually performs the moderation. AI-generated moderation code that wasn't carefully reviewed can fail in ways that expose the platform to legal liability and users to harmful content.

Logging and monitoring gaps. When breaches happen, logging matters. Platforms with thin engineering teams may have inadequate logging, which means breaches go undetected longer and recovery is harder.

Update and deprecation handling. When platforms upgrade dependencies, deprecate features, or migrate data, the engineering work matters enormously. Replika's 2023 ERP removal showed how feature changes can affect users without warning. The technical infrastructure behind these changes determines whether they're handled gracefully or disastrously.

How to evaluate a platform's engineering investment

Several signals help users assess whether a platform takes engineering quality seriously:

Public security disclosures. Platforms with mature security practices publish vulnerability disclosure policies, security contact information, and acknowledge bugs that researchers find. Platforms with no such infrastructure are either receiving no security research or not handling it transparently.

Engineering team transparency. Platforms that publish engineering blog posts, contribute to open source, or speak at engineering conferences typically have stronger engineering culture than platforms that don't. The visibility correlates with investment.

Bug bounty programs. Active bug bounty programs that pay for legitimate findings indicate platforms that want to know about vulnerabilities and are willing to invest in finding them.

Compliance certifications. SOC 2, ISO 27001, and similar certifications represent real engineering investment. The AICPA's SOC 2 documentation and ISO standards describe what these certifications require. Most consumer AI companion platforms don't have these certifications, which is information about their priorities.

Incident response history. When breaches or bugs happen, how the platform responds matters. Quick disclosure, clear communication, and meaningful remediation distinguish mature platforms from immature ones. Our coverage of AI companion lawsuits and incidents covers some of this history.

Privacy policy quality. Detailed, specific privacy policies that describe actual data handling practices indicate platforms that have thought carefully about data. Vague, generic privacy policies indicate platforms that haven't.

Pocket Animus's safety reviews cover what's publicly known about each major platform's engineering practices. The information is genuinely useful for users making decisions about where to invest emotional and intimate engagement.

The platforms with stronger engineering signals

Among AI companion platforms, several have clearer engineering investment than others:

Replika has been operating since 2017 and has handled multiple platform-level changes (model upgrades, feature deprecations, the ERP controversy) with technical infrastructure that survived the transitions. The company has investors with engineering oversight. Replika's pricing page and feature documentation suggest mature product management.

Character AI has substantial venture funding and engineering team size that matches the user scale. The platform has handled massive growth (10M+ characters, hundreds of millions of monthly conversations) which requires real engineering investment regardless of feature critiques.

Nomi AI has demonstrated technical sophistication through the memory architecture that competitors haven't matched. Building working memory at month-four continuity requires substantial engineering work.

Kupid AI has a smaller team but has secured $1.3 million in seed funding and operates from Cyprus with clearer corporate infrastructure than some competitors. The platform's architecture suggests serious engineering investment in specific dimensions like voice quality and customization options.

Candy AI has demonstrated engineering capability through visual features (image consistency, video generation) that other platforms haven't replicated. The features suggest real engineering investment, though the privacy practices remain at the category baseline.

The platforms with weaker signals

Without naming names, several platforms in the AI companion category have characteristics that suggest weaker engineering infrastructure: small teams, frequent feature regressions, vague privacy policies, no security disclosure programs, opaque corporate structures, history of breaches without transparent response. Users should evaluate platforms case-by-case using the signals above rather than trusting marketing claims about safety.

The CrushOn AI safety review, SpicyChat safety review, Muah AI safety review, and other individual platform safety assessments cover what we've documented about each platform.

What users can do

Several specific practices reduce risk:

Use pseudonymous accounts. Don't link your real identity to AI companion accounts when you can avoid it. The separation reduces blast radius if a breach occurs.

Limit identifying information shared. Discuss feelings, fantasies, and ideas with AI companions, but minimize specific identifying details that would link the content to your real identity if exposed.

Diversify across platforms. Don't put all your AI companion engagement on one platform. If a breach happens, only one platform's worth of content is exposed.

Use self-hosted setups for the most sensitive content. Local hosting eliminates the third-party engineering quality variable entirely. Your conversations exist on your hardware. No company's code review practices affect your data.

Pay attention to platform changes. Sudden feature changes, sudden policy changes, sudden corporate changes can signal engineering or business instability that might affect data security.

Use unique passwords and 2FA. Research from Krebs on Security and the Electronic Frontier Foundation cover the broader security landscape. Standard security hygiene matters more on platforms with potentially weaker engineering infrastructure than on platforms with mature security teams.

The honest framing

The AI companion industry is operating with engineering investment that doesn't match the sensitivity of data it handles. This isn't unique to AI companions; many tech categories have this gap. It's particularly acute for AI companions because the data is unusually sensitive and the platforms are unusually small relative to that sensitivity.

The "AI is writing code faster than humans can review it" pattern matters because it amplifies an already-present problem. Platforms with thin engineering teams using AI tools to amplify output produce more code, more features, and more potential vulnerabilities than they can thoroughly evaluate. The user-facing effects (data breaches, privacy failures, security incidents) become more likely as this pattern continues.

For users, the practical implication isn't to avoid AI companions. It's to use them with awareness of what you're trusting. The intimate conversations you have with Kupid AI, Candy AI, Nomi, or any other platform exist on infrastructure built by engineering teams whose practices you can't verify. Choosing platforms with stronger engineering signals, limiting identifying information, and considering self-hosted alternatives for the most sensitive content are practical responses to a structural reality.

The technology will keep advancing. Engineering investment will keep being uneven across the category. Users who pay attention to which platforms invest in code quality will navigate this better than users who don't. The AI companion industry isn't going to fix the engineering investment gap on its own; user demand for stronger practices is the mechanism that produces change.

Pay attention to which platforms publish meaningful security information, respond well to incidents, and demonstrate engineering investment that matches the sensitivity of the data they handle. Those are the ones worth trusting with your most intimate conversations. The rest are entertainment platforms operating on infrastructure that may not be ready for the data they're collecting.