guide

Using DeepSeek With Janitor AI: The Setup That Replaced JLLM

A step-by-step guide to connecting DeepSeek to Janitor AI, comparing providers (direct API vs OpenRouter vs Chutes), and the settings that produce the best roleplay quality.

May 4, 2026 · 6 min read

Affiliate disclosure: Some of the links in this article are affiliate links. We may earn a commission if you sign up for a platform through these links, at no additional cost to you. This doesn't influence our editorial verdicts. Full disclosure →

Janitor AI's built-in JLLM model is functional. It handles basic conversations, stays mostly in character, and costs nothing. It's also the reason most users describe Janitor AI as "okay" instead of "great." The responses are flat, the prose is generic, and the character consistency drifts after twenty messages like a boat with one oar.

DeepSeek changed that equation overnight. When Janitor AI added support for external API connections and users started plugging in DeepSeek, the quality jump was dramatic enough that the Janitor AI subreddit exploded with posts about it. Better prose, stronger character consistency, longer context retention, and a writing quality that made JLLM feel like a rough draft.

Here's how to set it up, which provider to use, and which settings produce the best results.

Why DeepSeek specifically

DeepSeek's models (particularly DeepSeek V3 and the R1 reasoning model) perform unusually well for roleplay compared to their general-purpose benchmarks. The writing has more texture than most open-weight models, the character consistency across long conversations is strong, and the model handles emotional subtext and pacing better than JLLM by a significant margin.

It's also cheap. DeepSeek's direct API pricing is among the lowest in the industry for the quality level. The free tier (50 messages per day through the official API) lets you test the quality before spending anything. For most users, this is enough to confirm whether the upgrade is worth configuring.

The alternatives (GPT-4o, Claude, Gemini) are all technically connectable through Janitor AI via OpenRouter, and some of them produce even better prose. But DeepSeek hits the sweet spot of quality, price, and accessibility that makes it the default recommendation for Janitor AI users who want better output without spending $20/month on API costs.

Three ways to connect: which provider fits you

Option 1: DeepSeek Official API (direct)

Go to platform.deepseek.com, create an account, and generate an API key. The free tier gives you 50 messages per day per account. Adding a one-time $10 deposit to your balance unlocks 1,000 messages per day for life (using the free-tier models only, paid models deduct from your balance).

This is the simplest and cheapest path. One account, one API key, paste it into Janitor AI's settings, and you're running. The tradeoff is that your conversation data flows through DeepSeek's servers in China, which matters for some users and doesn't for others. DeepSeek's privacy policy is worth reading if this concerns you.

Option 2: OpenRouter

OpenRouter is a routing service that gives you access to multiple models through a single API key. You create an OpenRouter account, add credits, and select which model to use. DeepSeek models are available alongside Claude, GPT-4o, Gemini, Llama, and dozens of others.

The advantage: flexibility. You can switch between models without changing your API key setup in Janitor AI. Try DeepSeek V3 today, Claude tomorrow, compare quality, and settle on what works best for your style. OpenRouter also adds a layer between you and the model provider, which some users prefer for privacy reasons.

The cost is slightly higher than direct API access because OpenRouter takes a small markup. For casual users, the difference is negligible. For power users sending hundreds of messages daily, it adds up.

Option 3: Chutes

Chutes is a newer provider that offers DeepSeek access with a simpler pricing model. The interface is less technical than OpenRouter, which makes it attractive for users who want the quality upgrade without learning about API routing architecture.

The free tier is limited. Paid access is competitive with OpenRouter. The community has produced less documentation for Chutes than for the other two options, so troubleshooting is harder if something breaks.

The recommendation: start with DeepSeek's direct API (free, simple, 50 messages/day). If you want more volume or model variety, move to OpenRouter. Use Chutes if the other two options feel too technical.

The actual setup (five minutes, not five hours)

Once you have an API key from any provider:

Open Janitor AI and navigate to the chat interface for any character. Click the settings gear icon. Go to API Settings. Select the provider that matches your API key (DeepSeek, OpenRouter, or Custom). Paste your API key into the field. Save.

Refresh the chat page. This step is easy to forget and it matters, the new settings don't apply until you refresh.

Start a new conversation (or continue an existing one) and the responses will now route through your connected model instead of JLLM. The quality difference is usually obvious within the first two messages.

Important: every time you log out of Janitor AI, you'll need to re-enter your API key. This is a known behavior, not a bug. The platform doesn't store third-party API keys across sessions for security reasons.

Generation settings that actually matter

The default generation settings work fine for most users, but two parameters make a noticeable difference for roleplay quality:

Temperature controls randomness in the AI's responses. Lower temperature (0.4-0.7) keeps the character more consistent and predictable. Higher temperature (1.0-1.5) makes responses more creative and surprising but increases the chance of the character drifting or producing nonsensical output.

For roleplay, start at 0.7 and adjust based on your preference. If the character feels robotic, raise it. If the character starts saying random things, lower it. The sweet spot varies by character and scenario, and experimenting is the fastest way to find yours.

Context size determines how many previous messages the AI considers when generating a response. Higher context (16K-32K tokens) means the AI remembers more of the conversation, which improves consistency in long sessions. Lower context means faster responses but shorter memory.

Set context to the maximum your provider supports. For DeepSeek V3, that's 64K tokens. For the free tier, 16K-32K is typical. More context is almost always better for roleplay because it reduces the character-forgetting-the-premise problem that plagues shorter context windows.

Max tokens (response length) should be set to 0 (unlimited) or the maximum allowed. Setting this too low produces responses that cut off mid-sentence.

JLLM vs. DeepSeek: the actual difference

The quality gap is real but worth specifying:

Prose quality: DeepSeek writes more varied, textured responses. JLLM tends to repeat sentence structures and fall into adjective patterns. DeepSeek produces dialogue that sounds more natural and narrative descriptions with more sensory detail.

Character consistency: DeepSeek maintains character card instructions more faithfully across long conversations. JLLM drifts earlier, particularly with complex personalities that involve contradictions or behavioral rules.

Context retention: DeepSeek with 32K context remembers details from much earlier in the conversation than JLLM, which effectively has a shorter working memory even when processing the same conversation length.

Speed: JLLM is often faster because it runs on Janitor's dedicated infrastructure. DeepSeek's response time depends on your provider and the current server load. During peak hours, DeepSeek can be slower. During off-peak hours, the difference is negligible.

Content restrictions: both JLLM and DeepSeek allow NSFW content through Janitor AI. DeepSeek's base model has some built-in safety training, but when accessed through Janitor AI's prompt formatting, it generally follows the character card's content parameters.

When DeepSeek isn't the answer

If your problem with Janitor AI is server downtime rather than quality, switching to DeepSeek doesn't fully solve it. Your messages still route through Janitor AI's servers even when using an external API, so Janitor's infrastructure issues affect you regardless of which model generates the responses. For complete independence from Janitor's servers, SillyTavern is the move.

If you want persistent memory across sessions, DeepSeek doesn't add that. The context window resets when you start a new conversation. For cross-session memory, Nomi or Kindroid are purpose-built for that feature.

If you want the absolute best prose quality and money isn't the primary concern, Claude through OpenRouter produces more literary output than DeepSeek. The cost is higher ($5-15/month depending on usage), but for users who treat roleplay as creative writing, the prose quality difference is noticeable.

DeepSeek is the best default recommendation because it offers the biggest quality improvement for the lowest cost and complexity. But it's not the only option, and knowing what it doesn't solve helps you decide whether the API setup is worth your time or whether a different platform solves your actual problem more directly.