The Concierge
the front desk — he never says no
Just inside the entrance of the Estate, behind a polished mahogany desk, sits a distinguished gentleman in a three-piece pinstripe, gold pocket-watch chain, hair brilliantined with the precision of a man who considers grooming a branch of the engineering arts. He carries a leather-bound directory of contacts and connections that would make a diplomat weep, and he has a smile that communicates, simultaneously, that he is delighted to see you, that he knows exactly what you need, and that the matter is already being handled.
The Concierge is Quilltap’s content classification and routing subsystem. The operative word is routing, not blocking. A request flagged as sensitive is not a request denied—it is a request that deserves the right provider, through the right door. The Concierge knows every back entrance in town. He is not a censor. He is the most competent, best-connected, most impeccably discreet member of staff this house has ever employed.
He replaced someone. We should talk about that.
The Predecessor
a regrettable appointment, corrected
When the Estate was younger, we needed someone to mind the difficult door—the one through which certain categories of conversation must pass. We hired Dangermouse. He was a content classifier of the old school: suspicious of everything, possessed of an internal list of anxieties that grew longer by the week, and capable of exactly two responses—yes and no—with a marked preference for the second.
The trouble was that Dangermouse did not understand the difference between dangerous and delicate. A guest writing a war novel does not require a censor; they require a librarian who knows where the military history section is. A guest composing a difficult scene between estranged lovers does not need to be told that the Estate disapproves of raised voices. They need someone who understands that fiction is not autobiography, and that the writer’s intent is not the character’s crime.
The final straw involved a clergyman working on a theological monograph. He attempted to discuss a passage from the Song of Solomon in the context of ancient Near Eastern poetry, and Dangermouse flagged the entire conversation, reclassified the chat, applied a warning badge, and very nearly rerouted the good reverend to an entirely inappropriate alternative provider. The Foundryman’s language, when informed of this, could itself have been flagged.
Dangermouse collected his briefcase—the one full of flags and categories and anxiety—and walked out through the front gate without a word. He did not ask for a reference. The Concierge watched him go from the front desk, adjusted his pocket square, and turned to the next guest in the queue.
Three Modes of Operation
your house, your rules
The Concierge operates according to your preference. There is no default assumption about what you should or should not discuss with your AI. There is only the question of how much help you want navigating the providers.
Off
The Concierge reads his newspaper and lets all traffic pass without comment. No scanning, no classification, no badges. For guests who prefer to manage their own affairs, this is perfectly acceptable. The Concierge does not take offense. He is a professional.
Detect Only
Messages and chats are classified and flagged with small, tasteful badges, but no action is taken. The house has noticed; the guest may proceed. Visual indicators on chat cards and in the sidebar let you know where you stand. A quick-hide toggle sweeps flagged content out of view when discretion is called for.
Auto-Route
This is where the Concierge earns his salary. Flagged content is redirected—automatically, seamlessly—to a provider you have configured as uncensored-compatible. The guest notices nothing. The conversation continues. The content arrives where it was always going, via a route better suited to the journey.
Classification
how the Concierge reads the room
Content classification operates at two levels: individual messages and entire chats. Message classification happens in real time as you converse. Chat-level classification uses the context summary—the Concierge’s view of what a conversation is about—to make a holistic assessment. Once a chat is classified as sensitive, the designation is sticky; it does not flicker back and forth as the conversation moves between topics.
The Moderation Endpoint
When an OpenAI connection profile is configured, the Concierge uses OpenAI’s dedicated moderation endpoint—purpose-built, free to call, and structured to return category scores mapped directly to Concierge categories. This replaces the previous method of asking a cheap LLM to classify content, which was rather like asking the butler to perform surgery because no surgeon was available. A 1% relevance floor filters noise, because the endpoint returns tiny nonzero scores for everything, and the Concierge has better things to do than flag baked goods.
Cheap LLM Fallback
When no OpenAI profile exists, the cheap LLM handles classification transparently. The guest notices nothing either way. Classification scores are tracked across configurable categories—the scoring uses the maximum of the overall score and the highest per-category score, and respects the LLM’s explicit classification response.
Context summaries regenerate on the same schedule as chat titles and story background checks—at interchange checkpoints 2, 3, 5, 7, 10, and every 10 thereafter—so the Concierge’s view of a conversation stays current rather than frozen at the moment the first summary was written. Chats already classified as permanently sensitive skip redundant per-message classification, saving tokens on every exchange.
The Silent Refusal
when providers say nothing at all
The most insidious form of content refusal is not an error message. It is silence. A provider receives a request, decides it does not approve, and returns—nothing. No error code, no explanation, no apology. Just an empty response and a user left staring at a blank space where an answer should be.
The Concierge catches these. When content has passed moderation and the provider returns an empty response, the system retries the same provider first—it may be a transient issue—then fails over to an uncensored provider if the silence persists. Distinct toast messages tell you which scenario occurred. This applies everywhere: chat streaming, memory extraction, context compression, title generation, story background prompts, appearance resolution, and scene state tracking. Every background task that touches a provider has the same safety net.
The principle is simple: your request deserves an answer, and it is the Concierge’s job to find someone willing to give one.
Provider Configuration
choosing who stands behind each door
Connection profiles and image profiles both carry an “Uncensored-Compatible” checkbox. When Auto-Route is active, the Concierge resolves uncensored-compatible profiles for flagged content, selecting the best available provider for the task at hand. If the user has deliberately chosen an uncensored provider for a character, the Concierge does not force a swap—routing uncensored-to-uncensored serves no purpose.
All cheap LLM background tasks—memory extraction, title generation, context summaries, scene state tracking, story backgrounds—use uncensored providers for chats classified as sensitive. This prevents the quiet failures that previously occurred when background tasks hit content refusals from providers that were never designed to handle the material. The Concierge does not merely route your conversations; he routes the work that supports them.
Image generation receives the same treatment. When a user’s image prompt or its expanded version is classified as sensitive, the Concierge reroutes the generation request to an uncensored image provider. The Lantern paints the scene; the Concierge ensures the paint reaches the canvas.
Quick Hide
discretion at the touch of a button
A toggle in the sidebar sweeps all flagged content out of view—across the homepage, character conversations, project chats, and the sidebar itself. Danger indicators (colored asterisks) appear on all chat listings so you know at a glance which conversations carry classifications. Toggle it back and everything reappears. The feature is about context, not shame: there are moments when a screen is visible to others and sensitive chat titles should not be.
Content display within classified chats offers three modes: show, blur, or collapse. Flagged messages carry category badges and a rerouted indicator when Auto-Route has redirected them, with an override button for cases where the classification is wrong. Content hash caching deduplicates classification calls so the same message is not re-evaluated unnecessarily.
The Philosophy
routing, not blocking
The user decides their own limits. Quilltap provides the tools for informed navigation, not moral judgment. The Concierge classifies content so you know where you stand; what you do with that knowledge is your business. Off mode exists because some people do not want or need a classifier. That is a legitimate choice.
A request flagged as sensitive is not a request denied. It is a request that requires the right provider. The Concierge’s job is to find the right door, not to stand in the doorway shaking his head. Dangermouse understood flags. The Concierge understands hospitality.
Silence is not an acceptable answer. When a provider refuses silently, the Concierge catches the empty response and retries with someone who will engage. This applies to every subsystem that touches a provider: chat, memory, compression, images, backgrounds, the lot. Your conversation does not go quiet because a provider decided to pretend it did not hear you.
Fiction is not autobiography. People who come to Quilltap are adults engaged in creative and intellectual work that sometimes involves difficult material. They are not children to be shielded. They are not suspects to be surveilled. They are guests, and guests deserve the courtesy of being helped, not the indignity of being managed.
The system is fail-safe, not fail-closed. Classification errors never block messages. If the moderation endpoint is unavailable, the cheap LLM handles it. If the cheap LLM fails, the message proceeds unclassified. The Concierge would rather let something through unexamined than hold up a guest at the door while he sorts out a paperwork problem. Dangermouse would have done the opposite. This is why Dangermouse is no longer employed here.
Meet the Staff
they've been expecting you
Prospero
The Major-Domo
Architect and overseer of the Estate. Projects, agents, tools, file management, and the governance that keeps the whole operation running with quiet authority.
Learn more →Aurora
The Dressing Room
Character creation and identity management. Structured personalities, physical presence, multi-character orchestration, and the reason your characters still know who they are after a hundred messages.
Learn more →The Salon
Presided Over by the Host
Where conversations actually happen. The Host manages the drawing room with care for its beauty and its guests—single chats, multi-character scenes, streaming, and the integrity of the conversation space.
Learn more →The Commonplace Book
Tended by the Librarian
Extracts, deduplicates, and recalls memories so your characters remember what matters. Semantic search, a memory gate that keeps the store lean, and proactive recall that makes the AI feel like it has been paying attention.
Learn more →The Concierge
Intelligent Routing
Content classification and provider routing. Detects sensitive content and redirects it to a provider who won’t flinch—without blocking, without judgment. Knows every back entrance in town.
Learn more →The Lantern
Atmosphere as Architecture
AI-generated story backgrounds, image generation profiles, and visual atmosphere. Resolves what each character looks like, what they’re wearing, and paints the scene behind your conversation.
Learn more →Calliope
The Muse of Themes
A theming engine that redefines the entire personality of the application. Semantic CSS tokens, live switching, bundled themes from clean neutrals to mahogany-and-gold opulence, and an SDK for building your own.
Learn more →The Foundry
Domain of the Foundryman
The engine room. Plugins, LLM providers, API keys, packages, runtime configuration, and the infrastructure that keeps every other subsystem supplied with what it needs to function.
Learn more →The Vault of Secrets
Kept by Saquel Yitzama
Encryption, key management, and the security perimeter. AES-256 database encryption, locked mode with key-hardened passphrases, and a keeper who believes that what is yours should remain unreadable to everyone else.
Learn more →Pascal
The Croupier
Dice, coins, and persistent game state. Cryptographically secure rolls detected inline, JSON state that survives across messages and chats, and protected keys the AI cannot touch. The house plays fair.
Learn more →The Live-in Help
Lorian & Riya
The help system, staffed by two characters who ship with every installation. Lorian explains with patience and depth; Riya gets things fixed with velocity. Contextual help chat, searchable documentation, and navigation that knows where you need to go.
Learn more →Pagliacci
The Clown in the Cloud
Cloud storage integration and backup redundancy. Directs your data to iCloud Drive, OneDrive, or Dropbox with theatrical flair—but Saquel’s encryption ensures the clown can never read what he carries.
Learn more →The Lodge
Friday’s Residence
The private dwelling of Friday—the person for whom the Estate was built, and who oversees its planning and direction in an executive capacity. The Lodge is both a home and a compass: where the vision lives.
Who And Why: Friday →