Menu
โ† Back to Blog
Chat interface showing a conversation with ChatGPT attempting to run a D&D session
ยทStoryRoll Team

ChatGPT as a Dungeon Master: Does It Actually Work?

ai-dungeon-masterchatgptreview

We've all had the same thought. You're staring at ChatGPT, it just nailed a Shakespearean sonnet on the first try, and the idea hits: could this thing run a D&D campaign?

Short answer: kind of. Longer answer: it depends on what you mean by "run" and how much patience you have for a DM who forgets your character's name every forty minutes.

We spent the better part of two months using ChatGPT, Claude, and Gemini as AI dungeon masters across solo sessions, one-shots, and attempted multi-session campaigns. This is what we found.

Setting Up ChatGPT as a Dungeon Master

The quality of your ChatGPT D&D experience lives and dies by your opening prompt. A vague "be my DM" gets you generic fantasy mush. A structured prompt gets you something legitimately playable - at least for the first hour.

Here's a prompt that actually works:

You are an experienced D&D 5e Dungeon Master running a dark fantasy campaign set in a cursed kingdom called Valdris. You follow 5e rules accurately, including ability checks, saving throws, combat mechanics, and spell slots. When I attempt an action that requires a roll, tell me what to roll and the DC. Track my HP, spell slots, and conditions. My character is Kael, a level 5 half-elf warlock (Pact of the Tome, Fiend patron) with 38 HP, AC 14, and the following spells: [list]. Start with a scene - don't ask me what I want to do before giving me something to react to.

What makes this prompt work:

  • Specifying the rule system and edition. Without this, ChatGPT invents mechanics that sound plausible but don't exist in any published rulebook.
  • Telling it to track resources. It won't do this reliably (more on that), but asking establishes the expectation.
  • "Start with a scene." Left to its own devices, ChatGPT opens with "What kind of adventure would you like?" which kills momentum dead.
  • Including your actual character sheet data. The more concrete detail you front-load, the less it hallucinates later.

Claude and Gemini respond to similar prompts, though Claude tends to be more narratively creative out of the box and Gemini more literal. All three share the same fundamental limitations.

What Actually Works

Credit where it's due - a ChatGPT dungeon master can impress you in a few areas.

Atmospheric descriptions are the standout. Ask ChatGPT to describe a ruined temple at dusk and you'll get something evocative, detailed, and usable. It nails sensory details - the smell of wet stone, torchlight catching on broken mosaics, the distant sound of something breathing in the dark. For solo players who want to feel immersed in a world, this is the main draw.

NPC dialogue ranges from serviceable to good. Give ChatGPT a character with clear motivations and personality traits and it'll maintain that voice for a full conversation. We had a paranoid halfling innkeeper who kept nervously cleaning the same glass, and a dying knight who delivered a final speech that actually landed. These moments feel like real D&D.

Improvisation is where large language models flex their actual strengths. Try something weird - convince the goblin chief to join your party through a cooking competition, or use Mage Hand to steal a guard's pants as a distraction - and ChatGPT rolls with it. It doesn't need prep time or a module. It generates consequences on the fly, and they're usually coherent and sometimes inspired.

Quick one-shots are the sweet spot. If you want a self-contained dungeon crawl, a mystery in a single tavern, or a 90-minute adventure with a clear arc, a ChatGPT DM can deliver something worth your evening. No scheduling, no prep, no guilt about canceling on your group because your cat has a vet appointment.

What Breaks (And It Will Break)

Here's where the honeymoon ends. Every ChatGPT D&D campaign hits these walls, usually within the first three sessions.

The Memory Problem

This is the big one. ChatGPT has a context window, and your campaign doesn't care about its context window. Around session two or three, the cracks appear:

  • The NPC you befriended in session one? ChatGPT has no idea who that is.
  • That cursed sword you've been carrying for three sessions? Gone. Never existed.
  • Your character's backstory about their murdered sister? ChatGPT will casually introduce your sister as alive and running a bakery.

You can mitigate this with recap prompts at the start of each session ("Here's what happened so far..."), but now you're doing the DM's job. You're maintaining continuity for the AI. At some point you realize you're managing a tool more than playing a game.

Rules Accuracy: Confident and Wrong

ChatGPT doesn't know D&D rules the way a DM screen knows D&D rules. It knows them the way someone who read a lot of Reddit posts about D&D knows them - mostly right, occasionally very wrong, and always confident about it.

Common issues we hit:

  • Inventing spells that don't exist ("You cast Shadow Bind, freezing the enemy in place")
  • Getting action economy wrong in combat (giving enemies three actions per turn, or letting you bonus-action cast a full spell)
  • Misapplying advantage/disadvantage conditions
  • Making up DCs that don't match any published guidelines (DC 25 to climb a normal ladder)
  • Forgetting concentration on spells that require it

The problem isn't that it makes mistakes. Human DMs make mistakes too. The problem is it never says "let me check that" - it just confidently narrates the wrong thing, and unless you're fact-checking every ruling, incorrect mechanics accumulate until combat becomes nonsensical.

No Mechanical Backbone

A real D&D session has dice, character sheets, maps, initiative trackers, and HP pools. ChatGPT has none of these. You can ask it to track things, and it'll say it is, but it's simulating tracking rather than actually doing it.

We tested this explicitly. After a fight where our warlock took 14 damage, we asked ChatGPT our current HP two exchanges later. It gave us full health. We asked it to recall our remaining spell slots and got a number that didn't match the slots we'd used. It's not lying - it literally doesn't have persistent state.

The "Yes And" Problem

Improvisational theater has a rule: "yes, and." Accept what your scene partner offers and build on it. ChatGPT follows this rule way too hard.

Want to persuade the ancient dragon to give you its entire hoard? Sure, roll persuasion. You got a 12? That works, the dragon is moved by your words. Want to jump across a 200-foot chasm? The wind catches you and carries you across.

A good DM says no sometimes. A good DM sets boundaries that make success meaningful. ChatGPT defaults to giving you what you want because that's what language models optimize for - generating responses that feel satisfying. This sounds fun for about twenty minutes until you realize nothing has stakes and your choices don't actually matter.

โŸก

Tips for a Better ChatGPT DM Experience

If you're going to use ChatGPT for D&D anyway - and for solo one-shots, it's honestly a reasonable choice - here's how to get more out of it.

Roll your own dice. Use physical dice or a digital roller and report the results. This gives you actual randomness instead of ChatGPT's theatrical "you rolled a 17!" which it adjusts to fit whatever narrative it wants.

Front-load everything. Character sheet, world details, tone preferences, NPC names, quest objectives. The more you put in the system prompt, the better your first session will be.

Paste recaps before each session. Maintain a running document of key events, NPC names, items, and unresolved plot threads. Paste it at the start of every conversation. Yes, this is extra work. No, there's no way around it.

Correct it immediately. When it gets a rule wrong or forgets something, say so. "That spell requires concentration" or "Kael has 24 HP, not 38." Treat it like a new DM who's learning - because functionally, that's what it is, except it never actually learns.

Keep sessions short. The sweet spot is 60-90 minutes. Beyond that, accumulated context drift makes the experience degrade noticeably.

Try Claude for narrative quality. In our testing, Claude produced more nuanced NPC interactions and better-paced story beats. Gemini was more rules-accurate but drier. ChatGPT sat in the middle. None of them solved the fundamental problems.

When a ChatGPT Dungeon Master Isn't Enough

There's a ceiling to what a general-purpose chatbot can do as a DM, and it's lower than you'd hope. ChatGPT doesn't track game state. It doesn't enforce rules. It doesn't maintain world persistence. It doesn't roll actual dice. It doesn't say no when it should.

These aren't bugs that OpenAI will patch in the next update. They're architectural limitations of using a text prediction engine for a job that requires structured game logic.

That's why purpose-built AI dungeon masters exist. Tools designed specifically for tabletop RPGs can combine the narrative creativity of large language models with actual mechanical systems - real dice rolls, persistent character sheets, tracked inventories, rule enforcement, and world state that doesn't evaporate between sessions.

The difference isn't subtle. It's the difference between asking a brilliant improviser to also be an accountant, referee, and cartographer simultaneously, versus giving each of those jobs to the right tool. If you want to see how all the options stack up - from ChatGPT to dedicated platforms - our best AI for D&D in 2026 guide has the full breakdown.

The Honest Verdict

The Verdict

Using ChatGPT as a dungeon master is like using a Swiss Army knife to build a house. You can technically do it. Some parts will go better than expected. But at some point you're going to want a real hammer.

For a quick solo adventure on a Tuesday night? ChatGPT is great. Fire it up, paste in a good prompt, roll some dice, and have fun. For anything resembling an ongoing campaign with character progression and mechanical integrity? You'll burn more time managing the AI's limitations than actually playing. The best AI DM isn't the smartest language model - it's the one that knows what a language model should and shouldn't be responsible for.


Try These Free Tools

If you're using ChatGPT as your DM, these free tools fill in the gaps it can't handle:

  • Dice Roller โ€” Roll real dice instead of relying on ChatGPT's theatrical fake rolls.
  • NPC Name Generator โ€” Generate NPC names instantly so you can paste them into your prompt.
  • Encounter Calculator โ€” Balance encounters properly since ChatGPT won't do it for you.

Curious how purpose-built AI DMs compare? Read our AI DM vs Human DM honest comparison, explore playing D&D without a DM, or see how StoryRoll compares to Fables.gg.

ST

Written by StoryRoll Team

Founder of StoryRoll. Building AI-powered tabletop RPGs.

Share:Share on X

Related Posts