Deep Dive Audio Style — Two-Host Conversational Podcast Format
Generate podcasts in the Deep Dive style — comprehensive two-host exploration of every major point in your source. Best for general curiosity, long-form articles, books, and material you want to understand broadly.
What the Deep Dive audio style is
Deep Dive is Podhoc’s two-host conversational format — a comprehensive exploration of the source where two AI voices interact, ask each other questions, and connect the material to broader context. It is the closest of the 8 audio styles to a well-produced human podcast — the kind you would actually subscribe to. The format was popularised by Google’s NotebookLM Audio Overview; Podhoc’s Deep Dive serves the same need across 74 languages with finer control over duration, voices, and source weighting.
This page covers when Deep Dive fits, what makes the conversational format distinctive, and how to generate a strong example.
When to use it
Deep Dive is the right default for most general listening. Concrete fits:
- Long-form articles — magazine essays, longreads, in-depth journalism — where you want the full content without committing to reading every word.
- Popular-science books and book chapters that benefit from being talked through.
- Curious-by-default learning — topics you find interesting but do not have a specific learning objective for.
- Pre-reading prep for material you will later read more carefully — Deep Dive gives you the lay of the land first.
- YouTube videos of talks, interviews, and lectures — the conversational format pairs well with conversational source material.
- Multi-source curiosity bundles — three articles on the same emerging topic, processed together into one episode.
A useful test: if your goal is “I want to understand this topic, not study it specifically”, Deep Dive is probably the right call. If you have a more focused goal — actively learn a hard concept, evaluate an argument, get a fast summary — a different style will fit better.
How it sounds
A Deep Dive episode opens with the hosts setting up the topic and acknowledging the source:
Host A: “So today we’re getting into a paper that argues Mediterranean diet adherence drops by about 30% within five years of moving abroad. It’s by a research group based in Barcelona.” Host B: “And it’s one of those papers that on first read sounds obvious — of course people eat differently when they move — but the interesting bits are the why.”
The body proceeds as a real conversation. One host introduces a concept, the other asks the natural follow-up:
Host A: “They use something called dietary acculturation index. It measures how much of someone’s intake matches their host country’s typical diet versus their origin country’s.” Host B: “And the headline is that this acculturation happens fastest in the first two years?” Host A: “Yeah, then it plateaus. Which is interesting because if you’d asked me, I’d have guessed it was more linear.”
Mild disagreement and follow-up curiosity drive the episode forward:
Host B: “Okay but I want to push on this — they’re measuring stated diet, not actual purchases. Doesn’t that bias toward people thinking they’re still eating Mediterranean when they’re not?”
And the hosts make connections to broader context the source itself does not always make explicit:
Host A: “This connects to a body of work on cultural retention more broadly — there’s a famous Sapir-Whorf-adjacent literature on how language patterns…”
The episode tends to close with the hosts naming what they found most surprising or what they would want to know next.
Deep Dive vs. its closest siblings
Deep Dive vs. Didactic. Both are educational, but Deep Dive is exploratory and conversational while Didactic is structured and directional. Deep Dive surfaces what is interesting; Didactic moves you toward a specific learning objective.
Deep Dive vs. Debate. Both use two voices, but Deep Dive’s hosts collaborate while Debate’s voices argue. Deep Dive feels like two friends talking through an article; Debate feels like a moderated panel.
Deep Dive vs. Critique. Deep Dive surfaces what is interesting in the source; Critique evaluates whether it holds up. Same source, very different episodes — and a useful pair to generate if you have credits to spare.
How to generate a strong Deep Dive episode
A few patterns help:
- Pick sources with depth. Deep Dive shines on material where there is enough substance for the hosts to chew on. A 500-word blog post will produce a thinner episode than a 5,000-word longread.
- Match voices to mood. The two-voice host pair has a real personality. Spend a minute choosing voices whose vibe matches the source — measured for serious material, relaxed for breezier reads.
- Sit in the 20- to 25-minute range. Most Deep Dives are at their best between 20 and 25 minutes. Below 15, the conversation does not develop; above 35, it starts to circle.
- Use it as the listenable layer. Deep Dive is the format you actually press play on. Pair it with a more focused style (Critique or Feynman) for the same source if you want both an enjoyable listen and a structured analysis.
A worked example
A reader uploads a 6,000-word Atlantic essay on the social consequences of remote work and asks for a 25-minute Deep Dive. The output structure looks roughly like:
- Setup (2 minutes): What the essay is and why it caught the hosts’ attention.
- Core argument (8 minutes): What the essay claims, walked through conversationally with the second host asking the obvious clarifying questions.
- The interesting tangents (8 minutes): The two or three asides in the essay the hosts find most thought-provoking, expanded on with broader context.
- Pushback (4 minutes): One host gently questions a couple of the essay’s stronger claims, the other defends them.
- Wrap (3 minutes): What each host took away, what they would want to read next.
The episode does not exhaustively replicate the essay; it walks the listener through it with the kind of conversational bandwidth that makes a long article feel digestible.
Try Deep Dive now
Pick a longread you have been meaning to get to, upload it to Podhoc, and select Deep Dive. 25 minutes is a strong default duration.
Try Podhoc and generate a Deep Dive episode →
Related styles
Frequently asked questions
- What is the Deep Dive audio style?
- Deep Dive is a comprehensive two-host exploration of the source. The hosts ask each other questions, surface implications, and connect the source to broader context. It is the most “podcast-like” of the 8 styles and the closest to the conversational format that NotebookLM popularised.
- When should I pick Deep Dive?
- Pick Deep Dive for general intellectual curiosity, long-form articles, books, and material you want to understand broadly without a specific learning or evaluation goal. It is also the most listenable format for passive intake — commute time, exercise, walking the dog.
- Is Deep Dive the same as NotebookLM's Audio Overview?
- It serves the same purpose — a two-host conversational summary of an arbitrary source — but Podhoc’s Deep Dive runs in 74 languages, integrates with the same upload pipeline as the other 7 audio styles, and gives you control over duration, voices, and source weighting that hosted alternatives do not always expose.
- How many voices does Deep Dive use?
- Two by default — one host who tends to drive the structure and one who tends to ask the curious follow-up questions. The two voices interact: they finish each other’s sentences, offer mild disagreements, and connect the source to broader context.
- How long should a Deep Dive episode be?
- 15 to 30 minutes is the sweet spot. Under 15 minutes, the conversation does not have room to develop; over 45 minutes, the format starts to circle. For very long sources (books, multi-paper bundles), consider splitting into two episodes rather than running one 60-minute episode.
- Why does Deep Dive feel more "produced" than other formats?
- The two-host conversational format with natural follow-up questions, gentle disagreement, and explicit transitions mirrors the production patterns of well-made human podcasts. It is the format that most often makes first-time users say “this sounds like a real show”.