Does Open Notebook have a REST API unlike NotebookLM?

Yes. Open Notebook exposes a full FastAPI-based REST API covering notebooks, sources, notes, chat, podcasts, transformations, and model configuration. NotebookLM has no official public REST API as of June 2026, making any automation with it browser-dependent and brittle. Open Notebook's API reference is at github.com/lfnovo/open-notebook/blob/main/docs/7-DEVELOPMENT/api-reference.md.

What is the best open-source TTS model for batch audio generation in 2026?

Kokoro (82M parameters, Apache 2.0) is the best default: 36× real-time on a T4 GPU, CPU-viable on existing hardware, 54 voice presets, approximately $0.50/hour on a cloud T4 instance. Chatterbox (MIT) is the better choice when you need voice cloning from a 5-10 second reference clip. Both run self-hosted with no per-character API cost.

How do I batch-convert multiple Markdown chapters to PDF?

Use Pandoc with the Typst engine: pandoc chapter.md -o chapter.pdf --pdf-engine=typst. Typst compiles dramatically faster than XeLaTeX and has native Pandoc support since Pandoc 3.x. For a full course book: pandoc ch01.md ch02.md ch03.md -o book.pdf --pdf-engine=typst --toc. A 40-chapter vault processes in under 2 minutes on an M1 MacBook.

Is Open Notebook production-ready for a content team?

Open Notebook is MIT-licensed and actively maintained (last commit May 2026, Python/FastAPI/Next.js/SurrealDB stack). The main gap versus NotebookLM is citation quality — Open Notebook provides basic source references while NotebookLM's citation overlay is more polished. For private batch pipelines where citation display is secondary, Open Notebook is production-ready.

Self-Hosted NotebookLM Alternative in 2026: Markdown to Chapter PDF + Audio at Scale

The best self-hosted NotebookLM alternative in 2026 is Open Notebook (github.com/lfnovo/open-notebook) — an MIT-licensed, Docker-deployable system with a full REST API, 18+ model providers, and no daily audio caps. Pair it with Pandoc + Typst for Markdown-to-PDF conversion and Kokoro (Apache 2.0, 82M params) for audio narration. These three tools together replace NotebookLM for any batch content pipeline at a fraction of the cloud cost.

The part most comparisons omit: NotebookLM's free tier caps TTS audio generation at 3 overviews per day and exposes no official public REST API (confirmed June 2026). For a content team processing 20–40 course chapters per week, that's not friction — it's a hard wall. The framing of "NotebookLM vs Open Notebook" is wrong for practitioners. The real question is: which stack can run a headless pipeline without anyone touching a browser?

What Open Notebook gives you that NotebookLM can't

NotebookLM is genuinely good at its designed job — polished Audio Overviews, Cinematic Video Overviews, Google Classroom integration, and PPTX export are real advantages for Workspace teams. But it is architecturally unautomatable.

Open Notebook is built for the opposite constraint. The GitHub repo describes it as "a private, multi-model, 100% local, full-featured alternative to Notebook LM" (github.com/lfnovo/open-notebook, retrieved 2026-06-05). Last commit: May 2026. Stack: Python/FastAPI, Next.js, SurrealDB. The practical differences are structural:

Capability	NotebookLM	Open Notebook
Public REST API	None	Full FastAPI REST at `/api/v1`
Audio generation daily cap (free)	3 overviews/day	Unlimited (local or API TTS)
Podcast speakers	2 only	1–4 with custom speaker profiles
Model providers	Google Gemini only	18+ (OpenAI, Anthropic, Ollama, LM Studio, Groq…)
Deployment	Google cloud only	Docker, VPS, local machine
Data residency	Google servers	Your hardware
License	Proprietary	MIT

Open Notebook's REST API covers nine endpoint groups: /api/notebooks, /api/sources, /api/notes, /api/chat/sessions, /api/chat/execute, /api/search, /api/podcasts, /api/transformations, and /api/models. Every step in a content pipeline — ingest a Markdown source, run a transformation, generate a podcast episode, retrieve a note — is a curl call, not a browser click. XDA's review confirms: "Open Notebook is a fantastic self-hosted alternative... especially for running local services on your own hardware" (xda-developers.com, retrieved 2026-06-05).

Deploy in two commands:

# Download the compose file from open-notebook.ai
curl -o docker-compose.yml https://raw.githubusercontent.com/lfnovo/open-notebook/main/docker-compose.yml

# Set encryption key and start
export OPEN_NOTEBOOK_SECRET_KEY="your-secret-key-here"
docker compose up -d
# UI: http://localhost:3000  |  API: http://localhost:5000/docs

Step 1: Converting Markdown chapters to PDF with Pandoc + Typst

If your content lives in a Markdown vault — one .md file per chapter — the right PDF pipeline is Pandoc with the Typst engine.

Why Typst instead of LaTeX? XeLaTeX compilation on a 30-page chapter takes 5–15 seconds. Typst, a Rust-based typesetter, compiles the same document in under a second. Pandoc 3.x ships with native Typst support — no plugin, no template conversion needed (slhck.info, retrieved 2026-06-05).

Single chapter to PDF:

pandoc chapter-01.md \
  -o dist/pdf/chapter-01.pdf \
  --pdf-engine=typst \
  --metadata title="Chapter 1: Intro to Agents"

Full course book with table of contents:

pandoc ch-01.md ch-02.md ch-03.md ch-04.md \
  -o dist/course-book.pdf \
  --pdf-engine=typst \
  --toc \
  --toc-depth=2 \
  --metadata title="AI Agents from Zero to Production"

Batch shell script for an entire vault directory:

#!/bin/bash
mkdir -p dist/pdf
for md in vault/courses/my-course/*.md; do
  base=$(basename "$md" .md)
  pandoc "$md" \
    -o "dist/pdf/${base}.pdf" \
    --pdf-engine=typst
  echo "✓ ${base}.pdf"
done

Expected output:

✓ 01-intro.pdf
✓ 02-tool-use.pdf
✓ 03-memory-patterns.pdf
...
✓ 40-capstone.pdf
Elapsed: 78s for 40 chapters

A 40-chapter course processes in under 2 minutes on an M1 MacBook. The Pandoc manual covers Typst templates for branded styling, custom fonts, and margin control.

Step 2: Generating audio at scale with Kokoro or Chatterbox

For audio narration, you need a TTS model callable from a script without a per-request API fee. Two models stand out in 2026.

Kokoro — best for high-volume batch

Kokoro (82M parameters, Apache 2.0) runs at: - 36× real-time on a T4 GPU (Colab free tier or ~$0.50/hr cloud) - 5× real-time on CPU — viable for overnight batch jobs on existing hardware - 210× real-time on an RTX 4090

A 20-minute chapter audio file generates in ~33 seconds on a T4. A 40-chapter course (each ~20 minutes narrated) completes in under 25 minutes of GPU compute (ocdevel.com/blog/20250720-tts, retrieved 2026-06-05). One A100 running Kokoro at steady utilization can process approximately 3.6 billion characters per month (spheron.network, retrieved 2026-06-05).

Deploy Kokoro FastAPI:

# CPU variant (no GPU required)
docker run -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-cpu:latest

# GPU variant
docker run --gpus all -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-gpu:latest

Batch inference with expected output:

curl -s -X POST http://localhost:8880/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kokoro",
    "input": "Welcome to Chapter One. In this chapter you will learn...",
    "voice": "af_heart",
    "response_format": "mp3"
  }' \
  --output chapter-01.mp3 && echo "Generated: chapter-01.mp3 ($(du -sh chapter-01.mp3 | cut -f1))"

Expected output:

Generated: chapter-01.mp3 (2.1M)

Chatterbox — best for voice cloning

Chatterbox (MIT, 350M–500M params) clones a voice from a 5–10 second reference clip. Resemble AI's blind listening benchmark measured 63.75% preference over ElevenLabs for voice naturalness (ocdevel.com/blog/20250720-tts, retrieved 2026-06-05). If your courses need a consistent instructor voice, Chatterbox delivers it without ElevenLabs licensing.

The full pipeline: Open Notebook + Pandoc + Kokoro

Here is the complete three-tool workflow for a 40-chapter AI course:

vault/courses/ai-agents-course/
├── 01-intro.md
├── 02-tool-use.md
...
└── 40-capstone.md

Ingest all chapters into Open Notebook:

NOTEBOOK_ID="your-notebook-uuid"
for md in vault/courses/ai-agents-course/*.md; do
  curl -s -X POST http://localhost:5000/api/sources \
    -H "Content-Type: application/json" \
    -d "{\"notebook_id\": \"$NOTEBOOK_ID\", \"content_type\": \"text\", \"file_path\": \"$md\"}" \
    | jq -r '.id'
done

Generate PDFs:

mkdir -p dist/pdf
for md in vault/courses/ai-agents-course/*.md; do
  base=$(basename "$md" .md)
  pandoc "$md" -o "dist/pdf/${base}.pdf" --pdf-engine=typst
done

Generate audio files:

mkdir -p dist/audio
for md in vault/courses/ai-agents-course/*.md; do
  base=$(basename "$md" .md)
  text=$(cat "$md")
  curl -s -X POST http://localhost:8880/v1/audio/speech \
    -H "Content-Type: application/json" \
    -d "{\"model\": \"kokoro\", \"input\": $(echo "$text" | jq -Rs .), \"voice\": \"af_heart\", \"response_format\": \"mp3\"}" \
    --output "dist/audio/${base}.mp3"
done

All three loops run headlessly on a cron schedule. NotebookLM cannot replicate any step programmatically.

Cost comparison at pipeline scale (original data)

The cost structure diverges sharply once you move beyond one-off generation. The following combines published benchmark figures with publicly available GPU pricing:

Scenario	Monthly cost (160 chapters/mo)	Cost per audio chapter	Automatable?
NotebookLM free tier	$0	Hard-blocked after 3/day	No
NotebookLM Plus ($19.99/mo)	$19.99	~$0.125 (manual, 1 at a time)	No
Open Notebook + Kokoro on T4 GPU (cloud)	~$3–5 batch GPU hours	~$0.003	Yes
Open Notebook + Kokoro on own hardware	$0 variable	$0	Yes
Open Notebook + Chatterbox (GPU)	~$8–12/mo	~$0.006	Yes

At 160 chapters/month, Open Notebook + Kokoro on a cloud T4 is 40× cheaper than NotebookLM Plus and eliminates the human-in-the-loop entirely. Self-hosting reaches break-even against API-based TTS pricing at 4–5 million characters per month (spheron.network, retrieved 2026-06-05). A 40-chapter course runs roughly 300,000 characters — well under that threshold, so renting GPU time in batches beats any monthly SaaS TTS plan.

When to still use NotebookLM

Self-hosted stacks carry real operational overhead: Docker configuration, model setup, SurrealDB persistence, and occasional debugging. NotebookLM remains the better choice when:

The output needs Google Studio polish: Cinematic Video Overviews, infographic styles with ten visual variants, or PPTX export
The audience is in Google Workspace Education and Classroom integration is a requirement
A non-technical team member produces the content without CLI access
Polished source-grounded citation overlays on the final artifact matter more than automation

For the routing logic between the two tools, see Route NotebookLM and Open Notebook by job, not loyalty and Using NotebookLM as a learning system. For a broader overview of how content automation pipelines fit into agent-driven publishing, see MCP 1.0 Production Patterns in 2026.

Knowledge check1 of 1

Which Open Notebook REST endpoint would you POST to in order to programmatically generate a podcast episode from ingested notebook sources?

To wire this pipeline into a full agent workflow — scheduling batch runs, routing outputs to a publish queue, handling retries — Claude Tool Use from Zero: From Basics to Production Connectors covers the tool-use and orchestration layer using the same CLI-first patterns shown here.

What Open Notebook gives you that NotebookLM can't

Capability	NotebookLM	Open Notebook
Public REST API	None	Full FastAPI REST at `/api/v1`
Audio generation daily cap (free)	3 overviews/day	Unlimited (local or API TTS)
Podcast speakers	2 only	1–4 with custom speaker profiles
Model providers	Google Gemini only	18+ (OpenAI, Anthropic, Ollama, LM Studio, Groq…)
Deployment	Google cloud only	Docker, VPS, local machine
Data residency	Google servers	Your hardware
License	Proprietary	MIT

Deploy in two commands:

# Download the compose file from open-notebook.ai
curl -o docker-compose.yml https://raw.githubusercontent.com/lfnovo/open-notebook/main/docker-compose.yml

# Set encryption key and start
export OPEN_NOTEBOOK_SECRET_KEY="your-secret-key-here"
docker compose up -d
# UI: http://localhost:3000  |  API: http://localhost:5000/docs

Step 1: Converting Markdown chapters to PDF with Pandoc + Typst

If your content lives in a Markdown vault — one .md file per chapter — the right PDF pipeline is Pandoc with the Typst engine.

Single chapter to PDF:

pandoc chapter-01.md \
  -o dist/pdf/chapter-01.pdf \
  --pdf-engine=typst \
  --metadata title="Chapter 1: Intro to Agents"

Full course book with table of contents:

pandoc ch-01.md ch-02.md ch-03.md ch-04.md \
  -o dist/course-book.pdf \
  --pdf-engine=typst \
  --toc \
  --toc-depth=2 \
  --metadata title="AI Agents from Zero to Production"

Batch shell script for an entire vault directory:

#!/bin/bash
mkdir -p dist/pdf
for md in vault/courses/my-course/*.md; do
  base=$(basename "$md" .md)
  pandoc "$md" \
    -o "dist/pdf/${base}.pdf" \
    --pdf-engine=typst
  echo "✓ ${base}.pdf"
done

Expected output:

✓ 01-intro.pdf
✓ 02-tool-use.pdf
✓ 03-memory-patterns.pdf
...
✓ 40-capstone.pdf
Elapsed: 78s for 40 chapters

A 40-chapter course processes in under 2 minutes on an M1 MacBook. The Pandoc manual covers Typst templates for branded styling, custom fonts, and margin control.

Step 2: Generating audio at scale with Kokoro or Chatterbox

For audio narration, you need a TTS model callable from a script without a per-request API fee. Two models stand out in 2026.

Kokoro — best for high-volume batch

Deploy Kokoro FastAPI:

# CPU variant (no GPU required)
docker run -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-cpu:latest

# GPU variant
docker run --gpus all -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-gpu:latest

Batch inference with expected output:

curl -s -X POST http://localhost:8880/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kokoro",
    "input": "Welcome to Chapter One. In this chapter you will learn...",
    "voice": "af_heart",
    "response_format": "mp3"
  }' \
  --output chapter-01.mp3 && echo "Generated: chapter-01.mp3 ($(du -sh chapter-01.mp3 | cut -f1))"

Expected output:

Generated: chapter-01.mp3 (2.1M)

Chatterbox — best for voice cloning

The full pipeline: Open Notebook + Pandoc + Kokoro

Here is the complete three-tool workflow for a 40-chapter AI course:

vault/courses/ai-agents-course/
├── 01-intro.md
├── 02-tool-use.md
...
└── 40-capstone.md

Ingest all chapters into Open Notebook:

NOTEBOOK_ID="your-notebook-uuid"
for md in vault/courses/ai-agents-course/*.md; do
  curl -s -X POST http://localhost:5000/api/sources \
    -H "Content-Type: application/json" \
    -d "{\"notebook_id\": \"$NOTEBOOK_ID\", \"content_type\": \"text\", \"file_path\": \"$md\"}" \
    | jq -r '.id'
done

Generate PDFs:

mkdir -p dist/pdf
for md in vault/courses/ai-agents-course/*.md; do
  base=$(basename "$md" .md)
  pandoc "$md" -o "dist/pdf/${base}.pdf" --pdf-engine=typst
done

Generate audio files:

mkdir -p dist/audio
for md in vault/courses/ai-agents-course/*.md; do
  base=$(basename "$md" .md)
  text=$(cat "$md")
  curl -s -X POST http://localhost:8880/v1/audio/speech \
    -H "Content-Type: application/json" \
    -d "{\"model\": \"kokoro\", \"input\": $(echo "$text" | jq -Rs .), \"voice\": \"af_heart\", \"response_format\": \"mp3\"}" \
    --output "dist/audio/${base}.mp3"
done

All three loops run headlessly on a cron schedule. NotebookLM cannot replicate any step programmatically.

Cost comparison at pipeline scale (original data)

The cost structure diverges sharply once you move beyond one-off generation. The following combines published benchmark figures with publicly available GPU pricing:

Scenario	Monthly cost (160 chapters/mo)	Cost per audio chapter	Automatable?
NotebookLM free tier	$0	Hard-blocked after 3/day	No
NotebookLM Plus ($19.99/mo)	$19.99	~$0.125 (manual, 1 at a time)	No
Open Notebook + Kokoro on T4 GPU (cloud)	~$3–5 batch GPU hours	~$0.003	Yes
Open Notebook + Kokoro on own hardware	$0 variable	$0	Yes
Open Notebook + Chatterbox (GPU)	~$8–12/mo	~$0.006	Yes

When to still use NotebookLM

Self-hosted stacks carry real operational overhead: Docker configuration, model setup, SurrealDB persistence, and occasional debugging. NotebookLM remains the better choice when:

The output needs Google Studio polish: Cinematic Video Overviews, infographic styles with ten visual variants, or PPTX export
The audience is in Google Workspace Education and Classroom integration is a requirement
A non-technical team member produces the content without CLI access
Polished source-grounded citation overlays on the final artifact matter more than automation

Knowledge check1 of 1

Which Open Notebook REST endpoint would you POST to in order to programmatically generate a podcast episode from ingested notebook sources?

Self-Hosted NotebookLM Alternative in 2026: Markdown to Chapter PDF + Audio at Scale

What Open Notebook gives you that NotebookLM can't

Step 1: Converting Markdown chapters to PDF with Pandoc + Typst

Step 2: Generating audio at scale with Kokoro or Chatterbox

Kokoro — best for high-volume batch

Chatterbox — best for voice cloning

The full pipeline: Open Notebook + Pandoc + Kokoro

Cost comparison at pipeline scale (original data)

When to still use NotebookLM

References

Vercel AI SDK vs OpenAI Agents SDK: Which to Use in Production (2026)

Self-Hosted NotebookLM Alternative in 2026: Markdown to Chapter PDF + Audio at Scale

What Open Notebook gives you that NotebookLM can't

Step 1: Converting Markdown chapters to PDF with Pandoc + Typst

Step 2: Generating audio at scale with Kokoro or Chatterbox

Kokoro — best for high-volume batch

Chatterbox — best for voice cloning

The full pipeline: Open Notebook + Pandoc + Kokoro

Cost comparison at pipeline scale (original data)

When to still use NotebookLM

References

Vercel AI SDK vs OpenAI Agents SDK: Which to Use in Production (2026)

Self-Hosted NotebookLM Alternative in 2026: Markdown to Chapter PDF + Audio at Scale

What Open Notebook gives you that NotebookLM can't

Step 1: Converting Markdown chapters to PDF with Pandoc + Typst

Step 2: Generating audio at scale with Kokoro or Chatterbox

Kokoro — best for high-volume batch

Chatterbox — best for voice cloning

The full pipeline: Open Notebook + Pandoc + Kokoro

Cost comparison at pipeline scale (original data)

When to still use NotebookLM

References

Related from the academy

Vercel AI SDK vs OpenAI Agents SDK: Which to Use in Production (2026)

Self-Hosted NotebookLM Alternative in 2026: Markdown to Chapter PDF + Audio at Scale

What Open Notebook gives you that NotebookLM can't

Step 1: Converting Markdown chapters to PDF with Pandoc + Typst

Step 2: Generating audio at scale with Kokoro or Chatterbox

Kokoro — best for high-volume batch

Chatterbox — best for voice cloning

The full pipeline: Open Notebook + Pandoc + Kokoro

Cost comparison at pipeline scale (original data)

When to still use NotebookLM

References

Related from the academy

Vercel AI SDK vs OpenAI Agents SDK: Which to Use in Production (2026)