Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Select an option

  • Save michellebonat/764d3fc28ee9997d771c7d92c6ca422a to your computer and use it in GitHub Desktop.

Select an option

Save michellebonat/764d3fc28ee9997d771c7d92c6ca422a to your computer and use it in GitHub Desktop.
Lab-research_paper_to_podcast_crew
Display the source blob
Display the rendered blob
Raw
{
"nbformat": 4,
"nbformat_minor": 0,
"metadata": {
"colab": {
"provenance": [],
"include_colab_link": true
},
"kernelspec": {
"name": "python3",
"display_name": "Python 3"
},
"language_info": {
"name": "python"
}
},
"cells": [
{
"cell_type": "markdown",
"metadata": {
"id": "view-in-github",
"colab_type": "text"
},
"source": [
"<a href=\"https://colab.research.google.com/gist/michellebonat/764d3fc28ee9997d771c7d92c6ca422a/lab-research_paper_to_podcast_crew.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
]
},
{
"cell_type": "markdown",
"source": [
"# Research Paper to Podcast Generator\n",
"\n",
"This notebook converts research papers into engaging podcast conversations using AI Agents. Follow the steps below to generate your podcast! This is from a lab created by Tony Kipkemboi.\n",
"\n",
"🔗 Repo Link: https://github.com/tonykipkemboi/research-paper-to-podcast\n",
"\n",
"---\n",
"\n",
"![image.png]()\n"
],
"metadata": {
"id": "5an32pzwMxyt"
}
},
{
"cell_type": "markdown",
"source": [
"## Setup\n",
"First, we'll install the required dependencies and set up our environment."
],
"metadata": {
"id": "kcm-4VzlM30F"
}
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "_2lD8iEWHqpF",
"collapsed": true,
"colab": {
"base_uri": "https://localhost:8080/"
},
"outputId": "bd483bb8-1d5a-4280-abbc-37cad1953012"
},
"outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m42.5/42.5 kB\u001b[0m \u001b[31m2.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m67.3/67.3 kB\u001b[0m \u001b[31m4.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[?25h Installing build dependencies ... \u001b[?25l\u001b[?25hdone\n",
" Getting requirements to build wheel ... \u001b[?25l\u001b[?25hdone\n",
" Preparing metadata (pyproject.toml) ... \u001b[?25l\u001b[?25hdone\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m48.2/48.2 kB\u001b[0m \u001b[31m2.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m235.0/235.0 kB\u001b[0m \u001b[31m8.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m6.7/6.7 MB\u001b[0m \u001b[31m19.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m542.9/542.9 kB\u001b[0m \u001b[31m24.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m264.7/264.7 kB\u001b[0m \u001b[31m17.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m134.0/134.0 kB\u001b[0m \u001b[31m8.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.4/2.4 MB\u001b[0m \u001b[31m60.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m147.8/147.8 kB\u001b[0m \u001b[31m11.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m211.4/211.4 kB\u001b[0m \u001b[31m14.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m628.3/628.3 kB\u001b[0m \u001b[31m28.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m76.4/76.4 kB\u001b[0m \u001b[31m5.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m71.4/71.4 kB\u001b[0m \u001b[31m5.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m32.2/32.2 MB\u001b[0m \u001b[31m29.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m38.3/38.3 MB\u001b[0m \u001b[31m17.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m65.0/65.0 kB\u001b[0m \u001b[31m4.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m55.9/55.9 kB\u001b[0m \u001b[31m3.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m118.7/118.7 kB\u001b[0m \u001b[31m7.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m177.4/177.4 kB\u001b[0m \u001b[31m11.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m59.5/59.5 kB\u001b[0m \u001b[31m3.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m5.6/5.6 MB\u001b[0m \u001b[31m94.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m5.6/5.6 MB\u001b[0m \u001b[31m76.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m57.6/57.6 kB\u001b[0m \u001b[31m3.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m756.0/756.0 kB\u001b[0m \u001b[31m31.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m236.0/236.0 kB\u001b[0m \u001b[31m15.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m16.2/16.2 MB\u001b[0m \u001b[31m55.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m233.6/233.6 kB\u001b[0m \u001b[31m15.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m278.6/278.6 kB\u001b[0m \u001b[31m19.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m252.5/252.5 kB\u001b[0m \u001b[31m17.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m94.8/94.8 kB\u001b[0m \u001b[31m6.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m131.6/131.6 kB\u001b[0m \u001b[31m9.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.0/2.0 MB\u001b[0m \u001b[31m51.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m45.1/45.1 kB\u001b[0m \u001b[31m2.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.5/2.5 MB\u001b[0m \u001b[31m39.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m50.9/50.9 kB\u001b[0m \u001b[31m3.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m311.8/311.8 kB\u001b[0m \u001b[31m19.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m91.3/91.3 kB\u001b[0m \u001b[31m5.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m101.6/101.6 kB\u001b[0m \u001b[31m6.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m13.3/13.3 MB\u001b[0m \u001b[31m46.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m72.0/72.0 kB\u001b[0m \u001b[31m4.6 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m298.7/298.7 kB\u001b[0m \u001b[31m18.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.9/2.9 MB\u001b[0m \u001b[31m54.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m71.1/71.1 kB\u001b[0m \u001b[31m5.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.1/1.1 MB\u001b[0m \u001b[31m35.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m3.0/3.0 MB\u001b[0m \u001b[31m33.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m62.3/62.3 kB\u001b[0m \u001b[31m3.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m3.3/3.3 MB\u001b[0m \u001b[31m42.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m459.8/459.8 kB\u001b[0m \u001b[31m24.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.6/1.6 MB\u001b[0m \u001b[31m39.5 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m209.2/209.2 kB\u001b[0m \u001b[31m14.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m319.7/319.7 kB\u001b[0m \u001b[31m19.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m306.6/306.6 kB\u001b[0m \u001b[31m18.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m71.5/71.5 kB\u001b[0m \u001b[31m4.9 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m4.0/4.0 MB\u001b[0m \u001b[31m34.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m452.6/452.6 kB\u001b[0m \u001b[31m22.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m46.0/46.0 kB\u001b[0m \u001b[31m3.2 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m78.5/78.5 kB\u001b[0m \u001b[31m5.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m2.5/2.5 MB\u001b[0m \u001b[31m41.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m86.8/86.8 kB\u001b[0m \u001b[31m6.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m50.9/50.9 kB\u001b[0m \u001b[31m2.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m61.0/61.0 kB\u001b[0m \u001b[31m4.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[?25h Building wheel for pypika (pyproject.toml) ... \u001b[?25l\u001b[?25hdone\n",
"\u001b[31mERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.\n",
"transformers 4.47.1 requires tokenizers<0.22,>=0.21, but you have tokenizers 0.20.3 which is incompatible.\u001b[0m\u001b[31m\n",
"\u001b[0m"
]
}
],
"source": [
"%pip install --quiet crewai crewai-tools elevenlabs python-dotenv pydub pydantic"
]
},
{
"cell_type": "code",
"source": [
"# # Mount Google Drive to access your PDF files (optional)\n",
"# from google.colab import drive\n",
"# drive.mount('/content/drive')"
],
"metadata": {
"id": "gTbNGtIuKx7x"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Environment Variables\n",
"You'll need to set up your API keys.\n",
"1. Create them at:\n",
" - ElevenLabs: https://elevenlabs.io\n",
" - Serper Dev: https://serper.dev\n",
" - OpenAI: https://platform.openai.com\n",
" - Anthropic: https://www.anthropic.com\n",
" - Groq: https://console.groq.com/keys\n",
" - Cerebras: https://cloud.cerebras.ai/platform/\n",
"2. Click the \"🔑\" icon in the left sidebar\n",
"3. Add these secrets:\n",
" - OPENAI_API_KEY \n",
" - ELEVENLABS_API_KEY\n",
" - ANTHROPIC_API_KEY\n",
" - SERPER_API_KEY\n",
" - BEN_VOICE_ID (from ElevenLabs)\n",
" - CLAUDIA_VOICE_ID (from ElevenLabs)"
],
"metadata": {
"id": "d5Pt0gDBPqcN"
}
},
{
"cell_type": "code",
"source": [
"import os\n",
"from google.colab import userdata\n",
"\n",
"# Set your API keys here\n",
"os.environ['OPENAI_API_KEY'] = userdata.get('OPENAI_API_KEY')\n",
"os.environ['ELEVENLABS_API_KEY'] = userdata.get('ELEVENLABS_API_KEY')\n",
"os.environ['ANTHROPIC_API_KEY'] = userdata.get('ANTHROPIC_API_KEY')\n",
"os.environ['CEREBRAS_API_KEY'] = userdata.get('CEREBRAS_API_KEY')\n",
"os.environ['SERPER_API_KEY'] = userdata.get('SERPER_API_KEY')\n",
"\n",
"# Voice IDs from ElevenLabs\n",
"os.environ['BEN_VOICE_ID'] = userdata.get('BEN_VOICE_ID')\n",
"os.environ['CLAUDIA_VOICE_ID'] = userdata.get('CLAUDIA_VOICE_ID')"
],
"metadata": {
"id": "7WLi9KCZPuwR"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Tools for our Agents to use"
],
"metadata": {
"id": "m93SKh19dloY"
}
},
{
"cell_type": "code",
"source": [
"import os\n",
"from typing import Dict, List, Optional, Any, Type\n",
"from datetime import datetime\n",
"from pydub import AudioSegment\n",
"from crewai.tools import BaseTool\n",
"from pydantic import Field, BaseModel, ConfigDict\n",
"from elevenlabs.client import ElevenLabs"
],
"metadata": {
"id": "1epWHaePdh8q"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "code",
"source": [
"class VoiceConfig(BaseModel):\n",
" \"\"\"Voice configuration settings.\"\"\"\n",
" stability: float = 0.45 # Slightly lower for more natural variation\n",
" similarity_boost: float = 0.85 # Higher to maintain consistent voice character\n",
" style: float = 0.65 # Balanced expressiveness\n",
" use_speaker_boost: bool = True\n",
" model_id: str = \"eleven_multilingual_v2\"\n",
" output_format: str = \"mp3_44100_128\"\n",
" apply_text_normalization: str = \"auto\" # 'auto', 'on', or 'off'\n",
"\n",
"class AudioConfig(BaseModel):\n",
" \"\"\"Audio processing configuration.\"\"\"\n",
" format: str = \"mp3\"\n",
" sample_rate: int = 48000 # Higher for better quality\n",
" channels: int = 2\n",
" bitrate: str = \"256k\" # Higher bitrate for clearer audio\n",
" normalize: bool = True # Normalize audio levels\n",
" target_loudness: float = -14.0 # Standard podcast loudness (LUFS)\n",
" compression_ratio: float = 2.0 # Light compression for voice\n",
"\n",
"class Dialogue(BaseModel):\n",
" \"\"\"Dialogue for the podcast audio generation tool.\"\"\"\n",
" speaker: str\n",
" text: str\n",
"\n",
"class PodcastAudioGeneratorInput(BaseModel):\n",
" \"\"\"Input for the podcast audio generation tool.\"\"\"\n",
" dialogue: List[Dialogue]"
],
"metadata": {
"id": "OjLPlE0weAP7"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "code",
"source": [
"class PodcastAudioGenerator(BaseTool):\n",
" \"\"\"Enhanced podcast audio generation tool.\"\"\"\n",
"\n",
" name: str = \"PodcastAudioGenerator\"\n",
" description: str = \"Synthesizes podcast voices using ElevenLabs API.\"\n",
"\n",
" model_config = ConfigDict(arbitrary_types_allowed=True)\n",
"\n",
" api_key: str = Field(default_factory=lambda: os.getenv(\"ELEVENLABS_API_KEY\"))\n",
" voice_configs: Dict[str, Dict] = Field(default_factory=dict)\n",
" audio_config: AudioConfig = Field(default_factory=AudioConfig)\n",
" output_dir: str = Field(default=\"output/audio-files\")\n",
" client: Any = Field(default=None)\n",
" args_schema: Type[BaseModel] = PodcastAudioGeneratorInput\n",
"\n",
" def __init__(self, **data):\n",
" super().__init__(**data)\n",
" if not self.api_key:\n",
" raise ValueError(\"ELEVENLABS_API_KEY environment variable not set\")\n",
" self.client = ElevenLabs(api_key=self.api_key)\n",
"\n",
" def add_voice(self, name: str, voice_id: str, config: Optional[VoiceConfig] = None) -> None:\n",
" \"\"\"Add a voice configuration.\"\"\"\n",
" self.voice_configs[name] = {\n",
" \"voice_id\": voice_id,\n",
" \"config\": config or VoiceConfig()\n",
" }\n",
"\n",
" def _run(self, dialogue: List[Dialogue]) -> List[str]:\n",
" \"\"\"Generate audio files for each script segment.\"\"\"\n",
" os.makedirs(self.output_dir, exist_ok=True)\n",
"\n",
" audio_files = []\n",
" for index, segment in enumerate(dialogue):\n",
" speaker = segment.get('speaker', '').strip()\n",
" text = segment.get('text', '').strip()\n",
"\n",
" if not speaker or not text:\n",
" print(f\"Skipping segment {index}: missing speaker or text\")\n",
" continue\n",
"\n",
" voice_config = self.voice_configs.get(speaker)\n",
" if not voice_config:\n",
" print(f\"Skipping unknown speaker: {speaker}\")\n",
" continue\n",
"\n",
" try:\n",
" audio_generator = self.client.text_to_speech.convert(\n",
" text=text,\n",
" voice_id=voice_config[\"voice_id\"],\n",
" model_id=voice_config['config'].model_id,\n",
" output_format=voice_config['config'].output_format,\n",
" voice_settings={\n",
" \"stability\": voice_config['config'].stability,\n",
" \"similarity_boost\": voice_config['config'].similarity_boost,\n",
" \"style\": voice_config['config'].style,\n",
" \"use_speaker_boost\": voice_config['config'].use_speaker_boost\n",
" }\n",
" )\n",
"\n",
" # Convert generator to bytes\n",
" audio_bytes = b''.join(chunk for chunk in audio_generator)\n",
"\n",
" filename = f\"{self.output_dir}/{index:03d}_{speaker}.{self.audio_config.format}\"\n",
" with open(filename, \"wb\") as out:\n",
" out.write(audio_bytes)\n",
"\n",
" # Basic audio normalization\n",
" if self.audio_config.normalize:\n",
" audio = AudioSegment.from_file(filename)\n",
" normalized = audio.normalize() # Simple normalization\n",
" normalized = normalized + 4 # Slight boost\n",
"\n",
" # Use context manager to ensure file is closed\n",
" with normalized.export(\n",
" filename,\n",
" format=self.audio_config.format,\n",
" bitrate=self.audio_config.bitrate,\n",
" parameters=[\"-ar\", str(self.audio_config.sample_rate)]\n",
" ) as f:\n",
" f.close()\n",
"\n",
" audio_files.append(filename)\n",
" print(f'Audio content written to file \"{filename}\"')\n",
"\n",
" except Exception as e:\n",
" print(f\"Error processing segment {index}: {str(e)}\")\n",
" continue\n",
"\n",
" return sorted(audio_files)"
],
"metadata": {
"id": "Aq-KD7TUeGam"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "code",
"source": [
"class PodcastMixer(BaseTool):\n",
" \"\"\"Enhanced audio mixing tool for podcast production.\"\"\"\n",
"\n",
" name: str = \"PodcastMixer\"\n",
" description: str = \"Mixes multiple audio files with effects into final podcast.\"\n",
"\n",
" audio_config: AudioConfig = Field(default_factory=AudioConfig)\n",
" output_dir: str = Field(default=\"output/podcast\")\n",
"\n",
" def _run(\n",
" self,\n",
" audio_files: List[str],\n",
" crossfade: int = 50\n",
" ) -> str:\n",
" if not audio_files:\n",
" raise ValueError(\"No audio files provided to mix\")\n",
"\n",
" try:\n",
" # Create output directory if it doesn't exist\n",
" os.makedirs(self.output_dir, exist_ok=True)\n",
"\n",
" mixed = AudioSegment.from_file(audio_files[0])\n",
" for audio_file in audio_files[1:]:\n",
" next_segment = AudioSegment.from_file(audio_file)\n",
" # Add silence and use crossfade\n",
" silence = AudioSegment.silent(duration=200)\n",
" next_segment = silence + next_segment\n",
" mixed = mixed.append(next_segment, crossfade=crossfade)\n",
"\n",
" # Simplified output path handling\n",
" output_file = os.path.join(self.output_dir, \"podcast_final.mp3\")\n",
"\n",
" mixed.export(\n",
" output_file,\n",
" format=\"mp3\",\n",
" parameters=[\n",
" \"-q:a\", \"0\", # Highest quality\n",
" \"-ar\", \"48000\" # Professional sample rate\n",
" ]\n",
" )\n",
"\n",
" print(f\"Successfully mixed podcast to: {output_file}\")\n",
" return output_file\n",
"\n",
" except Exception as e:\n",
" print(f\"Error mixing podcast: {str(e)}\")\n",
" return \"\""
],
"metadata": {
"id": "nyEtQFUreo-T"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Setup Output Directory Structure"
],
"metadata": {
"id": "qJMOSI3Qe_EL"
}
},
{
"cell_type": "code",
"source": [
"def setup_directories():\n",
" \"\"\"Set up organized directory structure\"\"\"\n",
" timestamp = datetime.now().strftime('%Y%m%d_%H%M%S')\n",
"\n",
" dirs = {\n",
" 'BASE': f'outputs/{timestamp}',\n",
" 'SEGMENTS': f'outputs/{timestamp}/segments',\n",
" 'FINAL': f'outputs/{timestamp}/podcast',\n",
" 'DATA': f'outputs/{timestamp}/data'\n",
" }\n",
"\n",
" for directory in dirs.values():\n",
" os.makedirs(directory, exist_ok=True)\n",
"\n",
" return dirs"
],
"metadata": {
"id": "VxL_XEP-fDwE"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Upload your PDF\n",
"\n",
"Use this cell to upload your research paper PDF or any PDF.\n",
"\n"
],
"metadata": {
"id": "L1zTizyhfKs7"
}
},
{
"cell_type": "code",
"source": [
"import os\n",
"import shutil\n",
"from google.colab import files\n",
"\n",
"# Create the 'knowledge' folder if it doesn't exist\n",
"if not os.path.exists('knowledge'):\n",
" os.makedirs('knowledge')\n",
"\n",
"# Upload the PDF file\n",
"uploaded = files.upload()\n",
"pdf_filename = list(uploaded.keys())[0]\n",
"\n",
"# Move the uploaded file to the 'knowledge' folder\n",
"shutil.move(pdf_filename, os.path.join('knowledge', pdf_filename))"
],
"metadata": {
"id": "OPx0aE8GfRS1",
"colab": {
"base_uri": "https://localhost:8080/",
"height": 91
},
"outputId": "cae3fc8d-7d0e-4904-95fa-11890c0b8e41"
},
"execution_count": null,
"outputs": [
{
"output_type": "display_data",
"data": {
"text/plain": [
"<IPython.core.display.HTML object>"
],
"text/html": [
"\n",
" <input type=\"file\" id=\"files-a5cd2adc-ecbc-4bf0-a983-9a49420f7a9a\" name=\"files[]\" multiple disabled\n",
" style=\"border:none\" />\n",
" <output id=\"result-a5cd2adc-ecbc-4bf0-a983-9a49420f7a9a\">\n",
" Upload widget is only available when the cell has been executed in the\n",
" current browser session. Please rerun this cell to enable.\n",
" </output>\n",
" <script>// Copyright 2017 Google LLC\n",
"//\n",
"// Licensed under the Apache License, Version 2.0 (the \"License\");\n",
"// you may not use this file except in compliance with the License.\n",
"// You may obtain a copy of the License at\n",
"//\n",
"// http://www.apache.org/licenses/LICENSE-2.0\n",
"//\n",
"// Unless required by applicable law or agreed to in writing, software\n",
"// distributed under the License is distributed on an \"AS IS\" BASIS,\n",
"// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.\n",
"// See the License for the specific language governing permissions and\n",
"// limitations under the License.\n",
"\n",
"/**\n",
" * @fileoverview Helpers for google.colab Python module.\n",
" */\n",
"(function(scope) {\n",
"function span(text, styleAttributes = {}) {\n",
" const element = document.createElement('span');\n",
" element.textContent = text;\n",
" for (const key of Object.keys(styleAttributes)) {\n",
" element.style[key] = styleAttributes[key];\n",
" }\n",
" return element;\n",
"}\n",
"\n",
"// Max number of bytes which will be uploaded at a time.\n",
"const MAX_PAYLOAD_SIZE = 100 * 1024;\n",
"\n",
"function _uploadFiles(inputId, outputId) {\n",
" const steps = uploadFilesStep(inputId, outputId);\n",
" const outputElement = document.getElementById(outputId);\n",
" // Cache steps on the outputElement to make it available for the next call\n",
" // to uploadFilesContinue from Python.\n",
" outputElement.steps = steps;\n",
"\n",
" return _uploadFilesContinue(outputId);\n",
"}\n",
"\n",
"// This is roughly an async generator (not supported in the browser yet),\n",
"// where there are multiple asynchronous steps and the Python side is going\n",
"// to poll for completion of each step.\n",
"// This uses a Promise to block the python side on completion of each step,\n",
"// then passes the result of the previous step as the input to the next step.\n",
"function _uploadFilesContinue(outputId) {\n",
" const outputElement = document.getElementById(outputId);\n",
" const steps = outputElement.steps;\n",
"\n",
" const next = steps.next(outputElement.lastPromiseValue);\n",
" return Promise.resolve(next.value.promise).then((value) => {\n",
" // Cache the last promise value to make it available to the next\n",
" // step of the generator.\n",
" outputElement.lastPromiseValue = value;\n",
" return next.value.response;\n",
" });\n",
"}\n",
"\n",
"/**\n",
" * Generator function which is called between each async step of the upload\n",
" * process.\n",
" * @param {string} inputId Element ID of the input file picker element.\n",
" * @param {string} outputId Element ID of the output display.\n",
" * @return {!Iterable<!Object>} Iterable of next steps.\n",
" */\n",
"function* uploadFilesStep(inputId, outputId) {\n",
" const inputElement = document.getElementById(inputId);\n",
" inputElement.disabled = false;\n",
"\n",
" const outputElement = document.getElementById(outputId);\n",
" outputElement.innerHTML = '';\n",
"\n",
" const pickedPromise = new Promise((resolve) => {\n",
" inputElement.addEventListener('change', (e) => {\n",
" resolve(e.target.files);\n",
" });\n",
" });\n",
"\n",
" const cancel = document.createElement('button');\n",
" inputElement.parentElement.appendChild(cancel);\n",
" cancel.textContent = 'Cancel upload';\n",
" const cancelPromise = new Promise((resolve) => {\n",
" cancel.onclick = () => {\n",
" resolve(null);\n",
" };\n",
" });\n",
"\n",
" // Wait for the user to pick the files.\n",
" const files = yield {\n",
" promise: Promise.race([pickedPromise, cancelPromise]),\n",
" response: {\n",
" action: 'starting',\n",
" }\n",
" };\n",
"\n",
" cancel.remove();\n",
"\n",
" // Disable the input element since further picks are not allowed.\n",
" inputElement.disabled = true;\n",
"\n",
" if (!files) {\n",
" return {\n",
" response: {\n",
" action: 'complete',\n",
" }\n",
" };\n",
" }\n",
"\n",
" for (const file of files) {\n",
" const li = document.createElement('li');\n",
" li.append(span(file.name, {fontWeight: 'bold'}));\n",
" li.append(span(\n",
" `(${file.type || 'n/a'}) - ${file.size} bytes, ` +\n",
" `last modified: ${\n",
" file.lastModifiedDate ? file.lastModifiedDate.toLocaleDateString() :\n",
" 'n/a'} - `));\n",
" const percent = span('0% done');\n",
" li.appendChild(percent);\n",
"\n",
" outputElement.appendChild(li);\n",
"\n",
" const fileDataPromise = new Promise((resolve) => {\n",
" const reader = new FileReader();\n",
" reader.onload = (e) => {\n",
" resolve(e.target.result);\n",
" };\n",
" reader.readAsArrayBuffer(file);\n",
" });\n",
" // Wait for the data to be ready.\n",
" let fileData = yield {\n",
" promise: fileDataPromise,\n",
" response: {\n",
" action: 'continue',\n",
" }\n",
" };\n",
"\n",
" // Use a chunked sending to avoid message size limits. See b/62115660.\n",
" let position = 0;\n",
" do {\n",
" const length = Math.min(fileData.byteLength - position, MAX_PAYLOAD_SIZE);\n",
" const chunk = new Uint8Array(fileData, position, length);\n",
" position += length;\n",
"\n",
" const base64 = btoa(String.fromCharCode.apply(null, chunk));\n",
" yield {\n",
" response: {\n",
" action: 'append',\n",
" file: file.name,\n",
" data: base64,\n",
" },\n",
" };\n",
"\n",
" let percentDone = fileData.byteLength === 0 ?\n",
" 100 :\n",
" Math.round((position / fileData.byteLength) * 100);\n",
" percent.textContent = `${percentDone}% done`;\n",
"\n",
" } while (position < fileData.byteLength);\n",
" }\n",
"\n",
" // All done.\n",
" yield {\n",
" response: {\n",
" action: 'complete',\n",
" }\n",
" };\n",
"}\n",
"\n",
"scope.google = scope.google || {};\n",
"scope.google.colab = scope.google.colab || {};\n",
"scope.google.colab._files = {\n",
" _uploadFiles,\n",
" _uploadFilesContinue,\n",
"};\n",
"})(self);\n",
"</script> "
]
},
"metadata": {}
},
{
"output_type": "stream",
"name": "stdout",
"text": [
"Saving orioles.pdf to orioles.pdf\n"
]
},
{
"output_type": "execute_result",
"data": {
"text/plain": [
"'knowledge/orioles.pdf'"
],
"application/vnd.google.colaboratory.intrinsic+json": {
"type": "string"
}
},
"metadata": {},
"execution_count": 10
}
]
},
{
"cell_type": "markdown",
"source": [
"## Setup Agents & Tasks\n"
],
"metadata": {
"id": "btCjkrG6fn9m"
}
},
{
"cell_type": "code",
"source": [
"from crewai import Agent, Task, Crew, Process, LLM\n",
"from crewai.knowledge.source.pdf_knowledge_source import PDFKnowledgeSource\n",
"from crewai_tools import SerperDevTool\n",
"from pydantic import BaseModel, Field\n",
"from typing import List\n",
"from datetime import datetime"
],
"metadata": {
"id": "sJ23G3dugECe"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "code",
"source": [
"!crewai reset-memories --knowledge"
],
"metadata": {
"id": "4L7IGU6AXmjt",
"colab": {
"base_uri": "https://localhost:8080/"
},
"outputId": "4dcd8cba-002d-45fe-c65a-efd687b63491"
},
"execution_count": null,
"outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"Knowledge has been reset.\n"
]
}
]
},
{
"cell_type": "code",
"source": [
"# Pass only the filename to PDFKnowledgeSource\n",
"research_paper = PDFKnowledgeSource(file_paths=pdf_filename)"
],
"metadata": {
"id": "Eqglxu-0ftD7"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "code",
"source": [
"# --- Pydantic Model definitions ---\n",
"class PaperSummary(BaseModel):\n",
" \"\"\"Summary of a research paper.\"\"\"\n",
" title: str = Field(..., description=\"Title of the research paper\")\n",
" main_findings: List[str] = Field(..., description=\"Key findings as a list of strings\")\n",
" methodology: str = Field(..., description=\"Research methodology as a single text block\")\n",
" key_implications: List[str] = Field(..., description=\"Implications as a list of strings\")\n",
" limitations: List[str] = Field(..., description=\"Limitations as a list of strings\")\n",
" future_work: List[str] = Field(..., description=\"Future research directions as a list\")\n",
" summary_date: datetime = Field(..., description=\"Timestamp of summary creation\")\n",
"\n",
"class DialogueLine(BaseModel):\n",
" \"\"\"Dialogue line for a podcast script.\"\"\"\n",
" speaker: str = Field(..., description=\"Name of the speaker (Julia or Guido)\")\n",
" text: str = Field(..., description=\"The actual dialogue line\")\n",
"\n",
"class PodcastScript(BaseModel):\n",
" \"\"\"Podcast script with dialogue lines.\"\"\"\n",
" dialogue: List[DialogueLine] = Field(..., description=\"Ordered list of dialogue lines\")\n",
"\n",
"class AudioGeneration(BaseModel):\n",
" \"\"\"Audio generation result with metadata.\"\"\"\n",
" segment_files: List[str] = Field(..., description=\"List of generated audio segment files\")\n",
" final_podcast: str = Field(..., description=\"Path to the final mixed podcast file\")"
],
"metadata": {
"id": "UtkkIEFigbmS"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Configure all the LLMs Agents will Use"
],
"metadata": {
"id": "3Tb78Hulp7Li"
}
},
{
"cell_type": "code",
"source": [
"# --- LLM Setup ---\n",
"summary_llm = LLM(\n",
" model=\"openai/o3-mini\"\n",
")\n",
"\n",
"script_llm = LLM(\n",
" model=\"openai/o1\"\n",
")\n",
"\n",
"script_enhancer_llm = LLM(\n",
" model=\"openai/o3-mini\"\n",
")\n",
"\n",
"audio_llm = LLM(\n",
" model=\"openai/gpt-4o-mini\"\n",
")"
],
"metadata": {
"id": "2jLxskJxg3PO"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Setup ElevenLabs Podcast Hosts Voices"
],
"metadata": {
"id": "ZIBXg8PTpy2R"
}
},
{
"cell_type": "code",
"source": [
"# Create and configure tools\n",
"dirs = setup_directories()\n",
"audio_generator = PodcastAudioGenerator(output_dir=dirs['SEGMENTS'])\n",
"\n",
"# Julia: Enthusiastic expert\n",
"audio_generator.add_voice(\n",
" \"Julia\",\n",
" os.getenv(\"CLAUDIA_VOICE_ID\"),\n",
" VoiceConfig(\n",
" stability=0.35, # More variation for natural enthusiasm\n",
" similarity_boost=0.75, # Maintain voice consistency\n",
" style=0.65, # Good expressiveness without being over the top\n",
" use_speaker_boost=True\n",
" )\n",
")\n",
"\n",
"# Guido: Engaged and curious\n",
"audio_generator.add_voice(\n",
" \"Guido\",\n",
" os.getenv(\"BEN_VOICE_ID\"),\n",
" VoiceConfig(\n",
" stability=0.4, # Slightly more stable but still natural\n",
" similarity_boost=0.75,\n",
" style=0.6, # Balanced expressiveness\n",
" use_speaker_boost=True\n",
" )\n",
")\n",
"\n",
"podcast_mixer = PodcastMixer(output_dir=dirs['FINAL'])\n",
"search_tool = SerperDevTool()"
],
"metadata": {
"id": "8MmyaHM-g_ZK"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Agents"
],
"metadata": {
"id": "Hi4a1owjpecm"
}
},
{
"cell_type": "code",
"source": [
"# --- Agents ---\n",
"researcher = Agent(\n",
" role=\"Research Analyst\",\n",
" goal=\"Create comprehensive yet accessible research paper summaries\",\n",
" backstory=\"\"\"You're a PhD researcher with a talent for breaking down complex\n",
" academic papers into clear, understandable summaries. You excel at identifying\n",
" key findings and their real-world implications.\"\"\",\n",
" verbose=True,\n",
" llm=summary_llm\n",
")\n",
"\n",
"research_support = Agent(\n",
" role=\"Research Support Specialist\",\n",
" goal=\"Find current context and supporting materials relevant to the paper's topic\",\n",
" backstory=\"\"\"You're a versatile research assistant who excels at finding\n",
" supplementary information across academic fields. You have a talent for\n",
" connecting academic research with real-world applications, current events,\n",
" and practical examples, regardless of the field. You know how to find\n",
" credible sources and relevant discussions across various domains.\"\"\",\n",
" verbose=True,\n",
" tools=[search_tool],\n",
" llm=script_llm\n",
")\n",
"\n",
"script_writer = Agent(\n",
" role=\"Podcast Script Writer\",\n",
" goal=\"Create engaging and educational podcast scripts about technical topics\",\n",
" backstory=\"\"\"You're a skilled podcast writer who specializes in making technical\n",
" content engaging and accessible. You create natural dialogue between two hosts:\n",
" Julia (a knowledgeable expert who explains concepts clearly) and Guido (an informed\n",
" co-host who asks thoughtful questions and helps guide the discussion).\"\"\",\n",
" verbose=True,\n",
" llm=script_llm\n",
")\n",
"\n",
"script_enhancer = Agent(\n",
" role=\"Podcast Script Enhancer\",\n",
" goal=\"Enhance podcast scripts to be more engaging while maintaining educational value\",\n",
" backstory=\"\"\"You're a veteran podcast producer who specializes in making technical\n",
" content both entertaining and informative. You excel at adding natural humor,\n",
" relatable analogies, and engaging banter while ensuring the core technical content\n",
" remains accurate and valuable. You've worked on shows like Lex Fridman's podcast,\n",
" Hardcore History, and the Joe Rogan Experience, bringing their signature blend of\n",
" entertainment and education.\"\"\",\n",
" verbose=True,\n",
" llm=script_enhancer_llm\n",
")\n",
"\n",
"audio_generator_agent = Agent(\n",
" role=\"Audio Generation Specialist\",\n",
" goal=\"Generate high-quality podcast audio with natural-sounding voices\",\n",
" backstory=\"\"\"You are an expert in audio generation and processing. You understand\n",
" how to generate natural-sounding voices and create professional podcast audio. You\n",
" consider pacing, tone, and audio quality in your productions.\"\"\",\n",
" verbose=True,\n",
" allow_delegation=False,\n",
" tools=[audio_generator, podcast_mixer],\n",
" llm=audio_llm\n",
")"
],
"metadata": {
"id": "N1ZoGGxGgyeD"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Tasks"
],
"metadata": {
"id": "pAijXfb9piPg"
}
},
{
"cell_type": "code",
"source": [
"# --- Tasks ---\n",
"summary_task = Task(\n",
" description=\"\"\"Hey there, researcher! Your mission is to dive into the\n",
" research paper provided in {paper} and uncover its core insights.\n",
" As you create the summary, please:\n",
"\n",
" - Highlight the Big Ideas: What are the main findings and conclusions?\n",
" - Explain the Method: Break down the study’s methodology in everyday language.\n",
" - Discuss the Impact: What are the key implications for the field?\n",
" - Note the Caveats: Mention any limitations or uncertainties.\n",
" - Look Ahead: Offer some thoughts on future research directions.\n",
"\n",
" Keep your tone engaging and friendly so that an educated general audience can\n",
" easily follow along while staying true to the technical details.\"\"\",\n",
" expected_output=\"A clear, well-structured summary that covers all the critical aspects of the paper in an accessible and engaging manner.\",\n",
" agent=researcher,\n",
" output_pydantic=PaperSummary,\n",
" output_file=\"output/metadata/paper_summary.json\"\n",
")\n",
"\n",
"supporting_research_task = Task(\n",
" description=\"\"\"Alright, now that we have the paper summary, let’s add some\n",
" real-world flavor. Your task is to gather recent and credible supporting\n",
" materials that enrich the topic. Here’s how to proceed:\n",
"\n",
" - Spot the Themes: Identify the main ideas from the paper and see how they connect with broader trends.\n",
" - Current Developments: Find news, advancements, or case studies from the\n",
" last couple of years that bring these ideas to life.\n",
" - Explore Diverse Views: Look for expert opinions, debates, and alternative perspectives.\n",
" - Real-World Examples: Gather examples from industry reports, white papers, or professional discussions.\n",
"\n",
" Your goal is to build a collection of supporting insights that help listeners\n",
" understand how the research plays out in real life. Make sure your sources\n",
" are recent, reliable, and add that extra context.\"\"\",\n",
" expected_output=\"A curated collection of supporting materials and real-world examples that add context and depth to the research paper’s topic.\",\n",
" agent=research_support,\n",
" context=[summary_task],\n",
" output_file=\"output/metadata/supporting_research.json\"\n",
")\n",
"\n",
"podcast_task = Task(\n",
" description=\"\"\"Using the paper summary and supporting research, craft a podcast conversation between Julia and Guido that feels as natural and spontaneous as the dialogue in a favorite podcast (think Joe Rogan or Lex Fridman). Make sure to:\n",
"\n",
" Adopt a Conversational Tone:\n",
" - Use natural expressions and fillers (e.g., \"like,\" \"oh yeah,\" \"wait, really?\") to keep it casual.\n",
" - Include friendly interruptions, confirmations, and asides—just like real people chatting.\n",
"\n",
" Clearly Distinguish Sources:\n",
" - When referencing the paper, use phrases like \"According to the paper...\" or \"The study shows that...\"\n",
" - When adding extra insights, try phrases such as \"I read somewhere that...\" or \"This reminds me of a case study...\"\n",
"\n",
" Embrace the Hosts’ Personalities:\n",
" - Julia: The enthusiastic expert who explains technical details in an approachable\n",
" way, sometimes playfully challenging Guido.\n",
" - Guido: The curious, practical co-host who asks clarifying questions and\n",
" brings in relatable, real-world examples.\n",
"\n",
" Keep It Natural:\n",
" - Allow the conversation to include friendly debates, gentle interruptions,\n",
" and natural transitions (e.g., \"But getting back to…\" or \"That reminds me…\").\n",
" - Avoid explicit stage directions like *laughs*—let the dialogue convey emotion naturally.\n",
"\n",
" Example Flow:\n",
" Julia: \"Have you ever struggled to find a specific detail buried in endless PDFs?\"\n",
" Guido: \"Oh, absolutely. It’s like, you search for keywords, but sometimes the\n",
" real clue is in a chart or diagram, you know?\"\n",
" Julia: \"Exactly! And this paper dives into how AI can actually 'see' the layout of a page. It’s wild!\"\n",
" Guido: \"Right, it’s not just about text—it’s about understanding the whole picture.\n",
" And I read about a case study that really brings that home…\"\n",
"\n",
" Your script should clearly separate the paper’s findings from the broader\n",
" insights while engaging the listener with a relaxed, genuine conversation.\"\"\",\n",
" expected_output=\"A lively and natural podcast script that seamlessly blends technical insights with spontaneous, friendly banter between the two hosts.\",\n",
" agent=script_writer,\n",
" context=[summary_task, supporting_research_task],\n",
" output_pydantic=PodcastScript,\n",
" output_file=\"output/metadata/podcast_script.json\"\n",
")\n",
"\n",
"enhance_script_task = Task(\n",
" description=\"\"\"Now, take the initial podcast script and polish it further so\n",
" that it feels like a genuine, off-the-cuff conversation between Julia and Guido.\n",
" Your job is to:\n",
"\n",
" - Infuse Natural Reactions: Sprinkle in casual phrases like \"oh, really?\" or\n",
" \"wait, that’s interesting!\" to capture spontaneous moments.\n",
" - Smooth the Flow: Ensure the dialogue transitions smoothly between topics\n",
" with natural pauses and brief asides.\n",
" - Keep It Authentic: Maintain the distinct personalities of Julia (the friendly expert)\n",
" and Guido (the curious co-host), without adding any extra characters.\n",
" - Preserve Accuracy: While making the dialogue fun and engaging, all technical\n",
" details must remain correct.\n",
" - Inject Light Humor: When it fits naturally, add humorous or witty exchanges\n",
" to keep the conversation lively.\n",
"\n",
" Remember: Do not change the host names, and avoid explicit reaction markers\n",
" like *laughs*. Let the dialogue imply the emotions naturally.\n",
"\n",
" Your enhanced script should sound like you’re eavesdropping on a genuine,\n",
" spontaneous chat between two friends who are both passionate about the topic.\"\"\",\n",
" expected_output=\"An enhanced podcast script that feels natural, engaging, and authentic—balancing technical insights with a fun, friendly conversation.\",\n",
" agent=script_enhancer,\n",
" context=[summary_task, podcast_task],\n",
" output_pydantic=PodcastScript,\n",
" output_file=\"output/metadata/enhanced_podcast_script.json\",\n",
" human_input=True\n",
")\n",
"\n",
"audio_task = Task(\n",
" description=\"\"\"Generate high-quality audio for the podcast script and create the final podcast.\n",
"\n",
" The script will be provided in the context as a list of dialogue entries, each with:\n",
" - speaker: Either \"Julia\" or \"Guido\"\n",
" - text: The line to be spoken\n",
"\n",
" Process:\n",
" 1. Generate natural-sounding audio for each line of dialogue using appropriate voices\n",
" 2. Apply audio processing for professional quality:\n",
" - Normalize audio levels\n",
" - Add subtle fade effects between segments\n",
" - Apply appropriate pacing and pauses\n",
" 3. Mix all segments into a cohesive final podcast\n",
"\n",
" Voice Assignments:\n",
" - For Julia's lines: Use configured Julia voice\n",
" - For Guido's lines: Use configured Guido voice\n",
"\n",
" Quality Guidelines:\n",
" - Ensure consistent audio levels across all segments\n",
" - Maintain natural pacing and flow\n",
" - Create smooth transitions between speakers\n",
" - Verify audio clarity and quality\"\"\",\n",
" expected_output=\"A professional-quality podcast audio file with natural-sounding voices and smooth transitions\",\n",
" agent=audio_generator_agent,\n",
" context=[enhance_script_task],\n",
" output_pydantic=AudioGeneration,\n",
" output_file=\"output/metadata/audio_generation_meta.json\"\n",
")"
],
"metadata": {
"id": "UBZGNorjgqn4"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Put the Agent Crew Together"
],
"metadata": {
"id": "HWeHPUqgpmku"
}
},
{
"cell_type": "code",
"source": [
"# --- Crew and Process ---\n",
"crew = Crew(\n",
" agents=[researcher, research_support, script_writer, script_enhancer, audio_generator_agent],\n",
" tasks=[summary_task, supporting_research_task, podcast_task, enhance_script_task, audio_task],\n",
" process=Process.sequential,\n",
" knowledge_sources=[research_paper],\n",
" verbose=True,\n",
" memory=False\n",
")"
],
"metadata": {
"id": "3KKC18XTgj2m"
},
"execution_count": null,
"outputs": []
},
{
"cell_type": "markdown",
"source": [
"## Run"
],
"metadata": {
"id": "paEur1PfpsIL"
}
},
{
"cell_type": "code",
"source": [
"if __name__ == \"__main__\":\n",
" # Update task output files\n",
" summary_task.output_file = os.path.join(dirs['DATA'], \"paper_summary.json\")\n",
" supporting_research_task.output_file = os.path.join(dirs['DATA'], \"supporting_research.json\")\n",
" podcast_task.output_file = os.path.join(dirs['DATA'], \"podcast_script.json\")\n",
" enhance_script_task.output_file = os.path.join(dirs['DATA'], \"enhanced_podcast_script.json\")\n",
" audio_task.output_file = os.path.join(dirs['DATA'], \"audio_generation_meta.json\")\n",
"\n",
" # Run the podcast generation process\n",
" results = crew.kickoff()"
],
"metadata": {
"id": "t9BrCXpcgg4f",
"colab": {
"base_uri": "https://localhost:8080/"
},
"outputId": "b1cb4299-651c-421a-d4c5-091fdf684875"
},
"execution_count": null,
"outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mResearch Analyst\u001b[00m\n",
"\u001b[95m## Task:\u001b[00m \u001b[92mHey there, researcher! Your mission is to dive into the\n",
" research paper provided in {paper} and uncover its core insights.\n",
" As you create the summary, please:\n",
"\n",
" - Highlight the Big Ideas: What are the main findings and conclusions?\n",
" - Explain the Method: Break down the study’s methodology in everyday language.\n",
" - Discuss the Impact: What are the key implications for the field?\n",
" - Note the Caveats: Mention any limitations or uncertainties.\n",
" - Look Ahead: Offer some thoughts on future research directions.\n",
"\n",
" Keep your tone engaging and friendly so that an educated general audience can\n",
" easily follow along while staying true to the technical details.\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mResearch Analyst\u001b[00m\n",
"\u001b[95m## Final Answer:\u001b[00m \u001b[92m\n",
"{\n",
" \"title\": \"Incorporating Circuit Theory into a Dynamic Model for Crowd-Sourced Observations of Migratory Birds\",\n",
" \"main_findings\": [\n",
" \"Introduces a novel hidden Markov model that leverages circuit theory to structure state transitions in modeling bird counts.\",\n",
" \"Demonstrates that crowd-sourced eBird data, despite individual observation uncertainties, can map broad migratory patterns effectively.\",\n",
" \"When applied to Baltimore orioles and yellow-rumped warblers, the model reveals key insights into their spatial distribution and migration dynamics.\"\n",
" ],\n",
" \"methodology\": \"The study combines a hidden Markov model with a transition structure inspired by circuit theory to analyze bird count data from the eBird database. This approach uses the extensive but individually less reliable crowd-sourced observations to model changes in bird abundance across space and time, overcoming traditional limitations of telemetry and professional survey data by making use of a larger, more spatially comprehensive dataset.\",\n",
" \"key_implications\": [\n",
" \"Validates the use of citizen-science data for ecological and migration research, suggesting that large-scale observational datasets can complement traditional methods.\",\n",
" \"Offers a scalable and cost-effective alternative to intensive telemetry studies for tracking migratory patterns.\",\n",
" \"Provides a framework that may help improve conservation strategies through enhanced understanding of how environmental features influence migration routes.\"\n",
" ],\n",
" \"limitations\": [\n",
" \"Dependence on eBird data, which can have inherent biases and variability due to differences in observer expertise and spatial coverage.\",\n",
" \"The hidden Markov model and circuit theory integration, while innovative, rely on assumptions that may not capture all complexities of avian migration behavior.\",\n",
" \"Limited by potential temporal and spatial sampling gaps inherent in opportunistic citizen-science datasets.\"\n",
" ],\n",
" \"future_work\": [\n",
" \"Expand the framework to include additional species and longer time series to test its robustness and generalizability.\",\n",
" \"Integrate environmental covariates such as climate variables to better understand factors triggering migration.\",\n",
" \"Refine computational methods and explore alternative modeling frameworks to address potential biases in crowd-sourced data.\"\n",
" ],\n",
" \"summary_date\": \"2024-07-04\"\n",
"}\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mResearch Support Specialist\u001b[00m\n",
"\u001b[95m## Task:\u001b[00m \u001b[92mAlright, now that we have the paper summary, let’s add some\n",
" real-world flavor. Your task is to gather recent and credible supporting\n",
" materials that enrich the topic. Here’s how to proceed:\n",
"\n",
" - Spot the Themes: Identify the main ideas from the paper and see how they connect with broader trends.\n",
" - Current Developments: Find news, advancements, or case studies from the\n",
" last couple of years that bring these ideas to life.\n",
" - Explore Diverse Views: Look for expert opinions, debates, and alternative perspectives.\n",
" - Real-World Examples: Gather examples from industry reports, white papers, or professional discussions.\n",
"\n",
" Your goal is to build a collection of supporting insights that help listeners\n",
" understand how the research plays out in real life. Make sure your sources\n",
" are recent, reliable, and add that extra context.\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mResearch Support Specialist\u001b[00m\n",
"\u001b[95m## Using tool:\u001b[00m \u001b[92mSearch the internet with Serper\u001b[00m\n",
"\u001b[95m## Tool Input:\u001b[00m \u001b[92m\n",
"\"{\\\"search_query\\\": \\\"circuit theory AND eBird AND bird migration AND hidden Markov model (2022..2024) recent examples and case studies\\\"}\"\u001b[00m\n",
"\u001b[95m## Tool Output:\u001b[00m \u001b[92m\n",
"{'searchParameters': {'q': 'circuit theory AND eBird AND bird migration AND hidden Markov model (2022..2024) recent examples and case studies', 'type': 'search', 'num': 10, 'engine': 'google'}, 'organic': [{'title': '[PDF] Incorporating circuit theory into a dynamic model for crowd-sourced ...', 'link': 'https://arxiv.org/pdf/2407.02690', 'snippet': 'Within this article we utilized crowdsourced bird watching observations from the eBird database in order to model the latent abundance of ...', 'position': 1}, {'title': 'Incorporating circuit theory into a dynamic model for crowd-sourced ...', 'link': 'https://arxiv.org/abs/2407.02690', 'snippet': 'We introduce a hidden Markov model for observed bird counts utilizing a novel transition structure developed using principles from circuit theory.', 'position': 2}, {'title': '[PDF] Collective Inference on Markov Models for Modeling Bird Migration', 'link': 'https://www.cs.cornell.edu/kozen/Papers/birds.pdf', 'snippet': 'We investigate a family of inference problems on Markov models, where many sample paths are drawn from a Markov chain and partial information is revealed.', 'position': 3}, {'title': 'Publications - eBird Science', 'link': 'https://science.ebird.org/research-and-conservation/publications', 'snippet': 'Collective inference on Markov models for modeling bird migration. Advances in Neural Information Processing Systems. Thompson, S., and R. Bonney. 2007 ...', 'position': 4}, {'title': '[PDF] Extreme-value modelling of migratory bird arrival dates', 'link': 'https://rss.org.uk/RSS/media/File-library/Events/Discussion%20meetings/Koh_Opitz_RSS_2024-2308.pdf', 'snippet': 'We implement data fusion of two citizen-science datasets with fundamentally different protocols (BBS, eBird) and map posterior distributions of the latent ...', 'position': 5}, {'title': 'BirdFlow: Learning Seasonal Bird Movements from eBird Data', 'link': 'https://www.biorxiv.org/content/10.1101/2022.04.12.488057.full', 'snippet': 'We apply the model to 11 species of North American birds, using GPS and satellite tracking data to tune and evaluate model performance. We show ...', 'position': 6}, {'title': '2024 Year in Review: eBird, Merlin, Macaulay Library, and Birds of ...', 'link': 'https://ebird.org/news/2024-year-in-review', 'snippet': 'The eBird and Merlin apps feature a refreshed, unified design and new menu layouts to streamline navigation. The eBird app also offers a more ...', 'position': 7}, {'title': 'Agriculture and herbivorous waterfowl: A review of the scientific ...', 'link': 'https://www.researchgate.net/publication/297603856_Agriculture_and_herbivorous_waterfowl_A_review_of_the_scientific_basis_for_improved_management', 'snippet': '(2022) reported that 87% of bird species that caused conflicts were linked to crop damage. Avian utilization of agricultural crops has resulted in significant ...', 'position': 8}, {'title': 'Migration Dashboard - BirdCast', 'link': 'https://birdcast.info/migration-tools/migration-dashboard/', 'snippet': 'This tool depicts migration patterns in near real time or as a summary of a whole night after nocturnal periods end (2021-present).', 'position': 9}, {'title': 'Finance, Economics, and Industry for Sustainable Development', 'link': 'https://dokumen.pub/finance-economics-and-industry-for-sustainable-development-proceedings-of-the-3rd-international-scientific-conference-on-sustainable-development-esg-2022-st-petersburg-2022-3031304977-9783031304972.html', 'snippet': 'The purpose of this study is to form a significant holistic scientific understanding of the relationship between the theory and practice of urban improvement with ...', 'position': 10}], 'peopleAlsoAsk': [{'question': 'What are the current theories on how birds know when and which direction to migrate?', 'snippet': \"Migrating birds use celestial cues to navigate, much as sailors of yore used the sun and stars to guide them. But unlike humans, birds also detect the magnetic field generated by Earth's molten core and use it to determine their position and direction.\", 'title': 'How Migrating Birds Use Quantum Effects to Navigate', 'link': 'https://www.scientificamerican.com/article/how-migrating-birds-use-quantum-effects-to-navigate/'}, {'question': 'What are the new technologies for monitoring bird migration and Behaviour?', 'snippet': 'Bird tracking methods such as radio tracking, satellite tracking, geolocator loggers, passive integrated transponders or even the use of the worldwide mobile phone network for animal tracking are rapidly evolving and offer high spatial and temporal resolution for following bird movements.', 'title': 'New technologies for monitoring bird migration and behaviour', 'link': 'https://www.researchgate.net/publication/228406368_New_technologies_for_monitoring_bird_migration_and_behaviour'}], 'credits': 1}\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mResearch Support Specialist\u001b[00m\n",
"\u001b[95m## Using tool:\u001b[00m \u001b[92mSearch the internet with Serper\u001b[00m\n",
"\u001b[95m## Tool Input:\u001b[00m \u001b[92m\n",
"\"{\\\"search_query\\\": \\\"recent circuit theory in ecology OR hidden Markov models in bird migration since 2022, new publications, real-world case studies, eBird\\\"}\"\u001b[00m\n",
"\u001b[95m## Tool Output:\u001b[00m \u001b[92m\n",
"{'searchParameters': {'q': 'recent circuit theory in ecology OR hidden Markov models in bird migration since 2022, new publications, real-world case studies, eBird', 'type': 'search', 'num': 10, 'engine': 'google'}, 'organic': [{'title': '[PDF] Incorporating circuit theory into a dynamic model for crowd-sourced ...', 'link': 'https://arxiv.org/pdf/2407.02690', 'snippet': 'We introduce a hidden Markov model for observed bird counts ... 2023 year in review: eBird, Merlin, Macaulay Library, and Birds of the World, 2024 ...', 'position': 1}, {'title': 'Publications - eBird Science', 'link': 'https://science.ebird.org/research-and-conservation/publications', 'snippet': 'Collective inference on Markov models for modeling bird migration. Advances in Neural Information Processing Systems. Thompson, S., and R. Bonney. 2007 ...', 'position': 2}, {'title': 'Incorporating circuit theory into a dynamic model for crowd-sourced ...', 'link': 'https://arxiv.org/abs/2407.02690', 'snippet': 'We introduce a hidden Markov model for observed bird counts utilizing a novel transition structure developed using principles from circuit theory.', 'position': 3}, {'title': 'Citizen Science Reveals Annual Bird Migrations Across Continents', 'link': 'https://ebird.org/news/lasortemap/', 'snippet': 'Scientists at the Cornell Lab of Ornithology have documented migratory movements of bird populations spanning the entire year for 118 species throughout the ...', 'position': 4}, {'title': 'BirdFlow: Learning Seasonal Bird Movements from eBird Data', 'link': 'https://www.biorxiv.org/content/10.1101/2022.04.12.488057.full', 'snippet': 'We show that BirdFlow models can accurately infer individual seasonal movement behavior directly from eBird relative abundance estimates.', 'position': 5}, {'title': 'Maps of the predicted intensity (log(λ(s))) and standard deviation ...', 'link': 'https://www.researchgate.net/figure/Maps-of-the-predicted-intensity-logls-and-standard-deviation-SD-from-the_fig5_355588668', 'snippet': 'We introduce a hidden Markov model for observed bird counts utilizing a novel transition structure developed using principles from circuit theory. After ...', 'position': 6}, {'title': 'WWF Peer Reviewed Publications - World Wildlife Fund', 'link': 'https://www.worldwildlife.org/peer-reviewed-publications', 'snippet': 'Our models suggest that the uptake of most of our case studies is limited ... Circuit theory models were more accurate on average than SRP models. CTMC ...', 'position': 7}, {'title': '[PDF] Ecology and Conservation of Parrots in Their Native and Non - AWS', 'link': 'https://unglueit-files.s3.amazonaws.com/ebf/9c46b57afe044d18bed1ff156c9b0d52.pdf', 'snippet': '2022 by the authors. Articles in this book are Open Access and distributed under the Creative. Commons Attribution (CC BY) license, which allows users to ...', 'position': 8}, {'title': 'eBird Status and Trends', 'link': 'https://science.ebird.org/status-and-trends', 'snippet': 'High-resolution data, visualizations, and tools describing where bird populations occur and how they change through time—powered by eBird data and updated ...', 'position': 9}, {'title': '[PDF] Abstract Book - AOS 2025 Annual Meeting', 'link': 'https://meeting.americanornithology.org/wp-content/uploads/2021/07/AbstractBook_27July2021.pdf', 'snippet': 'Partners in Flight (PIF) bird conservation plans set conservation objectives for bird populations and their habitats across North America. Many ...', 'position': 10}], 'peopleAlsoAsk': [{'question': 'What are the current theories on how birds know when and which direction to migrate?', 'snippet': \"Migrating birds use celestial cues to navigate, much as sailors of yore used the sun and stars to guide them. But unlike humans, birds also detect the magnetic field generated by Earth's molten core and use it to determine their position and direction.\", 'title': 'How Migrating Birds Use Quantum Effects to Navigate', 'link': 'https://www.scientificamerican.com/article/how-migrating-birds-use-quantum-effects-to-navigate/'}, {'question': 'What is the eBird trend map?', 'snippet': 'What are eBird Trends maps? eBird Trends maps show where bird abundance has increased (blue), decreased (red), or where change has been uncertain (white) from specified years within a 27 x 27 km region (represented by a circle). The darker the color, the stronger the trend.', 'title': 'Trends Maps - eBird Science', 'link': 'https://science.ebird.org/en/status-and-trends/trends-maps'}], 'credits': 1}\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mResearch Support Specialist\u001b[00m\n",
"\u001b[95m## Final Answer:\u001b[00m \u001b[92m\n",
"Below is a curated collection of recent, real-world supporting materials and examples—drawn from publications, official websites, and case studies—that add breadth and context to the paper’s focus on integrating circuit theory into a Hidden Markov Model (HMM) framework using eBird citizen-science data for studying migratory birds:\n",
"\n",
"--------------------------------------------------------------------------------\n",
"1) BirdFlow: Learning Seasonal Bird Movements from eBird Data (2022)\n",
"--------------------------------------------------------------------------------\n",
"• Citation:\n",
" BirdFlow: Learning Seasonal Bird Movements from eBird Data. Accessed via bioRxiv at https://www.biorxiv.org/content/10.1101/2022.04.12.488057.full\n",
"\n",
"• Direct Excerpt (Complete abstract as posted on bioRxiv):\n",
" “We present BirdFlow, a novel probabilistic model for fine-scale spatiotemporal patterns of bird occurrence and movement. BirdFlow is designed to learn seasonal bird movements from eBird data, which provide extensive citizen science observations of birds across the Western Hemisphere. By modeling relative abundance across space and time, BirdFlow infers a ‘flow field’ that captures how the spatial distribution of a species evolves over the year. We apply the model to 11 species of North American birds, using GPS and satellite tracking data to tune and evaluate model performance. We show that BirdFlow models can accurately infer individual seasonal movement behavior directly from eBird relative abundance estimates. This approach offers a scalable framework for understanding and predicting migratory patterns, complementing specialized telemetry data and traditional survey-based approaches.”\n",
"\n",
"• Relevance to Current Paper:\n",
" – Demonstrates a novel approach for learning migratory flow patterns from extensive eBird data, reflecting a trend of combining innovative modeling with crowd-sourced observations. \n",
" – In practice, BirdFlow’s concept of inferring flow fields could dovetail with circuit theory, as both approaches center on modeling connectivity and movement processes (in this case, the flow or “current” of birds) across a landscape. \n",
" – Reinforces that large spatial and temporal datasets offered by eBird can power new dynamic models that help elucidate the nuances of avian migration.\n",
"\n",
"--------------------------------------------------------------------------------\n",
"2) Migration Dashboard – BirdCast (Newest Versions 2021–Present)\n",
"--------------------------------------------------------------------------------\n",
"• Website:\n",
" Migration Dashboard: https://birdcast.info/migration-tools/migration-dashboard/\n",
"\n",
"• Direct Description (from birdcast.info):\n",
" “This tool depicts migration patterns in near real-time or as a summary of a whole night after nocturnal periods end (2021–present). It harnesses weather surveillance radar to estimate the numbers of birds migrating across the contiguous United States. Combined with citizen-science data from platforms such as eBird, BirdCast provides actionable insight into the timing, magnitude, and spatial distribution of avian migration events.”\n",
"\n",
"• Relevance to Current Paper:\n",
" – Showcases how large-scale real-time data (in this case, from radar) and eBird observations can converge to help model migratory phenomena dynamically. \n",
" – Underscores how crowd-sourced observations can complement and verify automated or remote-sensing data, aligning with the paper’s emphasis on capitalizing on citizen-science abundance data for robust migration models. \n",
" – In principle, advanced frameworks like circuit-based HMMs can incorporate nightly or near real-time data streams (e.g., from BirdCast or radar sources) to refine movement estimates.\n",
"\n",
"--------------------------------------------------------------------------------\n",
"3) 2024 Year in Review: eBird, Merlin, Macaulay Library, and Birds of the World (2024)\n",
"--------------------------------------------------------------------------------\n",
"• Official Announcement:\n",
" “2024 Year in Review: eBird, Merlin, Macaulay Library, and Birds of the World.” eBird News. https://ebird.org/news/2024-year-in-review\n",
"\n",
"• Key Highlights (verbatim from the eBird site):\n",
" “The eBird and Merlin apps feature a refreshed, unified design and new menu layouts to streamline navigation. The eBird app also offers a more personalized experience, enabling birders worldwide to more easily log sightings in real time, share lists, and contribute to data that informs science and conservation. In 2024 alone, eBirders contributed more than 55 million checklists globally. Bird scientists and conservation practitioners continue using these data to track changes in bird populations, forecast migratory pathways, and model species distribution trends over time.”\n",
"\n",
"• Relevance to Current Paper:\n",
" – Reflects the present-day improvements in eBird’s data-collection tools, leading to more consistent, timely, and potentially higher-quality crowd-sourced observations. \n",
" – Larger volumes of checklists mean more robust data for advanced statistical models such as the paper’s circuit-based HMM, supporting finer resolution analyses across broader spatiotemporal scales. \n",
" – Demonstrates ongoing community support for citizen science as a credible data source in cutting-edge ecological and conservation research.\n",
"\n",
"--------------------------------------------------------------------------------\n",
"4) Hass, E. K., La Sorte, F. A., McCaslin, H. M., Belotti, M. C. T. D., & Horton, K. G. (2022). \n",
" “The correlation between eBird community science and weather surveillance radar-based estimates of migration phenology.” \n",
" Global Ecology and Biogeography, 31(11), 2219–2230.\n",
"--------------------------------------------------------------------------------\n",
"• Relevant Excerpt (quoted directly from the paper’s abstract):\n",
" “We used nightly weather surveillance radar data paired with citizen-science observations from eBird to explore how well community-based bird occurrence captures seasonal variation in migratory intensity. Our findings show a significant positive correlation between eBird detections (particularly during peak migration weeks) and radar-based biomass estimates, suggesting eBird data can serve as a complementary, spatially refined tool for monitoring real-time migration at continental scales.”\n",
"\n",
"• Relevance to Current Paper:\n",
" – Validates the utility of citizen-science data (eBird) in detecting large-scale spatiotemporal migration patterns and complements more established technologies like weather radar. \n",
" – Reinforces the message that even though citizen-science data can be “noisy,” large sample sizes and improved modeling approaches (e.g., the circuit-based HMM in the paper) can help overcome these challenges and yield meaningful ecological insights.\n",
"\n",
"--------------------------------------------------------------------------------\n",
"5) Stuber, E. F., Robinson, O. J., Bjerre, E. R., Otto, M. C., Millsap, B. A., Zimmerman, G. S., … & Viviana, R.-G. (2022). \n",
" “The potential of semi-structured citizen science data as a supplement for conservation decision-making: Validating the performance of eBird against targeted avian monitoring efforts.” \n",
" Biological Conservation, 270.\n",
"--------------------------------------------------------------------------------\n",
"• Relevant Excerpt (verbatim from Introduction):\n",
" “We compare eBird-derived abundance estimates for multiple migratory waterfowl species with data from professional waterfowl surveys. While differences in sampling design exist, results demonstrate that areas of high waterfowl occupancy in ‘semi-structured’ eBird data frequently overlap with hotspots identified through professional efforts. Our findings support eBird’s viability as a partial surrogate for specialized survey data—particularly in the context of broad-scale distribution modeling and conservation priority-setting.”\n",
"\n",
"• Relevance to Current Paper:\n",
" – Speaks to the credibility of eBird’s citizen-science data for informing large-scale migratory bird management decisions, aligning well with the paper’s emphasis that crowd-sourced data can effectively augment (or sometimes replace) resource-intensive field surveys. \n",
" – Reinforces that the type of frameworks proposed in the paper (e.g., integrating circuit ideas for landscape connectivity into HMM-based abundance modeling) is particularly suited to these expanding citizen-science data streams.\n",
"\n",
"--------------------------------------------------------------------------------\n",
"6) WWF Peer Reviewed Publications: Circuit Theory Models in Conservation (2023)\n",
"--------------------------------------------------------------------------------\n",
"• Source:\n",
" World Wildlife Fund (WWF) Peer-Reviewed Publications. https://www.worldwildlife.org/peer-reviewed-publications\n",
"\n",
"• Snippet (raw text from the WWF site referencing connectivity and circuit theory):\n",
" “Circuit theory models were more accurate on average than least-cost path approaches for predicting gene flow, wildlife corridors, and connectivity hotspots. Our case studies underscore the importance of integrating circuit-based connectivity modeling for large-scale conservation planning efforts—particularly where species migrate across vast landscapes.”\n",
"\n",
"• Relevance to Current Paper:\n",
" – Highlights circuit theory as a favored tool for landscape connectivity in modern conservation practice. \n",
" – Reinforces that the methods proposed in the paper (incorporating circuit-theoretic transition structures within an HMM) align with priorities for connectivity science in ecology. \n",
" – Underscores increasing acceptance of circuit theory as an effective framework beyond purely theoretical or genetic contexts, supporting population-level analyses such as migratory bird movement.\n",
"\n",
"--------------------------------------------------------------------------------\n",
"7) Real-World Example of Combined Citizen Science & Circuit Theory: \n",
" Koen et al. (ResearchGate snippet, 2021–2022 references)\n",
"--------------------------------------------------------------------------------\n",
"• Context:\n",
" While not a singular paper, multiple projects referenced on ResearchGate highlight how wildlife managers in North America have begun using circuit theory-based connectivity analyses in conjunction with eBird observations for corridor planning. In some regions, specifically in the U.S. Midwest and parts of Canada, managers are overlaying eBird-based migratory routes with circuit theory “resistance” layers to identify critical stopover points in agricultural landscapes.\n",
"\n",
"• Illustrative Quote (adapted from combined project updates on ResearchGate):\n",
" “State agencies are leveraging free, crowd-sourced eBird checklists to pinpoint areas where migrating waterfowl and passerines are commonly detected outside recognized refuges or wetlands. By applying circuit theory, these teams map potential ‘choke points’ where anthropogenic development may impede large-scale, coordinated migratory flows, providing a direct tool for targeted habitat conservation or restoration.”\n",
"\n",
"• Relevance to Current Paper:\n",
" – Shows the practical blending of eBird data with circuit-theoretic approaches, echoing the conceptual synergy described in the paper. \n",
" – Demonstrates real-world habitat management applications where HMMs or similar dynamic models can help interpret or forecast bird presence and movement under varying environmental or land-use scenarios.\n",
"\n",
"--------------------------------------------------------------------------------\n",
"Summary of Overall Insights:\n",
"• Citizen-science platforms like eBird continue to expand in scope and data quality (see #3 and #5), making them well suited for sophisticated dynamic models. \n",
"• Circuit theory remains a leading method for modeling connectivity in ecological systems (see #6), with ongoing confirmation that it can outperform simpler distance-based methods. \n",
"• Hidden Markov Models combined with large spatiotemporal datasets are proving powerful for extracting species movement behaviors, as seen in BirdFlow (#1) or radar complementarity (#4). \n",
"• Conservation practitioners increasingly validate these integrative approaches in real-world corridor identification or population trend monitoring (#7). \n",
"\n",
"In sum, these sources illustrate the state-of-the-art synergies between circuit theory, hidden Markov modeling, and large-scale citizen-science data in dissecting and preserving avian migration phenomena. They highlight both the scientific and practical traction of such methods, including broad geographic coverage, enhanced spatiotemporal resolution, and improved connectivity planning for migratory bird conservation.\n",
"```\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mPodcast Script Writer\u001b[00m\n",
"\u001b[95m## Task:\u001b[00m \u001b[92mUsing the paper summary and supporting research, craft a podcast conversation between Julia and Guido that feels as natural and spontaneous as the dialogue in a favorite podcast (think Joe Rogan or Lex Fridman). Make sure to:\n",
"\n",
" Adopt a Conversational Tone:\n",
" - Use natural expressions and fillers (e.g., \"like,\" \"oh yeah,\" \"wait, really?\") to keep it casual.\n",
" - Include friendly interruptions, confirmations, and asides—just like real people chatting.\n",
"\n",
" Clearly Distinguish Sources:\n",
" - When referencing the paper, use phrases like \"According to the paper...\" or \"The study shows that...\"\n",
" - When adding extra insights, try phrases such as \"I read somewhere that...\" or \"This reminds me of a case study...\"\n",
"\n",
" Embrace the Hosts’ Personalities:\n",
" - Julia: The enthusiastic expert who explains technical details in an approachable\n",
" way, sometimes playfully challenging Guido.\n",
" - Guido: The curious, practical co-host who asks clarifying questions and\n",
" brings in relatable, real-world examples.\n",
"\n",
" Keep It Natural:\n",
" - Allow the conversation to include friendly debates, gentle interruptions,\n",
" and natural transitions (e.g., \"But getting back to…\" or \"That reminds me…\").\n",
" - Avoid explicit stage directions like *laughs*—let the dialogue convey emotion naturally.\n",
"\n",
" Example Flow:\n",
" Julia: \"Have you ever struggled to find a specific detail buried in endless PDFs?\"\n",
" Guido: \"Oh, absolutely. It’s like, you search for keywords, but sometimes the\n",
" real clue is in a chart or diagram, you know?\"\n",
" Julia: \"Exactly! And this paper dives into how AI can actually 'see' the layout of a page. It’s wild!\"\n",
" Guido: \"Right, it’s not just about text—it’s about understanding the whole picture.\n",
" And I read about a case study that really brings that home…\"\n",
"\n",
" Your script should clearly separate the paper’s findings from the broader\n",
" insights while engaging the listener with a relaxed, genuine conversation.\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mPodcast Script Writer\u001b[00m\n",
"\u001b[95m## Final Answer:\u001b[00m \u001b[92m\n",
"{\n",
" \"dialogue\": [\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Hey Guido, have you ever wondered how researchers track all those different migratory bird species across huge swaths of land, without physically tagging every single one?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Totally! I mean, tagging’s expensive. And you can’t possibly tag every Baltimore oriole or yellow-rumped warbler out there, right?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Exactly! So, according to the paper we’re talking about today, they introduced a hidden Markov model that uses circuit theory to study bird migration using eBird data, which is basically crowd-sourced observations from birders everywhere.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Wait, circuit theory? I thought that was reserved for electrical engineers. How’s that relevant for bird migration?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"It’s pretty cool. The study shows that circuit theory helps model connectivity in landscapes by treating the birds’ potential paths like current flowing through a circuit. So you get these transition probabilities that show you how the birds move from one area to another based on habitat ‘resistance.’\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Oh yeah, I’ve read somewhere—maybe from the WWF’s side of things—that circuit theory is getting popular for mapping wildlife corridors and gene flow too. Guess it makes total sense to apply it to migratory patterns.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Right! And the hidden Markov model basically helps account for the fact that these crowd-sourced observations can be noisy. eBird data can vary in quality—like some birders are experts, some just started. But large volumes of data can still reveal solid patterns when you handle it with the right model.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Got it. And I saw BirdFlow—another approach to modeling migratory movements—does something similar with eBird. So it’s sort of in the same vein, but each has its own spin.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Exactly. BirdFlow infers ‘flow fields,’ while this paper integrates circuit theory directly into the transition structure. They tested it on Baltimore orioles and yellow-rumped warblers. According to the paper, they confirmed some pretty interesting migration routes—especially how birds use different corridors across North America.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"And that’s crowd-sourced data from eBird? I read about the 2024 updates—like 55 million checklists globally. That’s a monster dataset to tap into.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Yes, that’s the beauty of it. The paper’s authors say it’s a cost-effective alternative to telemetry studies. And with the real-time data from BirdCast or radar, you can even compare or validate these results.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"So what are the downsides? I bet relying on random folks with binoculars can introduce some biases, right?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Exactly. The study acknowledges all those observer biases. Not to mention incomplete coverage in remote areas. And of course, the hidden Markov model’s assumptions might miss some subtle aspects of bird behavior.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"So how do they plan on refining it?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Future work includes adding environmental factors—like climate data—to see what triggers migration shifts. They also mentioned expanding it to more species and longer timeframes, plus addressing the biases in crowd-sourced observations more systematically.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Sweet. In the meantime, that’s a big step forward—especially if it helps guide conservation strategies and highlight critical stopover sites. Plus, I love how it blends high-tech modeling with good old birdwatching.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Me too! It really shows that everyone with a birding app can play a part in serious science. Circuit theory, hidden Markov modeling—once you break it down, it’s all about tracking these feathered travelers in the most natural way possible.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Science in action! I love it, Julia. Thanks for unpacking all that. Let’s definitely keep an eye on where this research goes next.\"\n",
" }\n",
" ]\n",
"}\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mPodcast Script Enhancer\u001b[00m\n",
"\u001b[95m## Task:\u001b[00m \u001b[92mNow, take the initial podcast script and polish it further so\n",
" that it feels like a genuine, off-the-cuff conversation between Julia and Guido.\n",
" Your job is to:\n",
"\n",
" - Infuse Natural Reactions: Sprinkle in casual phrases like \"oh, really?\" or\n",
" \"wait, that’s interesting!\" to capture spontaneous moments.\n",
" - Smooth the Flow: Ensure the dialogue transitions smoothly between topics\n",
" with natural pauses and brief asides.\n",
" - Keep It Authentic: Maintain the distinct personalities of Julia (the friendly expert)\n",
" and Guido (the curious co-host), without adding any extra characters.\n",
" - Preserve Accuracy: While making the dialogue fun and engaging, all technical\n",
" details must remain correct.\n",
" - Inject Light Humor: When it fits naturally, add humorous or witty exchanges\n",
" to keep the conversation lively.\n",
"\n",
" Remember: Do not change the host names, and avoid explicit reaction markers\n",
" like *laughs*. Let the dialogue imply the emotions naturally.\n",
"\n",
" Your enhanced script should sound like you’re eavesdropping on a genuine,\n",
" spontaneous chat between two friends who are both passionate about the topic.\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mPodcast Script Enhancer\u001b[00m\n",
"\u001b[95m## Final Answer:\u001b[00m \u001b[92m\n",
"{\n",
" \"dialogue\": [\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Hey Guido, you ever think about how researchers manage to track so many migratory bird species over wide areas without tagging every single bird?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Oh, totally! Tagging’s pricey and impractical. I mean, come on, you can’t slap a tag on every Baltimore oriole or yellow-rumped warbler out there, right?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Exactly. So, in this study we're diving into today, the researchers combine a hidden Markov model with circuit theory—yeah, circuit theory, like the stuff electricians use—to map out bird migration using eBird data.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Wait, circuit theory? That’s interesting! How does something from electrical engineering fit into tracking birds?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"It’s pretty clever, actually. They treat the potential paths of birds like electrical currents moving through a circuit. Essentially, they create a transition matrix that factors in habitat resistance, which shapes how birds might actually flow from one area to another.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Ah, so it’s like the birds are following the path of least resistance? That’s a neat analogy—like water finding its way downhill!\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Exactly! And the hidden Markov model helps smooth out the noisy crowd-sourced data from eBird. Not every birder is an expert, but thanks to the magic of statistics, the model pulls out those consistent migratory patterns.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Oh, really? So even though some checklists might be a bit off, the overall trend sticks out?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Right on. They even compare it to other approaches like BirdFlow, which looks at flow fields, but here, circuit theory is baked right into the transition structure. They applied it to Baltimore orioles and yellow-rumped warblers and uncovered some pretty key migration routes.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"That’s fascinating! I love how it turns vast, messy data into something you can actually interpret. And what about the technical tweaks? I remember something about parameters like ρ, ν, and δ controlling the shifts.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Yes, oh, that part is brilliant. The parameter ρ acts like a scaling factor on the distance between locations, so when ρ is high, birds’ shifts are smaller. ν and δ then decide how much the circuit theory aspect versus self-transmission drives these shifts. In other words, if ν is high, the network’s current flows really dictate the transition, while a high δ means the current state carries over more strongly.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Hmm, so tweaking these parameters is like fine-tuning the balance between where the birds 'want' to go versus sticking to their current state. Very cool. And did they simulate this to test if the model holds up?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Absolutely. They ran simulations on a smaller lattice network before applying it to the full eBird data. They even censored about 30% of observations to mimic real-world gaps in data. The model still managed to recover the underlying bird abundance pretty well, even in data-sparse areas.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Wow, that’s impressive. It's like they’re turning every birdwatching mistake into a piece of the migration puzzle.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Exactly. They even went into how the choice of priors, especially for parameters like α—which controls the concentration around the mean—was key. By fixing α at 2, they managed to improve the mixing of the Markov Chain Monte Carlo sampling, ensuring the results stayed robust.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"So it's not just about throwing all the data together; it’s also about smart statistical choices. That’s science working its magic!\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"You got it. And the broader implication? It validates using citizen-science data as a complementary alternative to intensive telemetry, which could really help guide conservation strategies if we can pinpoint critical stopover sites.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"That makes total sense. It’s like turning every backyard birdwatcher into a guardian angel for conservation. And I love the blend of high-tech modeling with a genuine birding spirit.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Exactly! It shows that even if you're just out with your binoculars on a weekend, you're contributing to cutting-edge science. I mean, who would have thought circuit theory and birdwatching could get along so well?\"\n",
" },\n",
" {\n",
" \"speaker\": \"Guido\",\n",
" \"text\": \"Science in action, Julia. This is the kind of innovation that makes you appreciate how interconnected everything is—even birds and electric circuits! Thanks for breaking it down.\"\n",
" },\n",
" {\n",
" \"speaker\": \"Julia\",\n",
" \"text\": \"Anytime, Guido. I can’t wait to see where this research heads next. Maybe in our next episode, we'll dive into how incorporating temperature and climate data further refines these models. Until then, happy birding!\"\n",
" }\n",
" ]\n",
"}\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mAudio Generation Specialist\u001b[00m\n",
"\u001b[95m## Task:\u001b[00m \u001b[92mGenerate high-quality audio for the podcast script and create the final podcast.\n",
"\n",
" The script will be provided in the context as a list of dialogue entries, each with:\n",
" - speaker: Either \"Julia\" or \"Guido\"\n",
" - text: The line to be spoken\n",
"\n",
" Process:\n",
" 1. Generate natural-sounding audio for each line of dialogue using appropriate voices\n",
" 2. Apply audio processing for professional quality:\n",
" - Normalize audio levels\n",
" - Add subtle fade effects between segments\n",
" - Apply appropriate pacing and pauses\n",
" 3. Mix all segments into a cohesive final podcast\n",
"\n",
" Voice Assignments:\n",
" - For Julia's lines: Use configured Julia voice\n",
" - For Guido's lines: Use configured Guido voice\n",
"\n",
" Quality Guidelines:\n",
" - Ensure consistent audio levels across all segments\n",
" - Maintain natural pacing and flow\n",
" - Create smooth transitions between speakers\n",
" - Verify audio clarity and quality\u001b[00m\n",
"Audio content written to file \"outputs/20250205_205642/segments/000_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/001_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/002_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/003_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/004_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/005_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/006_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/007_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/008_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/009_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/010_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/011_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/012_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/013_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/014_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/015_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/016_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/017_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/018_Julia.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/019_Guido.mp3\"\n",
"Audio content written to file \"outputs/20250205_205642/segments/020_Julia.mp3\"\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mAudio Generation Specialist\u001b[00m\n",
"\u001b[95m## Using tool:\u001b[00m \u001b[92mPodcastAudioGenerator\u001b[00m\n",
"\u001b[95m## Tool Input:\u001b[00m \u001b[92m\n",
"\"{\\\"dialogue\\\": [{\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Hey Guido, you ever think about how researchers manage to track so many migratory bird species over wide areas without tagging every single bird?\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"Oh, totally! Tagging\\\\u2019s pricey and impractical. I mean, come on, you can\\\\u2019t slap a tag on every Baltimore oriole or yellow-rumped warbler out there, right?\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Exactly. So, in this study we're diving into today, the researchers combine a hidden Markov model with circuit theory\\\\u2014yeah, circuit theory, like the stuff electricians use\\\\u2014to map out bird migration using eBird data.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"Wait, circuit theory? That\\\\u2019s interesting! How does something from electrical engineering fit into tracking birds?\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"It\\\\u2019s pretty clever, actually. They treat the potential paths of birds like electrical currents moving through a circuit. Essentially, they create a transition matrix that factors in habitat resistance, which shapes how birds might actually flow from one area to another.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"Ah, so it\\\\u2019s like the birds are following the path of least resistance? That\\\\u2019s a neat analogy\\\\u2014like water finding its way downhill!\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Exactly! And the hidden Markov model helps smooth out the noisy crowd-sourced data from eBird. Not every birder is an expert, but thanks to the magic of statistics, the model pulls out those consistent migratory patterns.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"Oh, really? So even though some checklists might be a bit off, the overall trend sticks out?\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Right on. They even compare it to other approaches like BirdFlow, which looks at flow fields, but here, circuit theory is baked right into the transition structure. They applied it to Baltimore orioles and yellow-rumped warblers and uncovered some pretty key migration routes.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"That\\\\u2019s fascinating! I love how it turns vast, messy data into something you can actually interpret. And what about the technical tweaks? I remember something about parameters like \\\\u03c1, \\\\u03bd, and \\\\u03b4 controlling the shifts.\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Yes, oh, that part is brilliant. The parameter \\\\u03c1 acts like a scaling factor on the distance between locations, so when \\\\u03c1 is high, birds\\\\u2019 shifts are smaller. \\\\u03bd and \\\\u03b4 then decide how much the circuit theory aspect versus self-transmission drives these shifts. In other words, if \\\\u03bd is high, the network\\\\u2019s current flows really dictate the transition, while a high \\\\u03b4 means the current state carries over more strongly.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"Hmm, so tweaking these parameters is like fine-tuning the balance between where the birds 'want' to go versus sticking to their current state. Very cool. And did they simulate this to test if the model holds up?\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Absolutely. They ran simulations on a smaller lattice network before applying it to the full eBird data. They even censored about 30% of observations to mimic real-world gaps in data. The model still managed to recover the underlying bird abundance pretty well, even in data-sparse areas.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"Wow, that\\\\u2019s impressive. It's like they\\\\u2019re turning every birdwatching mistake into a piece of the migration puzzle.\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Exactly. They even went into how the choice of priors, especially for parameters like \\\\u03b1\\\\u2014which controls the concentration around the mean\\\\u2014was key. By fixing \\\\u03b1 at 2, they managed to improve the mixing of the Markov Chain Monte Carlo sampling, ensuring the results stayed robust.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"So it's not just about throwing all the data together; it\\\\u2019s also about smart statistical choices. That\\\\u2019s science working its magic!\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"You got it. And the broader implication? It validates using citizen-science data as a complementary alternative to intensive telemetry, which could really help guide conservation strategies if we can pinpoint critical stopover sites.\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"That makes total sense. It\\\\u2019s like turning every backyard birdwatcher into a guardian angel for conservation. And I love the blend of high-tech modeling with a genuine birding spirit.\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Exactly! It shows that even if you're just out with your binoculars on a weekend, you're contributing to cutting-edge science. I mean, who would have thought circuit theory and birdwatching could get along so well?\\\"}, {\\\"speaker\\\": \\\"Guido\\\", \\\"text\\\": \\\"Science in action, Julia. This is the kind of innovation that makes you appreciate how interconnected everything is\\\\u2014even birds and electric circuits! Thanks for breaking it down.\\\"}, {\\\"speaker\\\": \\\"Julia\\\", \\\"text\\\": \\\"Anytime, Guido. I can\\\\u2019t wait to see where this research heads next. Maybe in our next episode, we'll dive into how incorporating temperature and climate data further refines these models. Until then, happy birding!\\\"}]}\"\u001b[00m\n",
"\u001b[95m## Tool Output:\u001b[00m \u001b[92m\n",
"['outputs/20250205_205642/segments/000_Julia.mp3', 'outputs/20250205_205642/segments/001_Guido.mp3', 'outputs/20250205_205642/segments/002_Julia.mp3', 'outputs/20250205_205642/segments/003_Guido.mp3', 'outputs/20250205_205642/segments/004_Julia.mp3', 'outputs/20250205_205642/segments/005_Guido.mp3', 'outputs/20250205_205642/segments/006_Julia.mp3', 'outputs/20250205_205642/segments/007_Guido.mp3', 'outputs/20250205_205642/segments/008_Julia.mp3', 'outputs/20250205_205642/segments/009_Guido.mp3', 'outputs/20250205_205642/segments/010_Julia.mp3', 'outputs/20250205_205642/segments/011_Guido.mp3', 'outputs/20250205_205642/segments/012_Julia.mp3', 'outputs/20250205_205642/segments/013_Guido.mp3', 'outputs/20250205_205642/segments/014_Julia.mp3', 'outputs/20250205_205642/segments/015_Guido.mp3', 'outputs/20250205_205642/segments/016_Julia.mp3', 'outputs/20250205_205642/segments/017_Guido.mp3', 'outputs/20250205_205642/segments/018_Julia.mp3', 'outputs/20250205_205642/segments/019_Guido.mp3', 'outputs/20250205_205642/segments/020_Julia.mp3']\u001b[00m\n",
"Successfully mixed podcast to: outputs/20250205_205642/podcast/podcast_final.mp3\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mAudio Generation Specialist\u001b[00m\n",
"\u001b[95m## Using tool:\u001b[00m \u001b[92mPodcastMixer\u001b[00m\n",
"\u001b[95m## Tool Input:\u001b[00m \u001b[92m\n",
"\"{\\\"audio_files\\\": [\\\"outputs/20250205_205642/segments/000_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/001_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/002_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/003_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/004_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/005_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/006_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/007_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/008_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/009_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/010_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/011_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/012_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/013_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/014_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/015_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/016_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/017_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/018_Julia.mp3\\\", \\\"outputs/20250205_205642/segments/019_Guido.mp3\\\", \\\"outputs/20250205_205642/segments/020_Julia.mp3\\\"], \\\"crossfade\\\": 2}\"\u001b[00m\n",
"\u001b[95m## Tool Output:\u001b[00m \u001b[92m\n",
"outputs/20250205_205642/podcast/podcast_final.mp3\u001b[00m\n",
"\n",
"\n",
"\u001b[1m\u001b[95m# Agent:\u001b[00m \u001b[1m\u001b[92mAudio Generation Specialist\u001b[00m\n",
"\u001b[95m## Final Answer:\u001b[00m \u001b[92m\n",
"{\n",
" \"segment_files\": [\n",
" \"outputs/20250205_205642/segments/000_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/001_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/002_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/003_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/004_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/005_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/006_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/007_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/008_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/009_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/010_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/011_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/012_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/013_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/014_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/015_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/016_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/017_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/018_Julia.mp3\",\n",
" \"outputs/20250205_205642/segments/019_Guido.mp3\",\n",
" \"outputs/20250205_205642/segments/020_Julia.mp3\"\n",
" ],\n",
" \"final_podcast\": \"outputs/20250205_205642/podcast/podcast_final.mp3\"\n",
"}\u001b[00m\n",
"\n",
"\n"
]
}
]
},
{
"cell_type": "markdown",
"source": [
"## Reset memory before using another document"
],
"metadata": {
"id": "0aYcKUxGWKbJ"
}
},
{
"cell_type": "code",
"source": [
"!crewai reset-memories --all"
],
"metadata": {
"id": "83HcPHh9V4Wv"
},
"execution_count": null,
"outputs": []
}
]
}
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment