AI 音声文字起こし&Google ドキュメントレポートジェネレーター

中級

これはAI分野の自動化ワークフローで、9個のノードを含みます。主にWebhook, GoogleDocs, GmailTrigger, VlmRunなどのノードを使用、AI技術を活用したスマート自動化を実現。 AI音声文字起こしと Google Docs レポート生成ツール

前提条件
  • HTTP Webhookエンドポイント(n8nが自動生成)
  • Googleアカウント + Gmail API認証情報

カテゴリー

ワークフロープレビュー
ノード接続関係を可視化、ズームとパンをサポート
ワークフローをエクスポート
以下のJSON設定をn8nにインポートして、このワークフローを使用できます
{
  "meta": {
    "instanceId": "0a7ae129a7d703150135f6cc24d0469f20ffd22b5ed59c23ee3e0608264e6d98",
    "templateId": "3905",
    "templateCredsSetupCompleted": true
  },
  "nodes": [
    {
      "id": "2175eca1-4842-4601-b0ac-84264cdc48ca",
      "name": "🎙️ ワークフロー概要",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        -260,
        720
      ],
      "parameters": {
        "color": 7,
        "width": 460,
        "height": 720,
        "content": "## 🎙️ AI Audio Transcription with VLM Run\n\n**Overview:**\nAutomatically transcribes audio files from Gmail and generates formatted transcription reports in Google Docs.\n\n**Workflow:**\n1. 📧 Monitor Gmail for audio attachments\n2. 🎵 Extract and process audio files\n3. 🤖 Transcribe using VLM Run AI\n4. 🔗 Receive results via webhook\n5. 📄 Generate formatted Google Doc\n\n\n**Perfect for:**\n- Meeting recordings\n- Voice memos\n- Interview transcriptions\n- Podcast notes\n- Audio journalism\n- Accessibility documentation\n\n\n**Requirements:**\n- VLM Run API access\n- Gmail OAuth2\n- Google Docs OAuth2\n- Webhook endpoint"
      },
      "typeVersion": 1
    },
    {
      "id": "3d1b367c-226c-4d9c-bc9e-0643ff3896bc",
      "name": "📧 メール監視",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        220,
        720
      ],
      "parameters": {
        "color": 7,
        "width": 360,
        "height": 720,
        "content": "## 📧 Gmail Audio Detection\n\n**Monitors incoming emails and automatically downloads audio attachments.**\n\n**Process:**\n1. Polls Gmail every minute\n2. Detects new emails with attachments\n3. Downloads audio files automatically\n4. Triggers transcription workflow\n\n\n**Supported Formats:**\n- MP3, WAV, M4A\n- AAC, OGG, FLAC\n- Various bitrates\n- Mobile recordings\n\n\n**Configuration:**\n- Enable attachment downloads\n- Automatic file extraction"
      },
      "typeVersion": 1
    },
    {
      "id": "4c2744d9-c35d-4629-ac54-b3f299cf16e3",
      "name": "🤖 AI文字起こし",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        600,
        720
      ],
      "parameters": {
        "color": 2,
        "width": 360,
        "height": 720,
        "content": "## 🤖 Audio AI Processing\n\n**VLM Run transcribes audio with high accuracy**\n\n**Features:**\n- Advanced speech recognition\n- Automatic punctuation\n- Timestamp generation\n\n\n**Output:**\n- Complete transcription\n- Time-segmented text\n- Metadata extraction\n- Duration information\n\n\n**Benefits:**\n- Human-level accuracy\n- Fast processing\n- Consistent formatting\n- Professional quality"
      },
      "typeVersion": 1
    },
    {
      "id": "52c19808-27ae-4ccd-af4d-a977bd44f18a",
      "name": "🔗 非同期処理",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        980,
        720
      ],
      "parameters": {
        "color": 5,
        "width": 320,
        "height": 720,
        "content": "## 🔗 Webhook Processing Flow\n\n**How asynchronous transcription works:**\n\n**Step 1:** Audio sent to VLM Run with callback URL\n**Step 2:** Transcription starts in background\n**Step 3:** Webhook receives completion\n**Step 4:** Results formatted for Google Docs\n\n\n**Why Async?**\n- Handles long audio files\n- Prevents workflow timeouts\n- Better resource management\n- Reliable processing\n\n\n**Callback Data:**\n- Complete transcript\n- Segment timestamps\n- Processing metadata\n- Error handling"
      },
      "typeVersion": 1
    },
    {
      "id": "7046634c-3534-4f62-9df9-7ece42a4b969",
      "name": "📄 ドキュメント生成",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        1320,
        720
      ],
      "parameters": {
        "color": 7,
        "width": 320,
        "height": 720,
        "content": "## 📄 Professional Reports\n\n**Creates beautifully formatted Google Docs with transcription results.**\n\n**Report includes:**\n- Date and timestamp\n- Total audio duration\n- Segmented transcription\n- Time markers for each segment\n- Professional formatting\n\n\n**Output Location:**\n- Specified Google Doc\n- Appends new transcriptions\n- Maintains formatting"
      },
      "typeVersion": 1
    },
    {
      "id": "31d1f0c3-73e7-4e1a-bc08-72c5694e7153",
      "name": "メール添付ファイルの監視",
      "type": "n8n-nodes-base.gmailTrigger",
      "notes": "Continuously monitors Gmail for new emails with audio attachments. Automatically downloads all attachments and triggers the transcription workflow.",
      "position": [
        320,
        1260
      ],
      "parameters": {
        "simple": false,
        "filters": {},
        "options": {
          "downloadAttachments": true
        },
        "pollTimes": {
          "item": [
            {
              "mode": "everyMinute"
            }
          ]
        }
      },
      "credentials": {
        "gmailOAuth2": {
          "id": "7MVziRpoyD8arghO",
          "name": "Gmail account 2"
        }
      },
      "typeVersion": 1.2
    },
    {
      "id": "d2c14402-db05-4bf4-a8a5-402587d69ee8",
      "name": "VLM音声文字起こしの実行",
      "type": "@vlm-run/n8n-nodes-vlmrun.vlmRun",
      "notes": "Processes audio files using VLM AI to generate accurate transcriptions with timestamps. Runs asynchronously for large audio files.",
      "position": [
        720,
        1260
      ],
      "parameters": {
        "file": "attachment_0",
        "domain": "audio.transcription",
        "operation": "audio",
        "callbackUrl": "https://playground.vlm.run/webhook/audio-transcription",
        "processAsynchronously": true
      },
      "credentials": {
        "vlmRunApi": {
          "id": "cF5KpbVmnt36bdWr",
          "name": "VLM Run account 6"
        }
      },
      "typeVersion": 1
    },
    {
      "id": "98e98532-dc44-4b15-a75d-24b3d7823c84",
      "name": "文字起こし結果の受信",
      "type": "n8n-nodes-base.webhook",
      "notes": "Receives the completed transcription from VLM AI when asynchronous processing is finished. Contains full transcript with timestamps.",
      "position": [
        1080,
        1260
      ],
      "webhookId": "d599acd8-1891-4d07-b2f9-092cb6381b41",
      "parameters": {
        "path": "audio-transcription",
        "options": {},
        "httpMethod": "POST"
      },
      "typeVersion": 2
    },
    {
      "id": "c9468f04-42da-4725-a462-97cdcf6c40ac",
      "name": "文字起こしレポートの生成",
      "type": "n8n-nodes-base.googleDocs",
      "notes": "Creates a professionally formatted Google Doc with the transcription results, including timestamps and metadata.",
      "position": [
        1420,
        1260
      ],
      "parameters": {
        "simple": false,
        "actionsUi": {
          "actionFields": [
            {
              "text": "=📄 Audio Transcription Report\n\n🗓️ Date: {{ new Date($json.body.completed_at).toLocaleString('en-US', { dateStyle: 'medium', timeStyle: 'short' }) }}  \n⏱️ Total Duration: {{ $json.body.response.metadata.duration }} seconds  \n{{ \n$json.body.response.segments.map((segment, index) => \n  `\\n` +\n  `🔹 Segment ${index + 1}\\n` +\n  `⏰ Time: ${segment.start_time.toFixed(2)}s → ${segment.end_time.toFixed(2)}s\\n` +\n  `📝 Transcript: \"${segment.content.trim()}\"\\n`\n).join('\\n')\n}}\n",
              "action": "insert"
            }
          ]
        },
        "operation": "update",
        "documentURL": "https://docs.google.com/document/d/1QFHvOgEYKMLVpIRv6Mc7lXfDXhtoozgEK7O-wuNLzl0/edit?tab=t.0",
        "authentication": "oAuth2"
      },
      "credentials": {
        "googleDocsOAuth2Api": {
          "id": "3x0WI9iZ6ToA2fXw",
          "name": "Google Docs account"
        }
      },
      "typeVersion": 1
    }
  ],
  "pinData": {},
  "connections": {
    "31d1f0c3-73e7-4e1a-bc08-72c5694e7153": {
      "main": [
        [
          {
            "node": "d2c14402-db05-4bf4-a8a5-402587d69ee8",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "98e98532-dc44-4b15-a75d-24b3d7823c84": {
      "main": [
        [
          {
            "node": "c9468f04-42da-4725-a462-97cdcf6c40ac",
            "type": "main",
            "index": 0
          }
        ]
      ]
    }
  }
}
よくある質問

このワークフローの使い方は?

上記のJSON設定コードをコピーし、n8nインスタンスで新しいワークフローを作成して「JSONからインポート」を選択、設定を貼り付けて認証情報を必要に応じて変更してください。

このワークフローはどんな場面に適していますか?

中級 - 人工知能

有料ですか?

このワークフローは完全無料です。ただし、ワークフローで使用するサードパーティサービス(OpenAI APIなど)は別途料金が発生する場合があります。

ワークフロー情報
難易度
中級
ノード数9
カテゴリー1
ノードタイプ5
難易度説明

経験者向け、6-15ノードの中程度の複雑さのワークフロー

作成者
Shahrear

Shahrear

@shahrear

I’m Shahrear, a Software Engineer with over 5 years of experience in full-stack development and workflow automation. I specialize in building intelligent, scalable automations using n8n, helping teams streamline operations and boost productivity. I’m also an expert in developing custom n8n nodes, with published work on npm - including the @vlm-run/n8n-nodes-vlmrun package. https://github.com/vlm-run/n8n-nodes-vlmrun

外部リンク
n8n.ioで表示

このワークフローを共有

カテゴリー

カテゴリー: 34