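Extract and Verify Legal Citations from Documents Using PDF Vector AI
Intermediate
This is an automation workflow in the Document Extraction and AI Summarization categories. It contains 8 nodes, mainly If, Code, GoogleDrive, ManualTrigger, and PdfVector, and uses PDF Vector AI to extract and verify legal citations in documents.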
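To make the validation step concrete, here is a minimal standalone sketch of the case-citation check performed by the `analyze-citations` Code node. It runs in plain Node.js, outside n8n. The function name `findIncompleteCaseCitations` and the sample citations are hypothetical; only the field names (`caseName`, `reporter`, `volume`, `page`) come from the extraction schema.

```javascript
// Standalone sketch of the case-citation check (plain Node.js, no n8n needed).
// The sample data below is hypothetical and only follows the schema's field names.
function findIncompleteCaseCitations(caseCitations = []) {
  const issues = [];
  caseCitations.forEach((cite, index) => {
    // A case citation needs reporter, volume, and page to be locatable.
    if (!cite.reporter || !cite.volume || !cite.page) {
      issues.push({
        type: 'case',
        index,
        citation: cite.caseName,
        issue: 'Missing reporter, volume, or page',
      });
    }
  });
  return issues;
}

const sampleCitations = [
  { caseName: 'Example v. Sample', reporter: 'F.3d', volume: '123', page: '456', year: '2001' },
  { caseName: 'Placeholder v. Demo', reporter: 'U.S.', volume: '', page: '789', year: '1999' },
];

const issues = findIncompleteCaseCitations(sampleCitations);
console.log(issues);
```

The second sample entry has an empty `volume`, so it is the only one flagged. Note that this check only tests for missing fields; it does not confirm that the cited case exists.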
Prerequisites
- Google Drive API credentials
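Besides credentials, the Google Drive node in this workflow reads its target from `{{ $json.fileId }}`, and the Manual Trigger as configured does not appear to supply that field. One possible way to provide it is a small Set or Code node between the two. The sketch below shows the item shape such a node would emit. The helper names `extractDriveFileId` and `buildTriggerItem` are hypothetical, and the link patterns are assumptions based on the common `/file/d/<ID>/` and `?id=<ID>` forms of Drive share links.

```javascript
// Hypothetical helper: pull the file ID out of a Google Drive share link.
// Assumes the common ".../file/d/<ID>/..." or "...?id=<ID>" link shapes.
function extractDriveFileId(urlOrId) {
  const pathMatch = urlOrId.match(/\/d\/([A-Za-z0-9_-]+)/);
  if (pathMatch) return pathMatch[1];
  const queryMatch = urlOrId.match(/[?&]id=([A-Za-z0-9_-]+)/);
  if (queryMatch) return queryMatch[1];
  // Fall back to treating the input as a bare ID.
  return urlOrId;
}

// Shape of the item a Set or Code node would need to emit before the Drive node.
function buildTriggerItem(urlOrId) {
  return [{ json: { fileId: extractDriveFileId(urlOrId) } }];
}

const item = buildTriggerItem('https://drive.google.com/file/d/FILE_ID_PLACEHOLDER/view');
console.log(item[0].json.fileId);
```

Inside an n8n Code node, only the `return [{ json: { fileId } }]` part would be needed; the rest is there so the sketch runs on its own.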
Export Workflow
Copy the following JSON configuration and import it into n8n to use this workflow.
{
"meta": {
"instanceId": "placeholder"
},
"nodes": [
{
"id": "manual-trigger",
"name": "Manual Trigger",
"type": "n8n-nodes-base.manualTrigger",
"notes": "Start citation extraction",
"position": [
250,
300
],
"parameters": {},
"typeVersion": 1
},
{
"id": "google-drive",
"name": "Google Drive - Get Legal Document",
"type": "n8n-nodes-base.googleDrive",
"notes": "Retrieve document from Drive",
"position": [
450,
300
],
"parameters": {
"fileId": "={{ $json.fileId }}",
"operation": "download"
},
"typeVersion": 3
},
{
"id": "pdfvector-extract",
"name": "PDF Vector - Extract Citations",
"type": "n8n-nodes-pdfvector.pdfVector",
"notes": "Extract all citations",
"position": [
650,
300
],
"parameters": {
"prompt": "Extract all legal citations from this document or image. Include case citations (with reporter and year), statute citations (with title and section), regulatory citations, and academic citations (with author, title, journal, and year). For each citation, include the surrounding context (1-2 sentences) and page number where it appears. Use OCR if this is a scanned legal document or image.",
"schema": "{\"type\":\"object\",\"properties\":{\"documentInfo\":{\"type\":\"object\",\"properties\":{\"title\":{\"type\":\"string\"},\"documentType\":{\"type\":\"string\"},\"court\":{\"type\":\"string\"},\"date\":{\"type\":\"string\"},\"docketNumber\":{\"type\":\"string\"}}},\"caseCitations\":{\"type\":\"array\",\"items\":{\"type\":\"object\",\"properties\":{\"caseName\":{\"type\":\"string\"},\"reporter\":{\"type\":\"string\"},\"volume\":{\"type\":\"string\"},\"page\":{\"type\":\"string\"},\"year\":{\"type\":\"string\"},\"court\":{\"type\":\"string\"},\"context\":{\"type\":\"string\"},\"pageNumber\":{\"type\":\"number\"},\"pinCite\":{\"type\":\"string\"}}}},\"statuteCitations\":{\"type\":\"array\",\"items\":{\"type\":\"object\",\"properties\":{\"title\":{\"type\":\"string\"},\"code\":{\"type\":\"string\"},\"section\":{\"type\":\"string\"},\"subsection\":{\"type\":\"string\"},\"year\":{\"type\":\"string\"},\"context\":{\"type\":\"string\"},\"pageNumber\":{\"type\":\"number\"}}}},\"regulatoryCitations\":{\"type\":\"array\",\"items\":{\"type\":\"object\",\"properties\":{\"title\":{\"type\":\"string\"},\"agency\":{\"type\":\"string\"},\"section\":{\"type\":\"string\"},\"context\":{\"type\":\"string\"},\"pageNumber\":{\"type\":\"number\"}}}},\"academicCitations\":{\"type\":\"array\",\"items\":{\"type\":\"object\",\"properties\":{\"authors\":{\"type\":\"string\"},\"title\":{\"type\":\"string\"},\"journal\":{\"type\":\"string\"},\"volume\":{\"type\":\"string\"},\"page\":{\"type\":\"string\"},\"year\":{\"type\":\"string\"},\"doi\":{\"type\":\"string\"},\"context\":{\"type\":\"string\"},\"pageNumber\":{\"type\":\"number\"}}}},\"otherCitations\":{\"type\":\"array\",\"items\":{\"type\":\"object\",\"properties\":{\"text\":{\"type\":\"string\"},\"type\":{\"type\":\"string\"},\"context\":{\"type\":\"string\"},\"pageNumber\":{\"type\":\"number\"}}}}},\"required\":[\"documentInfo\"],\"additionalProperties\":false}",
"resource": "document",
"inputType": "file",
"operation": "extract",
"binaryPropertyName": "data"
},
"typeVersion": 1
},
{
"id": "analyze-citations",
"name": "Analyze & Validate Citations",
"type": "n8n-nodes-base.code",
"notes": "Process citation data",
"position": [
850,
300
],
"parameters": {
"jsCode": "// Process and validate citations\nconst citations = $input.first().json.data;\nconst citationAnalysis = {\n summary: {\n totalCitations: 0,\n caseLaw: citations.caseCitations?.length || 0,\n statutes: citations.statuteCitations?.length || 0,\n regulations: citations.regulatoryCitations?.length || 0,\n academic: citations.academicCitations?.length || 0,\n other: citations.otherCitations?.length || 0\n },\n validation: {\n invalidCitations: [],\n warnings: []\n },\n academicDOIs: [],\n citationNetwork: {}\n};\n\n// Calculate total\ncitationAnalysis.summary.totalCitations = \n citationAnalysis.summary.caseLaw + \n citationAnalysis.summary.statutes + \n citationAnalysis.summary.regulations + \n citationAnalysis.summary.academic + \n citationAnalysis.summary.other;\n\n// Validate case citations\nif (citations.caseCitations) {\n citations.caseCitations.forEach((cite, index) => {\n // Check for required fields\n if (!cite.reporter || !cite.volume || !cite.page) {\n citationAnalysis.validation.invalidCitations.push({\n type: 'case',\n index,\n citation: cite.caseName,\n issue: 'Missing reporter, volume, or page'\n });\n }\n \n // Build citation network (record where each cited case appears in this document)\n if (!citationAnalysis.citationNetwork[cite.caseName]) {\n citationAnalysis.citationNetwork[cite.caseName] = {\n citedIn: [citations.documentInfo?.title],\n pageNumbers: [cite.pageNumber]\n };\n } else {\n // Repeat citation of the same case: keep the additional page number\n citationAnalysis.citationNetwork[cite.caseName].pageNumbers.push(cite.pageNumber);\n }\n });\n}\n\n// Validate statute citations\nif (citations.statuteCitations) {\n citations.statuteCitations.forEach((cite, index) => {\n if (!cite.title || !cite.section) {\n citationAnalysis.validation.invalidCitations.push({\n type: 'statute',\n index,\n citation: `${cite.title} ${cite.code}`,\n issue: 'Missing title or section'\n });\n }\n });\n}\n\n// Extract DOIs for academic fetching\nif (citations.academicCitations) {\n citations.academicCitations.forEach(cite => {\n if (cite.doi) {\n citationAnalysis.academicDOIs.push(cite.doi);\n } else {\n // No DOI: record a warning so the paper can be looked up manually\n citationAnalysis.validation.warnings.push({\n type: 'academic',\n citation: cite.title,\n warning: 'No DOI found - manual search may be needed'\n });\n }\n });\n}\n\n// Analyze citation patterns\nconst citationPatterns = {\n mostCitedCases: [],\n primaryAuthorities: [],\n secondaryAuthorities: []\n};\n\n// Identify primary authorities (statutes and binding cases)\ncitationPatterns.primaryAuthorities = [\n ...citations.statuteCitations?.map(c => `${c.title} ${c.code} § ${c.section}`) || [],\n ...citations.caseCitations?.filter(c => c.court?.includes('Supreme'))?.map(c => c.caseName) || []\n];\n\n// Identify secondary authorities\ncitationPatterns.secondaryAuthorities = \n citations.academicCitations?.map(c => `${c.authors}, ${c.title}`) || [];\n\nreturn [{\n json: {\n originalData: citations,\n analysis: citationAnalysis,\n patterns: citationPatterns,\n doisToFetch: citationAnalysis.academicDOIs.join(','),\n processedAt: new Date().toISOString()\n }\n}];"
},
"typeVersion": 1
},
{
"id": "has-dois",
"name": "Has Academic DOIs?",
"type": "n8n-nodes-base.if",
"position": [
1050,
300
],
"parameters": {
"conditions": {
"string": [
{
"value1": "={{ $json.doisToFetch }}",
"operation": "isNotEmpty"
}
]
}
},
"typeVersion": 1
},
{
"id": "pdfvector-fetch",
"name": "PDF Vector - Fetch Papers",
"type": "n8n-nodes-pdfvector.pdfVector",
"notes": "Retrieve academic papers",
"position": [
1250,
250
],
"parameters": {
"ids": "={{ $json.doisToFetch }}",
"fields": [
"title",
"abstract",
"authors",
"year",
"doi",
"pdfURL",
"totalCitations"
],
"resource": "academic",
"operation": "fetch"
},
"typeVersion": 1
},
{
"id": "generate-report",
"name": "Generate Citation Report",
"type": "n8n-nodes-base.code",
"notes": "Create final report",
"position": [
1450,
300
],
"parameters": {
"jsCode": "// Generate comprehensive citation report\nconst citationData = $node['Has Academic DOIs?'].json;\nconst academicPapers = $json.publications || [];\n\n// Create citation report\nlet report = `# Legal Citation Analysis Report\\n\\n`;\nreport += `**Document:** ${citationData.originalData.documentInfo.title}\\n`;\nreport += `**Type:** ${citationData.originalData.documentInfo.documentType}\\n`;\nreport += `**Date:** ${citationData.originalData.documentInfo.date}\\n\\n`;\n\nreport += `## Citation Summary\\n\\n`;\nreport += `- **Total Citations:** ${citationData.analysis.summary.totalCitations}\\n`;\nreport += `- **Case Law:** ${citationData.analysis.summary.caseLaw}\\n`;\nreport += `- **Statutes:** ${citationData.analysis.summary.statutes}\\n`;\nreport += `- **Regulations:** ${citationData.analysis.summary.regulations}\\n`;\nreport += `- **Academic:** ${citationData.analysis.summary.academic}\\n`;\nreport += `- **Other:** ${citationData.analysis.summary.other}\\n\\n`;\n\n// Add validation issues\nif (citationData.analysis.validation.invalidCitations.length > 0) {\n report += `## Citation Issues\\n\\n`;\n citationData.analysis.validation.invalidCitations.forEach(issue => {\n report += `- **${issue.type}:** ${issue.citation} - ${issue.issue}\\n`;\n });\n report += `\\n`;\n}\n\n// Add case law section\nif (citationData.originalData.caseCitations?.length > 0) {\n report += `## Case Law Citations\\n\\n`;\n citationData.originalData.caseCitations.forEach(cite => {\n report += `### ${cite.caseName}\\n`;\n report += `- **Citation:** ${cite.volume} ${cite.reporter} ${cite.page} (${cite.year})\\n`;\n report += `- **Court:** ${cite.court || 'Not specified'}\\n`;\n report += `- **Context:** ${cite.context}\\n`;\n report += `- **Page:** ${cite.pageNumber}\\n\\n`;\n });\n}\n\n// Add statute section\nif (citationData.originalData.statuteCitations?.length > 0) {\n report += `## Statutory Citations\\n\\n`;\n citationData.originalData.statuteCitations.forEach(cite => {\n report += `- **${cite.title} ${cite.code} § ${cite.section}**${cite.subsection ? ` (${cite.subsection})` : ''}\\n`;\n report += ` - Context: ${cite.context}\\n`;\n report += ` - Page: ${cite.pageNumber}\\n\\n`;\n });\n}\n\n// Add academic section with fetched data\nif (citationData.originalData.academicCitations?.length > 0) {\n report += `## Academic Citations\\n\\n`;\n citationData.originalData.academicCitations.forEach(cite => {\n report += `### ${cite.title}\\n`;\n report += `- **Authors:** ${cite.authors}\\n`;\n report += `- **Journal:** ${cite.journal}, Vol. ${cite.volume}, p. ${cite.page} (${cite.year})\\n`;\n \n // Add fetched paper data if available (match only when the citation has a DOI)\n const fetchedPaper = cite.doi ? academicPapers.find(p => p.doi === cite.doi) : undefined;\n if (fetchedPaper) {\n report += `- **Citations:** ${fetchedPaper.totalCitations || 0}\\n`;\n report += `- **Abstract Available:** ${fetchedPaper.abstract ? 'Yes' : 'No'}\\n`;\n if (fetchedPaper.pdfURL) {\n report += `- **Full Text:** [Available](${fetchedPaper.pdfURL})\\n`;\n }\n }\n \n report += `- **Context:** ${cite.context}\\n`;\n report += `- **Page:** ${cite.pageNumber}\\n\\n`;\n });\n}\n\n// Add citation patterns\nreport += `## Citation Analysis\\n\\n`;\nreport += `### Primary Authorities\\n`;\ncitationData.patterns.primaryAuthorities.forEach(auth => {\n report += `- ${auth}\\n`;\n});\nreport += `\\n### Secondary Authorities\\n`;\ncitationData.patterns.secondaryAuthorities.forEach(auth => {\n report += `- ${auth}\\n`;\n});\n\nreturn [{\n json: {\n report,\n citationData,\n academicPapers,\n exportFormat: 'markdown',\n generatedAt: new Date().toISOString()\n }\n}];"
},
"typeVersion": 1
},
{
"id": "save-report",
"name": "Save Citation Report",
"type": "n8n-nodes-base.writeBinaryFile",
"notes": "Export report",
"position": [
1650,
300
],
"parameters": {
"fileName": "=citation_report_{{ $now.format('yyyy-MM-dd_HH-mm') }}.md",
"fileContent": "={{ $json.report }}"
},
"typeVersion": 1
}
],
"connections": {
"manual-trigger": {
"main": [
[
{
"node": "google-drive",
"type": "main",
"index": 0
}
]
]
},
"has-dois": {
"main": [
[
{
"node": "pdfvector-fetch",
"type": "main",
"index": 0
}
],
[
{
"node": "generate-report",
"type": "main",
"index": 0
}
]
]
},
"generate-report": {
"main": [
[
{
"node": "save-report",
"type": "main",
"index": 0
}
]
]
},
"pdfvector-fetch": {
"main": [
[
{
"node": "generate-report",
"type": "main",
"index": 0
}
]
]
},
"analyze-citations": {
"main": [
[
{
"node": "has-dois",
"type": "main",
"index": 0
}
]
]
},
"pdfvector-extract": {
"main": [
[
{
"node": "analyze-citations",
"type": "main",
"index": 0
}
]
]
},
"google-drive": {
"main": [
[
{
"node": "pdfvector-extract",
"type": "main",
"index": 0
}
]
]
}
}
}
Frequently Asked Questions
How do I use this workflow?
Copy the JSON configuration above, create a new workflow in your n8n instance, choose "Import from JSON", paste the configuration, and adjust the credential settings as needed.
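Before importing, it can help to confirm that every connection in the export points at a node that actually exists. The following is a minimal sketch of such a check, not part of the original template. The function name `findDanglingConnections` and the sample workflow are hypothetical. n8n exports commonly key connections by node name, while the JSON above uses id-like keys, so the sketch accepts either.

```javascript
// Sketch: check that every connection in a workflow export points at a known node.
// Accepts either node ids or node names as keys, since exports may use either.
function findDanglingConnections(workflow) {
  const known = new Set();
  for (const node of workflow.nodes || []) {
    if (node.id) known.add(node.id);
    if (node.name) known.add(node.name);
  }

  const problems = [];
  for (const [source, outputs] of Object.entries(workflow.connections || {})) {
    if (!known.has(source)) problems.push(`unknown source: ${source}`);
    // outputs.main is an array of output branches; each branch is an array of targets.
    for (const branch of outputs.main || []) {
      for (const target of branch || []) {
        if (!known.has(target.node)) {
          problems.push(`unknown target: ${source} -> ${target.node}`);
        }
      }
    }
  }
  return problems;
}

// Tiny hypothetical workflow: "b" is wired to a node "c" that does not exist.
const sampleWorkflow = {
  nodes: [{ id: 'a', name: 'A' }, { id: 'b', name: 'B' }],
  connections: {
    a: { main: [[{ node: 'b', type: 'main', index: 0 }]] },
    b: { main: [[{ node: 'c', type: 'main', index: 0 }]] },
  },
};

const problems = findDanglingConnections(sampleWorkflow);
console.log(problems);
```

To check the real export, parse the pasted JSON with `JSON.parse` and pass the result in place of `sampleWorkflow`. If nodes appear unconnected after import, rewriting the connection keys and `node` targets to the node names is one thing to try.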
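Which scenarios is this workflow suited for?
Intermediate - Document Extraction, AI Summarization
Is it paid?
The workflow itself is completely free to import and use. However, third-party services the workflow relies on (here, the PDF Vector API) may charge usage fees that you pay yourself.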
Workflow Information
Difficulty: Intermediate
Nodes: 8
Categories: 2
Node types: 6
Author: PDF Vector (@pdfvector)
Fully featured PDF APIs for developers - parse any PDF or Word document, extract structured data, and access millions of academic papers - all through simple APIs.