Split PDF By Text From URL Asynchronously - PowerShell
PDF Split sample in PowerShell demonstrating ‘Split PDF By Text From URL Asynchronously’
SplitPdfFromUrlAsynchronously.ps1
# Cloud API asynchronous "Split PDF" job example.
# Allows to avoid timeout errors when processing huge or scanned PDF documents.
# The authentication key (API Key).
# Get your own by registering at https://app.pdf.co
$API_KEY = "***********************************"
# Source PDF file to split
$SourceFileUrl = "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-split/multiple-invoices.pdf"
# Split Search Text
$SplitText = "invoice number"
# (!) Make asynchronous job
$Async = $true
# Prepare URL for `Split PDF By Text` API call
$query = "https://api.pdf.co/v1/pdf/split2"
# Prepare request body (will be auto-converted to JSON by Invoke-RestMethod)
# See documentation: https://apidocs.pdf.co
$body = @{
"searchString" = $SplitText
"url" = $SourceFileUrl
"async" = $Async
} | ConvertTo-Json
try {
# Execute request
$response = Invoke-WebRequest -Method Post -Headers @{ "x-api-key" = $API_KEY; "Content-Type" = "application/json" } -Body $body -Uri $query
$jsonResponse = $response.Content | ConvertFrom-Json
if ($jsonResponse.error -eq $false) {
# Asynchronous job ID
$jobId = $jsonResponse.jobId
# URL of generated JSON file available after the job completion; it will contain URLs of result PDF files.
$resultJsonFileUrl = $jsonResponse.url
# Check the job status in a loop.
do {
$statusCheckUrl = "https://api.pdf.co/v1/job/check?jobid=" + $jobId
$jsonStatus = Invoke-RestMethod -Method Get -Headers @{ "x-api-key" = $API_KEY } -Uri $statusCheckUrl
# Display timestamp and status (for demo purposes)
Write-Host "$(Get-date): $($jsonStatus.status)"
if ($jsonStatus.status -eq "success") {
# Download JSON file with URLs of result PDF files
$jsonPdfUrls = Invoke-RestMethod -Method Get -Headers @{ "x-api-key" = $API_KEY } -Uri $resultJsonFileUrl
# Download generated PDF files
$part = 1;
foreach ($url in $jsonPdfUrls) {
$localFileName = ".\part$($part).pdf"
# Download PDF file
Invoke-WebRequest -Headers @{ "x-api-key" = $API_KEY } -OutFile $localFileName -Uri $url
Write-Host "Downloaded `"$($localFileName)`""
$part++
}
break
}
elseif ($jsonStatus.status -eq "working") {
# Pause for a few seconds
Start-Sleep -Seconds 3
}
else {
Write-Host $jsonStatus.status
break
}
}
while ($true)
}
else {
# Display service reported error
Write-Host $jsonResponse.message
}
}
catch {
# Display request error
Write-Host $_.Exception
}
run.bat
@echo off
powershell -NoProfile -ExecutionPolicy Bypass -Command "& .\SplitPdfFromUrlAsynchronously.ps1"
echo Script finished with errorlevel=%errorlevel%
pause
PDF.co Web API: the Web API with a set of tools for documents manipulation, data conversion, data extraction, splitting and merging of documents. Includes image recognition, built-in OCR, barcode generation and barcode decoders to decode bar codes from scans, pictures and pdf.
Download Source Code (.zip)
return to the previous page explore PDF Split endpoint
Copyright © 2016 - 2024 PDF.co