PDF To XLS
Convert PDF and scanned images to spreadsheet with layout and fonts preserved.
Available Methods
[POST] /pdf/convert/to/xls (xls output)
Auto classification Of Incoming Documents
Use /pdf/classifier
(Document Classifier) endpoint to automatically sort / detect the class of the document based on keywords-based rules. For example, you can define rules to find which vendor provided the document to find which template to apply accordingly.
Parameters
url
required. URL to the source file. Supports links from Google Drive, Dropbox and from built-in PDF.co files storage.
For uploading files via API please check Files Upload section.
If you are randomly gettingToo Many Requests
orAccess Denied
error for your input url, please try to addcache:
to enable built-in url caching.
You can also encrypt data for output files and decrypt data input files with user-controlled data encryption (uses strongAES
encryption with your own keys). Click here to learn more.httpusername
optional - http auth user name if required to access sourceurl
.httppassword
optional - http auth password if required to access sourceurl
.pages
optional. Comma-separated list of page indices (or ranges) to process. IMPORTANT: the very first page starts at0
(zero). To set a range use the dash-
, for example:0,2-5,7-
. To set a range from index to the last page use range like this:2-
(from page #3 as the index starts at zero and till the of the document). For ALL pages just leave this param empty. Example:0,2-5,7-
means first page, then 3rd page to 6th page, and then the range from 8th (index =7
) page till the end of the document. Must be a String.unwrap
optional. Unwrap lines into a single line within table cells whenlineGrouping
is enabled. Must be one of:true
,false
.rect
optional. Defines coordinates for extraction, e.g.51.8, 114.8, 235.5, 204.0
. You can use PDF.co PDF Viewer with coordinates to easily select and copy coordinates. Must be a String.lang
optional. Sets language for OCR (text from image) to use for scanned PDF, PNG, JPG documents input when extracting text. Default is “eng”. Other languages are also supported:deu
,spa
,chi_sim
,jpn
and many others (full list of supported OCR languages is here. You can also use 2 languages simultaneously like this:eng+deu
orjpn+kor
(any combination).inline
optional. Must be one of:true
to return data as inline orfalse
to return link to output file (default).lineGrouping
optional. optional. Line grouping within table cells. Set to1
to enable the grouping. Must be a String.async
optional. Runs processing asynchronously. ReturnsJobId
that you may use with/job/check
to check state of the background job (possible states:working
,failed
,aborted
andsuccess
). Must be one of:true
,false
.name
optional. File name for generated output. Must be a String.expiration
optional. Output link expiration in minutes. Default is60
(i.e. 60 minutes or 1 hour). After this delay generated output file(s) (if any) will be auto-removed from PDF.co temporary files storage. Max allowed expiration period depends on your current subscription plan. To store permanent input files (e.g. re-usable images, pdf, documents), please use PDF.co built-in Files Storage instead.profiles
optional. Must be a String. You can set additional and extra options using this parameter that allows you to set custom configuration. For example, to change the CSV separator for PDF to CSV, set this property to the following string:{ 'CSVSeparatorSymbol': ';' }
. See profiles samples for more examples.- Method: POST
- URL: /v1/pdf/convert/to/xls
Query parameters
No query parameters accepted.
Body payload
{
"url": "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-to-excel/sample.pdf",
"async": false
}
Example responses
/pdf/convert/to/xls
{
"url": "https://pdf-temp-files.s3.amazonaws.com/544e26441de7451ab287152b7159d350/sample.xls",
"pageCount": 1,
"error": false,
"status": 200,
"name": "sample.xls",
"remainingCredits": 60572
}
Code Snippet
CURL
curl --location --request POST 'https://api.pdf.co/v1/pdf/convert/to/xls' \
--header 'x-api-key: ' \
--data-raw '{
"url": "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-to-excel/sample.pdf",
"async": false
}'
Samples
- AWS Lambda - Convert PDF To XLS From URL (Node.js)
- C# - Advanced Conversion Options
- C# - Advanced Conversion Options With Rotated Input
- C# - Convert PDF To XLS From URL
- C# - Convert PDF To XLS From URL Asynchronously
- C# - Convert PDF To XLS From Uploaded File
- C# - Convert PDF To XLS From Uploaded File (WinForms)
- Java - Advanced Conversion Options
- Java - Advanced Conversion Options With Rotated Input
- Java - Convert PDF To XLS From URL
- Java - Convert PDF To XLS From Uploaded File
- JavaScript - Advanced Conversion Options
- JavaScript - Advanced Conversion Options With Rotated Input
- JavaScript - Convert PDF To XLS From URL (Node.js)
- JavaScript - Convert PDF To XLS From URL (Node.js) - Async API
- JavaScript - Convert PDF To XLS From Uploaded File (Node.js)
- JavaScript - Convert PDF To XLS From Uploaded File (Node.js) - Async API
- PowerShell - Advanced Conversion Options
- PowerShell - Advanced Conversion Options With Rotated Input
- PowerShell - Convert PDF To XLS From URL
- PowerShell - Convert PDF To XLS From URL Asynchronously
- PowerShell - Convert PDF To XLS From Uploaded File
- VB.NET - Advanced Conversion Options
- VB.NET - Advanced Conversion Options With Rotated Input
- VB.NET - Convert PDF To XLS From URL
- VB.NET - Convert PDF To XLS From Uploaded File
- VB.NET - Convert PDF To XLS From Uploaded File (WinForms)
- cURL - Convert PDF to XLS
Copyright © 2016 - 2023 PDF.co