REST API to access the converter

Communication with the converter goes through a REST API that requires no authentication. The sequence is as follows:

diagram.png

Some of the parameters below have been preserved for compatibility with previous versions of the doconverter service.

POST

To create a new job:

Server | Type | Endpoint
Prod | Upload file | https://doconverter.web.cern.ch/doconverter/api/v1.0/uploads
Dev | Upload file | https://dev-doconverter.web.cern.ch/doconverter/api/v1.0/uploads

When a new document is uploaded to the doconverter server, several parameters must be specified:

  • uploadedfile: the file to convert, uploaded as multipart/form-data.
  • converter: the format to convert to, e.g. pdf, pdfa, ps, thumb*...
  • dirresponse: where you expect the converted file to be stored; it will be used when sending back your converted document.
  • urlresponse: the callback URL that the service will use to contact you, e.g. https://mydomain.ch/api/receivefiles
  • options: special options that depend on the type of document conversion.

An example can be found in the functional test module of the doconverter project, see GitHub.
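The upload described above can be sketched in Python with only the standard library. This is a minimal illustration, not the project's own client: the helper names `build_multipart` and `submit_job` are hypothetical, the response format is not documented here so it is returned as raw text, and omitting `options` when it is empty is an assumption.

```python
import os
import urllib.request
import uuid

# Dev endpoint taken from the endpoint table above; use the Prod URL for real jobs.
UPLOAD_URL = "https://dev-doconverter.web.cern.ch/doconverter/api/v1.0/uploads"


def build_multipart(fields, filename, file_bytes):
    """Encode text fields plus one file part named 'uploadedfile' as multipart/form-data."""
    boundary = uuid.uuid4().hex
    chunks = []
    for name, value in fields.items():
        chunks.append((
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'
        ).encode("utf-8"))
    chunks.append((
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="uploadedfile"; filename="{filename}"\r\n'
        f'Content-Type: application/octet-stream\r\n\r\n'
    ).encode("utf-8"))
    chunks.append(file_bytes)
    chunks.append(f'\r\n--{boundary}--\r\n'.encode("ascii"))
    return b"".join(chunks), f"multipart/form-data; boundary={boundary}"


def submit_job(path, converter, dirresponse, urlresponse, options=""):
    """POST a file to the converter; returns (HTTP status, raw response text)."""
    with open(path, "rb") as handle:
        file_bytes = handle.read()
    fields = {
        "converter": converter,
        "dirresponse": dirresponse,
        "urlresponse": urlresponse,
    }
    if options:  # assumption: the field can be left out when no options apply
        fields["options"] = options
    body, content_type = build_multipart(fields, os.path.basename(path), file_bytes)
    request = urllib.request.Request(
        UPLOAD_URL, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        return response.status, response.read().decode("utf-8", errors="replace")
```

A call such as `submit_job("report.docx", "pdf", "/my/target/dir", "https://mydomain.ch/api/receivefiles")` would then queue a conversion; the file name and directory here are placeholders.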

Input File | Conversion output | Options | Expected result | Comments
Office 2019, Autocad(cdx), OpenOffice, rtf, htm, bmp, jpg, tif | converter=[pdf,pdfa,ps] | N/A | pdf file | N/A
Special case for Word documents (e.g. doc, docx) | converter=[pdf,pdfa] | hidedocumentrevisions=[false,true] | pdf file | You may want comments to be visible in your pdf; by default they are not
PDF file | converter=toimg | typeofimg=[jpeg,bmp,tiff,png]:imgresh=200:imgresv=200* | zip or tif file | Depending on the chosen image format, either a tif file or a zip file containing all the numbered pages of the document will be sent back. E.g. original file mypresentation.pdf -> zip: mypresentation.zip or tif: mypresentation.tif
PDF file | converter=thumb | imgresh=200:imgresv=200:imgheight=300:imgwidth=300** | png file | If the original file was called mydocument.pdf -> mydocument1.png
PLT file | converter=hpgl | color=[true,false] | pdf file | N/A
tif, png, jpg | converter=tesocr | N/A | searchable pdf file | It uses tesseract-ocr as its engine, which only supports image files
pdf, tif | converter=modiocr | language=[english,french] | searchable pdf file | N/A

Info

* imgresh, imgresv: should be one of '72x72', '100x100', '150x150', '200x200', '300x300', '400x400', '600x600', '1200x1200'.
** imgresh, imgresv, imgheight, imgwidth: the resolution should be one of '72x72', '100x100', '150x150', '200x200', '300x300', '400x400', '600x600', '1200x1200', and imgheight and imgwidth should be integers in pixels, e.g. thumb_200_200_150_150.
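The option strings in the table follow a `key=value` pattern joined by colons. A small helper for building them might look like the sketch below. The names `format_options` and `is_allowed_resolution` are hypothetical, and reading the allowed values as an `imgresh` x `imgresv` pair is an assumption about how the service interprets them.

```python
# Resolution pairs listed in the note above.
ALLOWED_RESOLUTIONS = (
    "72x72", "100x100", "150x150", "200x200",
    "300x300", "400x400", "600x600", "1200x1200",
)


def format_options(options):
    """Join options as key=value pairs separated by ':' as in the table above."""
    parts = []
    for key, value in options.items():
        if isinstance(value, bool):
            value = "true" if value else "false"  # the table uses lowercase booleans
        parts.append(f"{key}={value}")
    return ":".join(parts)


def is_allowed_resolution(imgresh, imgresv):
    """Assumption: the listed values describe an imgresh x imgresv pair."""
    return f"{imgresh}x{imgresv}" in ALLOWED_RESOLUTIONS


thumb_options = format_options(
    {"imgresh": 200, "imgresv": 200, "imgheight": 300, "imgwidth": 300}
)
```

Here `thumb_options` reproduces the option string shown for `converter=thumb`, and the same helper covers `hidedocumentrevisions` and `color` by mapping Python booleans to `true` and `false`.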

Callback: receive your job results

You will receive the result of your submitted job at the callback URL: on success, a file in the expected format; on failure, an error message. The callback contains the following fields:

  • directory: where you would like to place your document
  • status: 1 on success, a different number otherwise
  • filename: name of the file provided in the content
  • error_message: in case of an error, an explanation of why it happened
  • content: the expected file, encoded in a multipart/form-data section

An example can be found on GitHub.
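On the receiving side, the callback body has to be decoded before the fields listed above can be used. The sketch below parses a multipart/form-data body with the standard library `email` package and then checks `status`. It assumes that the multipart part names match the field names listed above; `parse_callback`, `extract_result` and `ConversionError` are hypothetical names, and a real service would more likely rely on its web framework's form parser.

```python
from email import message_from_bytes


class ConversionError(Exception):
    """Raised when the callback reports a failed conversion."""


def parse_callback(content_type, body):
    """Parse a multipart/form-data body into {field name: str, or bytes for file parts}."""
    raw = b"Content-Type: " + content_type.encode("ascii") + b"\r\n\r\n" + body
    message = message_from_bytes(raw)
    if not message.is_multipart():
        raise ValueError("expected a multipart/form-data body")
    fields = {}
    for part in message.get_payload():
        name = part.get_param("name", header="content-disposition")
        if name is None:
            continue  # skip parts that are not form fields
        data = part.get_payload(decode=True)
        if part.get_filename() is not None:
            fields[name] = data  # file part: keep the raw bytes
        else:
            fields[name] = data.decode("utf-8")
    return fields


def extract_result(fields):
    """Return (directory, filename, content) on success, raise ConversionError otherwise."""
    if int(fields.get("status", 0)) != 1:
        raise ConversionError(fields.get("error_message", "conversion failed"))
    return fields["directory"], fields["filename"], fields["content"]
```

The caller would pass the request's `Content-Type` header and raw body to `parse_callback`, then write the returned content under the returned directory and file name.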


Last update: October 5, 2021