Need advice about which tool to choose?Ask the StackShare community!
OpenPDF vs Pandoc: What are the differences?
Developers describe OpenPDF as "A free Java library for creating and editing PDF files". OpenPDF is a free Java library for creating and editing PDF files with a LGPL and MPL open source license. OpenPDF is based on a fork of iText. On the other hand, Pandoc is detailed as "A universal document converter". It is a free and open-source document converter, widely used as a writing tool and as a basis for publishing workflows. It converts files from one markup format into another. It can convert documents in (several dialects of) Markdown, reStructuredText, textile, HTML, DocBook, LaTeX, MediaWiki markup, TWiki and many more.
OpenPDF and Pandoc belong to "File Conversion" category of the tech stack.
OpenPDF is an open source tool with 1.38K GitHub stars and 162 GitHub forks. Here's a link to OpenPDF's open source repository on GitHub.
Users are uploading huge PDF files of more than 100MB on our platform. We are creating several tools to manage those files, but keeping the raw file will eat up space, as we are handling several of them. After upload, they will be mainly keep stored for future use.
I am looking for a tool to compress and optimize those PDFs, like a library or an external API that can process that for us.
Thanks
I have been using Ghostscript and Python to get JPEG images from PDF files and that way we have reduced PDF size. But if your average is 100MB then probably those are hi-res images and not sure if your users will accept a quality reduction.
You can store raw files on a cdn service like bunnyCdn. If they want to work with the raw files, you can get it from the cdn service. Compressing is not a persistent solution for space problem. Also it's more safe way, because cdn providers copy your files more than one servers..
Modern cdn solutions have ftp / ssh support so you can easily send files to them...
Pros of OpenPDF
Pros of Pandoc
- Markdown2
- More popular and active on github1