Examples:
- `nano-pdf edit deck.pdf 5 "Update the revenue chart to show Q3 at $2.5M"`
- `nano-pdf add deck.pdf 15 "Create an executive summary slide with 5 bullet points"`
Features:
- Edit multiple pages in parallel
- Add entirely new slides that match your deck's style
- Google Search enabled by default so the model can look up current data
- Preserves text layer for copy/paste and search
It can work with any kind of PDF but I expect it would be most useful for a quick edit to a deck or something similar.
Does this mean the text only pdf page is transformed into an image that covers the full page, but the text is still under there. So, any machine based extraction would still get the text, but would probably loose all the bounding box information and regular users cannot just use their mouse to select text anymore?
My Text to Speech app uses bounding box to display what text in PDF is being read and would not work well PDF's from this project.
Many thanks to humanity for failing to standardise PDF and this project for paying interest on that tech debt with datacenter levels of energy consumption.
Has anyone given any it a go? Does it work?
I haven’t tried it, but there are plenty of examples.
But people here are probably also looking for example input and output PDFs (or images/screenshots) showing the actual work done to get a sense of what to expect.
Basically, .avif is an "animated image" format, like .gif, but .webm is only a video format.
edit: just realized .webp i think can be an animated image! So that seems like the alternative
PS: in my quick test of editing a PDF text -- the output PDF had weirdly added an extra "&" symbol at the end of every existing line of text. will try out more to see if it was something in the input PDF that was causing it.
After several iterations of edits, would the image quality decrease?
I wish an agent with a validation and rendering tools could instead manipulate the structure to accomplish those edits way less destructively, checking its progress with the tools.