DOGE employee
cm0002@lemmy.world to Programmer Humor@programming.dev · 2 days ago · image (lemmy.world) · 98 comments · cross-posted to: programmerhumor@lemmy.ml
lime!@feddit.nu · 10 hours ago (edited)
$ pandoc doc.pdf -o doc.txt
Edit: welp, pandoc can’t do that. pdftotext it is.
mexicancartel@lemmy.dbzer0.com · 13 hours ago (edited)
$ magick file.jpg file.html
ImageMagick be converting anything into anything. (Actually, in this case it makes an HTML file and a PNG file; the HTML file references the PNG, and the HTML page displays it.)
lime!@feddit.nu · 11 hours ago
not really a good way to get the text out of a pdf though. then again, turns out neither is pandoc.
stetech@lemmy.world · 17 hours ago
I thought pandoc didn’t support converting from PDF, only to it?!
lime!@feddit.nu · 11 hours ago
damn it, you’re right. should probably have checked that…
stetech@lemmy.world · 11 hours ago
Don’t worry, I didn’t know either and had to check too :P