Download a pdf direct to memory to use it with python

220 Views Asked by Luca2501 At 29 March 2023 at 11:32

The goal is to download a PDF file via requests (Python) without saving it to the hard disk. Then I'd like to access it with PdfReader from PyPDF2, again without saving it.

def readFile(self, id):
    req = get(f'{self.workUrl}{id}/content', headers={'OTCSTicket': self.ticket})
    if req.status_code == 200: return req.raw
    else: raise Exception(f'Error Code {req.status_code}')

obj = server.readFile(id)
reader = PdfReader(obj)


There is 1 best solution below

Robert Fisher On 07 April 2023 at 02:25 BEST ANSWER

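Instead of simply returning the raw object, you can wrap the downloaded bytes (req.content) in io.BytesIO. This creates an in-memory, seekable file-like object that you can pass straight to PdfReader. PdfReader needs to seek within the stream, which the raw response object does not support.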

Like this:

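import io

import requests
from PyPDF2 import PdfReader

def readFile(self, id):
    # With the default stream=False, the whole body is available in req.content
    req = requests.get(
        url=f'{self.workUrl}{id}/content',
        headers={'OTCSTicket': self.ticket}
    )
    if req.ok:
        # Wrap the bytes in a seekable in-memory file-like object
        return io.BytesIO(req.content)
    raise Exception(f'Error Code: {req.status_code}')

reader = PdfReader(server.readFile(id))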
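
As a possible extra safeguard, here is a minimal sketch of a helper that checks the downloaded bytes before wrapping them. It assumes the server may sometimes answer with something other than a PDF (for example an HTML error page) despite a successful status code, and that a valid PDF starts with the `%PDF-` header. The names `to_pdf_stream` and `PDF_MAGIC` are hypothetical and not part of requests or PyPDF2.

```python
import io

# Assumption: a well-formed PDF begins with this header (strict check).
PDF_MAGIC = b"%PDF-"


def to_pdf_stream(data: bytes) -> io.BytesIO:
    """Wrap downloaded bytes in a seekable in-memory stream.

    Raises ValueError if the payload does not start with the PDF header.
    """
    if not data.startswith(PDF_MAGIC):
        # Show the first few bytes to help diagnose what the server sent
        raise ValueError(f"Payload does not look like a PDF: {data[:20]!r}")
    return io.BytesIO(data)


# Hypothetical usage inside readFile from the answer above:
#     return to_pdf_stream(req.content)
```

The check is deliberately strict. Some PDF readers tolerate a few junk bytes before the header, so relax it if your server produces such files.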
