I am interested in retrieving a large number (millions) of Binary resources for an NLP related project, and my understanding is that the bulk export feature is not available for the Binary resource.
I've developed a program to retrieve the Binary resources of interest sequentially based on id (and in parallel using multi-tasking/threading), but I am looking for ways to speed up this process.
Currently I can retrieve a Binary resource on average from 750 milliseconds to up to a few seconds. I am using caching where appropriate so I don't need to keep retrieving the access token from the authorization server (unless it times out).
Anyone have experience in either (1) speeding up the process, or (2) retrieving large chunks of Binary resources at a time without the Bulk export feature?
Thanks…