Here's a link to a ZIP download of all of the documents listed in the pastebin at https://pastebin.com/VHiDdX3V .
Four of the links gave 404s, so those files are not included. It'll take me a bit to work out which ones; I'll update this comment later today.
EDIT: This is the list of files that 404ed:
https://archive.org/download/gov.uscourts.nysd.447706/gov.uscourts.nysd.447706.31.0.pdf
https://archive.org/download/gov.uscourts.nysd.447706/gov.uscourts.nysd.447706.66.0.pdf
https://archive.org/download/gov.uscourts.nysd.447706/gov.uscourts.nysd.447706.82.0.pdf
https://archive.org/download/gov.uscourts.nysd.447706/gov.uscourts.nysd.447706.136.0.pdf
I wrote my download script in a bit of a hacky way, so the files are organized by URL, with each "/" making a subfolder.
It's not ideal, but I did it fast.
https://mega.nz/file/yeIEEa6B#Ql2XFgBLmujtw12QOV6p8913GK7OofnL9hSNXSYkRlM
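For anyone wondering how the folder layout in that ZIP comes about, here is a rough sketch of the "each / makes a subfolder" idea. This is not the actual download script (that wasn't posted); the function name `url_to_relpath` and the choice to use the host name as the top-level folder are my own assumptions.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def url_to_relpath(url):
    """Map a URL to a relative path, one folder per "/" in the URL.

    Hypothetical helper: the host becomes the top-level folder and
    each path segment becomes a subfolder (the last one is the file).
    """
    parsed = urlparse(url)
    # Drop empty segments and "." / ".." so the result stays inside
    # the download folder.
    segments = [s for s in parsed.path.split("/") if s not in ("", ".", "..")]
    return PurePosixPath(parsed.netloc, *segments)


if __name__ == "__main__":
    url = ("https://archive.org/download/gov.uscourts.nysd.447706/"
           "gov.uscourts.nysd.447706.31.0.pdf")
    print(url_to_relpath(url))
```

A real downloader would then create the parent folders and write the response body to that path; that part is left out here.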
I took your mega upload and combined all 122 PDFs into one 1796 page searchable PDF using Adobe Acrobat.
HERE
Thx!
Can't copy out the redacted info from these? It just shows gibberish for the redacted parts when I paste into Notepad++.
I pasted into WordPad and it worked fine. There's an example in my last post.
Someone says I'm still missing some PDFs though.
I'm on mobile, otherwise I'd do it, but can you grep through all documents and create a word list sorted by how many times a word appears? ( sort | uniq -c | sort -n ) It might be an easy way to find names if not redacted. You could also look for sentences that have capitalized words in the middle to find names and places.
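If the text has already been pulled out of the PDFs into a plain string, the two ideas above could look roughly like this in Python. It's only a sketch: the function names, the word regex, and the naive sentence splitting on ". ", "! " and "? " are my assumptions, and getting the text out of the PDFs in the first place is not covered.

```python
import re
from collections import Counter

# A "word" here is a run of letters, optionally with ' or - inside.
WORD_RE = re.compile(r"[A-Za-z][A-Za-z'-]*")


def word_counts(text):
    """Case-insensitive word frequencies, like sort | uniq -c."""
    return Counter(w.lower() for w in WORD_RE.findall(text))


def mid_sentence_capitals(text):
    """Count capitalized words that are not the first word of a sentence."""
    counts = Counter()
    # Naive split: sentence-ending punctuation followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text):
        words = WORD_RE.findall(sentence)
        for w in words[1:]:  # skip the sentence-initial word
            if w[0].isupper():
                counts[w] += 1
    return counts


if __name__ == "__main__":
    sample = "The witness met Alice in Paris. Later Alice flew to London."
    # Ascending by count, like the final `sort -n`.
    for word, n in sorted(word_counts(sample).items(), key=lambda kv: kv[1]):
        print(n, word)
    print(mid_sentence_capitals(sample).most_common())
```

The second function will also pick up things like "I" and abbreviations, so the output is a list of candidates to check by hand, not a list of names.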
Extracting all of the text from the PDFs is a bit outside of my wheelhouse, but I might take a crack at it later on
Thanks for the easy download.
No, fast IS ideal. Props man.
Active torrent with all the documents. See my post https://thedonald.win/p/GbotVCR3/archived-giuffre-v-maxwell-docs/